Logo
S&P Global

Senior Data Engineer - Big Data

S&P Global, New York, New York, us, 10261


The Team:You will be an expert contributor and part of the Rating Organization’s Ingestion Pipelines Engineering Team. This team, who has a broad and expert knowledge on Ratings organization’s critical data domains, technology stacks and architectural patterns, fosters knowledge sharing and collaboration that results in a unified strategy. All Data Services team members provide leadership, innovation, timely delivery, and the ability to articulate business value. Be a part of a unique opportunity to build and evolve S&P Ratings next gen Ingestion pipelines platform.Responsibilities and Impact :The Data Engineer will support our data department on data initiatives and will ensure optimal data delivery architecture is consistent throughout ongoing projectsDesign & Develop “Transformations” aspects using ELT framework to modernize the Ingestion pipelines and build data transformations at scaleExperience in the areas of design and implementation of Ratings Data Ingestion pipelines with modern AWS cloud and other technologies such as S3, Hive, Databricks, Scala, Python and Distributed data processing frameworkBuild processes supporting data transformation, data structures, metadata, dependency and workload management.Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources into SQL Server, MongoDB, and othersIdentify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etcBe in tune with emerging trends Big data and cloud technologies and participate in evaluation of new technologiesEnsure compliance through the adoption of enterprise standards and promotion of best practice / guiding principles aligned with organization standardsCompensation/Benefits Information :S&P Global states that the anticipated base salary range for this position is $95,000 to $166,000. Final base salary for this role will be based on the individual’s geographic location, as well as experience level, skill set, training, licenses and certifications.In addition to base compensation, this role is eligible for an annual incentive plan. This role is eligible to receive additional S&P Global benefits. For more information on the benefits we provide to our employees, please click here.What We’re Looking For:Basic Required Qualifications:BE, MCA or MS degree in Computer Science or Information Technology3+ years of hands-on experience in implementing data lake systems using AWS/Azure cloud technologies such as S3, Databricks, Hive.1+ years of Expertise in building application using Data stream processing tool, APIs and DBMS for building ingestion pipeline for Bulk and incremental data loads.Experience with development frameworks as well as data and integration technologies such Python, ScalaExperience in microservices and API design and implementation, with service-oriented architectures, SOAP and RESTful APIsHands-on experience in developing scalable data pipeline using technologies like Data stream processing tools, Databricks, Distributed data processing framework and Scala applying ETL and ELT conceptsDeep Experience with three or more technologies of Java/J2EE, C#, AWS, Distributed data processing framework, Python, Scala, any RDBMS, Data stream processing tools, Informatica, Angular/ReactJS, Databricks, Cloud-native orchestration toolsExperience with Continuous integration and deployment tools like Jenkins and Azure DevOpsExperience working in UNIX/Linux environment including shell scriptingStrong understanding of cloud native architectures, design patterns and best practicesShould be in position to articulate and convert requirements into solution.Knowledgeable in technology and industry trends with ability to develop and present substantive technical solutionsKnowledge of Agile approaches to software development and able to put key Agile principles into practice to deliver solutions incrementallyQuality first mindset with a strong background and experience developing products for a global audience at scaleExcellent analytical thinking, interpersonal, oral, and written communication skills with strong ability to influence both IT and business partnersAdditional Preferred Qualifications:Experience With Machine Learning Libraries and Frameworks (TensorFlow, MLlib, Pandas, Numpy) is an added advantageMonitors industry trends and directions; develops and presents substantive technical recommendations to senior managementAbility to prioritize and manage work to critical project timelines in a fast-paced environmentFinancial services industry experience