Saxon Global

Senior Data Engineer

Saxon Global, Malvern, Pennsylvania, United States, 19355


This is a 12-month contract with Vanguard. 3 days onsite in Malvern, PA and 2 days remote. All visa - No H1B.

Job Summary

Must have strong experience with:

AWS Glue
AWS Sagemaker
SQL
Python

Responsibilities

Provides advanced data solutions by using software to process, store, and serve data to others.
Tests data quality and optimizes data availability. Ensures that data pipelines are scalable, repeatable, and secure.
Utilizes a deep dive analytical skillset on a variety of internal and external data.
Ability to process data using Pyspark on EMR or Glue (distributed computing) and transform the data as features for machine learning models.
Ability to understand the data and business requirements for ETL.

Qualifications

Deep technical knowledge, including proficiency in Python, Pyspark, SQL, Hive, Spark, Amazon Web Services / cloud computing (e.g., Elastic MapReduce, EC2, S3), and Bash shell scripting
Proficiency in Glue ETL
Knowledge of Postgres RDS
Basic or advanced experience with Sagemaker and Sagemaker Data Wrangler
Experience in comparing data between two sources for parity checks
Experience writing production-quality code to create data products
Ability to effectively communicate technical concepts to non-technical audiences
Understanding and applied knowledge of Agile delivery methodologies
Writes ETL (Extract / Transform / Load) processes, designs database systems, and develops tools for real-time and offline analytic processing.
Troubleshoots software and processes for data consistency and integrity. Integrates complex and large-scale data from a variety of sources for business partners to generate insight and make decisions.
Translates business specifications into design specifications and code. Responsible for writing complex programs, ad hoc queries, and reports. Ensures that all code is well structured, includes sufficient documentation, and is easy to maintain and reuse.
Partners with internal clients to gain an expert understanding of business functions and informational needs. Works closely with other technical and data analytics experts across the business to implement data solutions.
Leads all phases of solution development. Explains technical considerations at related meetings, including those with internal clients and less experienced team members.
Assesses data quality and tests code thoroughly for accuracy of intended purpose. Provides data analysis guidance and serves as a technical consultant for the client.
Educates and develops junior data engineers on the team while applying quality control to their work. Develops data engineering standards and contributes expertise to other data expert teams across Vanguard.
Tests and implements new software releases through regression testing. Identifies issues and engages with vendors to resolve and elevate software into production.
Participates in special projects and performs other duties as assigned.