Logo
Marathon TS

Sr. Data Engineer/ETL Engineer with Security Clearance

Marathon TS, Huntsville, Alabama, United States, 35824


Position Overview: Marathon TS is looking for a Senior ETL Engineer to join our team supporting our Federal customer out of Huntsville, AL. This position will provide the ability to make a significant impact to the mission while also allowing the candidate to grow their skills and career. Successful candidate will need to be able to maintain existing software as the transition from server-based to cloud-based occurs. The candidate will work with a GOTS-based Extract, Transform and Load (ETL) system that loads information into Postgres database with billions of rows and deals with a file repository that is in a Petabyte range. In addition, the candidate will be highly involved in the engineering planning and transition of the existing system software / data into a Cloud-based (currently AWS) solution.

Core Responsibilities:

Design, develop, and maintain the ETL (Extract, Transform, Load) processes for master data management (MDM) system.

Build and optimize data pipelines to extract data from various sources, transform it into the required format, and load using Databricks and AWS services.

Collaborate with stakeholders to gather data requirements, understand data sources, and ensure data quality and integrity throughout the ETL process.

Implement data validation, cleansing, and enrichment techniques to improve the accuracy and completeness of data.

Monitor and troubleshoot ETL processes to identify and resolve issues in a timely manner.

Requirements:

Active Top-Secret Clearance with the ability to obtain an SCI.

A Counter Intelligence Polygraph is required after start.

Bachelor's degree in Engineering, Computer Science, or other related analytical, scientific, or technical discipline.

At least 6 years experience with ETL.

Database ETL engineers should have experience with Oracle 11g/12c, Sun Solaris OS, Linux (CentOS, Red Hat), and Windows environments.

Strong proficiency in programming languages such as Scala or Java.

Experience in designing and developing ETL workflows using tools like Apache Spark or AWS Glue.

In-depth knowledge of ETL best practices, data integration techniques, and data quality management.

Familiarity with different data storage technologies and databases, such as Amazon S3 or Amazon Redshift.

MUST have strong UNIX/LINUX experience.

Preferred Qualifications:

Understanding of concepts of Data Lakehouse architecture as well as OpenSearch is a plus.

Experience using software repository tools such as GIT or SVN.

#CJJOBS Marathon TS is committed to the development of a creative, diverse and inclusive work environment. In order to provide equal employment and advancement opportunities to all individuals, employment decisions at Marathon TS will be based on merit, qualifications, and abilities. Marathon TS does not discriminate against any person because of race, color, creed, religion, sex, national origin, disability, age or any other characteristic protected by law (referred to as 'protected status').

#J-18808-Ljbffr