Logo
TechnoGen

Lead Data Engineer

TechnoGen, Atlanta, Georgia, United States, 30383


Role: Lead Data Engineer

Client:

Alimentation Couche-Tard

Location: Remote

Duration: 6-month contract

Requirements: Looking for 8+ years of experience

Building pods (about 4-5 different pods in org.) all reporting to Pradeep. This person will support one project at first (could be retail, finance operations, loyalty, etc.) Will have 2-6 Data Engineers under the Lead (still reporting to Pradeep)

Python, SQL and PySpark Azure preferred but open to other (GCP, AWS) Databricks, Snowflake, and Fabric

Batch and real-time streaming - building pipelines Gathering requirements from the business Mentoring some of the Data Engineers on the team - room for growth

Any nice-to-have skills that would make them exceptional?

Certifications , 2+ yrs data architecture, data engineering patters , git repo, self-starter and go-getter attitude, software engineering background , ML experience

Job Description:

As the Technical Lead Data Engineer, your primary responsibility will be to spearhead the design, development, and implementation of data solutions aimed at empowering our organization to derive actionable insights from intricate datasets. You will take the lead in guiding a team of data engineers, fostering collaboration with cross-functional teams, and spearheading initiatives geared towards fortifying our data infrastructure, CI/CD pipelines, and analytics capabilities

Responsibilities:

Apply advanced knowledge of Data Engineering principles, methodologies and techniques to design and implement data loading and aggregation frameworks across broad areas of the organization. Gather and process raw, structured, semi-structured and unstructured data using batch and real-time data processing frameworks. Implement and optimize data solutions in enterprise data warehouses and big data repositories, focusing primarily on movement to the cloud. Drive new and enhanced capabilities to Enterprise Data Platform partners to meet the needs of product / engineering / business. Experience building enterprise systems especially using Databricks, Snowflake and platforms like Azure, AWS, GCP etc Leverage strong Python, Spark, SQL programming skills to construct robust pipelines for efficient data processing and analysis. Implement CI/CD pipelines for automating build, test, and deployment processes to accelerate the delivery of data solutions. Implement data modeling techniques to design and optimize data schemas, ensuring data integrity and performance. Drive continuous improvement initiatives to enhance performance, reliability, and scalability of our data infrastructure. Collaborate with data scientists, analysts, and other stakeholders to understand business requirements and translate them into technical solutions. Implement best practices for data governance, security, and compliance to ensure the integrity and confidentiality of our data assets. Qualifications:

Bachelor's or master's degree in computer science, Engineering, or a related field. Proven experience (8+) in a data engineering role, with expertise in designing and building data pipelines, ETL processes, and data warehouses. Strong proficiency in SQL, Python and Spark programming languages. Strong experience with cloud platforms such as AWS, Azure, or GCP is a must. Hands-on experience with big data technologies such as Hadoop, Spark, Kafka, and distributed computing frameworks. Knowledge of data lake and data warehouse solutions, including Databricks, Snowflake, Amazon Redshift, Google BigQuery, Azure Data Factory, Airflow etc. Experience in implementing CI/CD pipelines for automating build, test, and deployment processes. Solid understanding of data modeling concepts, data warehousing architectures, and data management best practices