Karkidi
Data Engineer, I
Karkidi, Lincolnshire, Illinois, United States, 60069
A Data Engineer will be responsible for understanding the client's technical requirements, designing andbuilding data pipelines to support the requirements. In this role, the Data Engineer, besides developing thesolution, will also oversee other Engineers' development. This role requires strong verbal and writtencommunication skills and the ability to effectively communicate with the client and internal team. A strongunderstanding of databases, SQL, cloud technologies, and modern data integration and orchestrationtools like Azure Data Factory (ADF), Informatica, and Airflow are required to succeed in this role.Responsibilities:
Play a critical role in the design and implementation of data platforms for the AI products.Develop productized and parameterized data pipelines that feed AI products leveraging GPUs.Develop efficient data transformation code in Spark (in Python and Scala) and Dask.Build workflows to automate data pipelines using Python and Argo.Develop data validation tests to assess the quality of the input data.Conduct performance testing and profiling of the code using a variety of tools and techniques.Build data pipeline frameworks to automate high-volume and real-time data delivery.Operationalize scalable data pipelines to support data science and advanced analytics.Optimize customer data science workloads and manage cloud services costs/utilization.Qualifications:
Minimum Education:
Bachelor's, Master's or Ph.D. Degree in Computer Science or Engineering.Minimum Work Experience:
1+ years of experience programming with at least one of the following languages: Python, Scala, Go.1+ years of experience in SQL and data transformation.1+ years of experience in developing distributed systems using open-source technologies such as Spark and Dask.1+ years of experience with relational databases or NoSQL databases running in Linux environments (MySQL, MariaDB, PostgreSQL, MongoDB, Redis).Key Skills and Competencies:
Experience working with AWS / Azure / GCP environment is highly desired.Experience in data models in the Retail and Consumer products industry is desired.Experience working on agile projects and understanding of agile concepts is desired.Demonstrated ability to learn new technologies quickly and independently.Excellent verbal and written communication skills, especially in technical communications.Ability to work and achieve stretch goals in a very innovative and fast-paced environment.Ability to work collaboratively in a diverse team environment.
#J-18808-Ljbffr
Play a critical role in the design and implementation of data platforms for the AI products.Develop productized and parameterized data pipelines that feed AI products leveraging GPUs.Develop efficient data transformation code in Spark (in Python and Scala) and Dask.Build workflows to automate data pipelines using Python and Argo.Develop data validation tests to assess the quality of the input data.Conduct performance testing and profiling of the code using a variety of tools and techniques.Build data pipeline frameworks to automate high-volume and real-time data delivery.Operationalize scalable data pipelines to support data science and advanced analytics.Optimize customer data science workloads and manage cloud services costs/utilization.Qualifications:
Minimum Education:
Bachelor's, Master's or Ph.D. Degree in Computer Science or Engineering.Minimum Work Experience:
1+ years of experience programming with at least one of the following languages: Python, Scala, Go.1+ years of experience in SQL and data transformation.1+ years of experience in developing distributed systems using open-source technologies such as Spark and Dask.1+ years of experience with relational databases or NoSQL databases running in Linux environments (MySQL, MariaDB, PostgreSQL, MongoDB, Redis).Key Skills and Competencies:
Experience working with AWS / Azure / GCP environment is highly desired.Experience in data models in the Retail and Consumer products industry is desired.Experience working on agile projects and understanding of agile concepts is desired.Demonstrated ability to learn new technologies quickly and independently.Excellent verbal and written communication skills, especially in technical communications.Ability to work and achieve stretch goals in a very innovative and fast-paced environment.Ability to work collaboratively in a diverse team environment.
#J-18808-Ljbffr