Dice

Python Data Engineer

Dice, Houston, Texas, United States, 77246


Dice is the leading career destination for tech experts at every stage of their careers. Our client, Kaizer Software Solutions, is seeking the following. Apply via Dice today!

We are currently seeking an experienced Python Data Engineer to join the Big Data and Advanced Analytics department. As part of the Data Engineering team, the Lead Python Data Engineer will work closely with Business domain experts and Data Scientists to solve real-world oil and gas midstream problems using advanced analytics, machine learning, and artificial intelligence. This individual will provide analytical and technical leadership to the team to advance the data engineering practice within the organization.

Responsibilities include:

- Work directly with Business domain experts and Data Scientists to develop high-quality, reliable, scalable machine learning systems
- Design and implement frameworks and tools to streamline the machine learning process
- Automate manual data collection and processing tasks to improve efficiency
- Leverage software architecture and design patterns to develop fault-tolerant microservices
- Convert research-based machine learning models into production-ready software
- Implement processes to ensure coding standards, code quality, documentation, and test coverage

The successful candidate will meet the following qualifications:

- 10+ years in IT and 7+ years of programming experience in Python
- Expertise in developing and maintaining data pipelines
- Experience in testing, packaging, and deploying machine learning models
- Experience in software engineering practices such as Design Principles and Patterns, Unit Testing, Refactoring, CI/CD, and version control
- Expertise in Object-Oriented Design Principles and Functional Programming Principles
- Experience with common Python Data Engineering packages including pandas, NumPy, PyArrow, pytest, scikit-learn, and Boto3
- Experience in storage technologies including SQL relational databases and Object Storage such as AWS S3
- Experience in implementing distributed computing systems
- Experience in designing modular, reusable software components
- Experience in developing API endpoints and microservices
- Knowledge of MLOps Principles
- Knowledge of ML platform technologies including Apache Airflow, Kubernetes, Dask, Ray, and MLflow
