Altea Healthcare

Senior Machine Learning Engineer

Altea Healthcare, Houston, TX, United States

Senior Machine Learning Engineer, MLOps

ALTEA Healthcare is a leading healthcare organization committed to revolutionizing the delivery of outpatient/post-acute care. We are seeking an experienced Machine Learning Engineer to join our team. The ideal candidate will have a strong background in deploying scalable ML models, including predictive/classification and NLP/NLU models. As an important member of the AI team, this person will contribute significantly to designing, implementing, and deploying various AI/ML product features to improve care delivery and quality for post-acute patients.

Responsibilities:

Develop and deploy production-ready ML models, with a focus on scalability and monitoring across a broad range of applications within healthcare
Write efficient, maintainable, and scalable Python code
Collaborate with machine learning scientists and data engineers to translate prototype code to production-ready code
Set up and maintain end-to-end pipelines including data ingress, egress, model inference, and model retraining
Build high-performance deployment architectures and model monitoring systems
Incorporate feedback from cross-functional teams and refine the ML-driven applications through quick iteration cycles
Maintain best practices of MLOps practices within the healthcare industry
Document the system architecture, design decisions, and codebase to facilitate future maintenance and enhancements.

Requirements:

Bachelor’s or Master’s degree in Engineering, Computer Science, or equivalent experience
At least 4-5 years of relevant experience as an MLOps Engineer
4-5+ years of experience doing MLOps, model monitoring, drift detection, and model retraining
Azure Machine Learning Studio
Experience with transformer-based models and NLP, preferably in a healthcare context
Extensive experience with TensorFlow or PyTorch, and familiarity with HuggingFace
Track record of fine-tuning, running large-scale training jobs, and managing model servers like vLLM, TGI, or TorchServe
Strong proficiency in LangChain, vectorDB and cloud platforms (Azure), model experimentation tools like MLflow, and monitoring tools like Grafana/Splunk, and CI/CD like airflow, gitlab, and Big Data management like Spark, Kafka
Ability to work independently and collaboratively, manage priorities, and deliver high-quality results within project timelines

Job Type: Full-time

Pay: Competitive pay and benefits and extremely valuable startup stock options

Schedule: Full Time

Work Location: Hybrid position in Houston

Benefits:

401(k)
Dental insurance
Health insurance
Vision insurance