Senior Software Engineer, Data & ML Infrastructure
Tapjoy, San Francisco, CA
Role Description
The opportunity
The Data & ML Platform team at Unity manages a comprehensive range of data and machine learning systems and tools, covering every stage of the lifecycle—from data ingestion and storage to processing, feature generation, machine learning model training, deployment, and serving. Our work facilitates diverse applications such as simulations, gaming experiences, and improved game monetization. We empower other Unity teams to create exceptional tools and services for our users.
We're seeking a talented Senior Data and ML Infrastructure Engineer to join our team. In this role, you will collaborate with engineers, product specialists, and data experts to research, prototype, and develop world-class applications, services, and pipelines, leveraging over 60 billion daily data events.
What you'll be doing
- Design and optimize large-scale data platforms and machine learning infrastructure systems for efficiency, reliability, and cost-effectiveness.
- Lead improvements in infrastructure tooling by researching, prototyping, and implementing solutions that enhance operational excellence and development agility.
- Implement next-generation MLOps capabilities in our model training, deployment, and serving pipelines.
- Champion infrastructure optimization projects across the company, driving significant cost savings while maintaining or improving service performance.
- Engage in on-call rotations, utilizing your deep understanding of infrastructure to identify and resolve complex issues quickly, minimize impact, and prevent future occurrences through automation and system enhancement.
What we're looking for
- Experience in infrastructure and back-end engineering, with a track record of building robust distributed systems.
- Hands-on, day-to-day experience with machine learning tools for model training, experimentation, deployment, and serving.
- Deep understanding of cloud infrastructure on GCP, AWS, or Azure, and proficiency in Go, Java, or Python.
- Experience with the latest data and machine learning technologies such as Google's Vertex AI, Ray, and SageMaker.
- Experience building Kubernetes infrastructure stacks. Experience in the ML or data domain and its associated technologies is an advantage.
You might also have
- An 'in-it-together' mentality that appreciates engineering as a collective effort.
- Hands-on experience with tools such as Vertex AI, Protobuf, Kafka, Flink, BigQuery, Druid, Feast, Kubeflow, Ray, or KServe.
- Experience with Terraform, ArgoCD, GitHub Actions, and similar tools used for CI/CD.
Additional information
- Relocation support is not available for this position.
- International relocation support is not available for this position.
- Work visa/immigration sponsorship is not available for this position.
Gross salary
$111,000–$211,300 USD