Character.AI

Software Engineer, Machine Learning Infrastructure

Character.AI, New York, New York, US, 10261


About the role

We’re looking for seasoned ML Infrastructure engineers with experience designing, building, and maintaining training and serving infrastructure for ML research.

Responsibilities:

Provide infrastructure support to our ML research and product

Build tooling to diagnose cluster issues and hardware failures

Monitor deployments, manage experiments, and generally support our research

Maximize GPU allocation and utilization for both serving and training

Requirements:

4+ years of experience supporting infrastructure in an ML environment

Experience in developing tools used to diagnose ML infrastructure problems and failures

Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage)

Experience working with GPUs

Nice to have:

Experience with large GPU clusters and high-performance computing/networking

Experience supporting large language model training

Experience with ML frameworks like PyTorch/TensorFlow/JAX

Experience with GPU kernel development

About Character.AI

Founded in 2021, Character is a leading AI company offering personalized experiences through customizable AI 'Characters.' As one of the most widely used AI platforms worldwide, Character enables users to interact with AI tailored to their unique needs and preferences.

In just two years, we achieved unicorn status and were named Google Play's AI App of the Year – a testament to our groundbreaking technology and vision.

Ready to shape the future of Consumer AI?

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.
