Tbwa Chiat/Day Inc
Senior Software Engineer, AI Infra New Milpitas, CA
Tbwa Chiat/Day Inc, Milpitas, California, United States, 95035
We are seeking a highly skilled and experienced Senior Software Engineer to join our innovative AI robotics company, taking our AI infrastructure to the next level. In this role, you will build robust offline workflows including training, validation, and production of the robot. You will work closely with AI Robotics expertise to build the foundations for next-gen robots.
Why it’s interesting:
RoboForce, an AI-Robotics company, is building a first-of-its-kind Robotic Workforce System to take on the most tedious, force-demanding, and dangerous work humans don't have to do.
At RoboForce, we're building a small team with a flat org structure and extreme talent density. Currently, we have top tech leaders from CMU Robotics, Michigan Robotics, Amazon Robotics, Tesla Robotics, Google, Waymo, Apple, and Microsoft. RoboForce is backed by world-class investors, including the Nobel Prize Laureate, Carnegie Mellon University, and beyond.
Responsibilities:
Design and develop Python-based training infrastructure to ensure stable and accelerated model training.
Design and implement software tools for data collection and management, training deep neural networks, and deploying them on RoboForce robots.
Develop and optimize tools for deploying trained neural networks to hardware platforms.
Design and maintain pipelines for running and validating PyTorch models.
Develop, operate, and maintain reliable and scalable backend distributed systems.
Minimum Qualifications:
Bachelor's or Master's degree in Computer Science, Robotics, Engineering, or a related field with at least 8 years of working experience.
Strong proficiency in both C++ and Python.
Decent experience with at least 2 low-level programming languages such as C, C++, or Rust.
Experience with Python and an ML framework, including PyTorch, JAX, TensorFlow, etc.
Experience using and managing data stores, including Postgres, MySQL, ElasticSearch, Redis, etc.
Experience in managing cloud infrastructure, including platforms such as AWS, Azure, and GCP.
Preferred Qualifications:
Experience in profiling and optimizing CPU-GPU interactions, including techniques like pipelining compute and data transfers.
Experience in scaling neural network training jobs across GPU clusters.
Experience with GPU programming with CUDA.
Experience in developing data annotation and dataset management tools.
#J-18808-Ljbffr
#J-18808-Ljbffr