Karkidi

Senior/Staff ML Infrastructure Engineer

Karkidi, Mountain View, California, US, 94039


About the role:

We're looking for high-energy, creative, and collaborative candidates who want to work in a fast-paced, execution-oriented team. You will play an essential role in helping accelerate the development and deployment of our AV software stack. This position requires a strong technical background, hands-on software engineering experience, and a knack for solving hard problems.

This role is onsite 4 days a week at our Mountain View, CA office!

What you'll do:

- Own development of ML models end-to-end, from data strategy, initial development, optimization, and production platform validation to fine-tuning based on metrics and on-road performance
- Lead efficient neural network development, including quantization, pruning, sparsification, compression, and novel differentiable compute primitives
- Build the foundation models for the on-vehicle and offline applications
- Develop metrics and tools to analyze errors and understand improvements in our systems
- Train and evaluate DNNs to benchmark neural network optimization algorithms, optimizing for latency and power consumption
- Design and implement a horizontally scalable, high-throughput cloud inference pipeline for evaluation and KPI calculation
- Streamline workflows to allow creation of verified, deployable artifacts from annotated data
- Support data preparation for training by building a horizontally scalable data preparation pipeline that is simple to use and doesn't delay training
- Support development of tools for introspection and visualization to understand what is going well and what can be improved in our work

What we're looking for:

- Bachelor's Degree in Computer Science, Machine Learning, or a relevant field
- Master's Degree with a focus on Machine Learning, Statistics, Optimization, or a related field (preferred), or relevant work experience
- 7+ years of experience working with large ML projects and/or building production ML systems
- Excellent C++, Python, and/or CUDA programming skills
- Familiarity with modern machine learning environments such as PyTorch
- Expert experience with optimization techniques, from high-level ML algorithms to low-level HW utilization
- Experience in software architecture, system performance, latency, and data flow
- Expert experience in machine learning workflows: data sampling and curation, pre-processing, model training, ablation studies, evaluation, deployment, inference optimization
- Strong analytical skills, especially for performance troubleshooting (e.g. profiling, roofline model)
- Industry experience in building large-scale ML pipelines
- Experience with cloud ML training pipelines in Azure (preferred)
- High Performance Computing experience (preferred)

Taking care of our team:

At Gatik, we connect people of extraordinary talent and experience to an opportunity to create a more resilient supply chain and contribute to our environment’s sustainability. We are diverse in our backgrounds and perspectives yet united by a bold vision and shared commitment to our values. Our culture emphasizes the importance of collaboration, respect and agility.
