Logo
NVIDIA

Senior Software Engineer, Quantized Training

NVIDIA, Santa Clara, California, us, 95053


We are now looking for a Senior Software Engineer for Quantized Training.

We are a team committed to developing next-generation quantized training recipes for Hopper and future GPUs. We are seeking software engineers to help rethink and create tailored solutions to accelerate the discovery of new recipes. This is a coding-heavy role focused on building infrastructure, tooling, and visualizations.The candidate's work directly supports NVIDIA's production SW systems including Megatron-LM and Transformer Engine. The candidate will be part of a core team of engineers and researchers working in lock step to improve quantized training convergence and efficiency.What You'll Be Doing

Create well-tested SW systems and PoCs in support of quantized trainingBuild visualization tools to track and assess the health of model trainingBenchmark internal and external methods for quantized trainingBuild an insights platform for tracking model metrics and benchmarksArchitect CI/CD systems for versioning training recipesParticipate in code reviewsWhat We Need To See

A Masters Degree or PhD or meaningful equivalent experience in Computer Science/Computer Engineering or a related field.5+ years of relevant software development experience.Strong software engineering background with a focus on building concise and well-tested code in C++ and PythonExperience working with ML accelerators and PyTorch or similar frameworksGood foundation in ML training and quantizationStrong written and oral communication skillsWays To Stand Out From The Crowd

Experience with CUDA, performance optimization and debuggingProficient in precision and numerics for MLGPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. Do you love the challenge of influencing the long-term opportunities that expand NVIDIA’s impact on the datacenter and beyond? If so, we want to hear from you!The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.NVIDIA accepts applications on an ongoing basis.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr