Nvidia

Principal Infrastructure SW Engineer, AI Cloud Services

Nvidia, Santa Clara, CA

We are now looking for a Principal Software Engineer, AI Cloud Services Infrastructure:

NVIDIA's Deep Learning Libraries Group is seeking an experienced software engineering leader to accelerate our efforts to bring our world-leading AI optimization technologies to bear as cloud services. In this cross-functional role, your mission will be to empower developers across the world to create AI applications that easily use NVIDIA hardware to its fullest through cloud APIs like TensorRT Cloud and NVIDIA AI Foundation Model Endpoints. Your work will focus on the foundational layers needed to consistently deliver services that remain scalable, reliable, and secure while rapidly evolving, and your impact will span the full breadth of NVIDIA’s hardware products, from Drive AGX for autonomous vehicles to DGX servers for the datacenter. Join our technically diverse team of software engineers and infrastructure experts to expand the accessibility and reach of NVIDIA’s world-leading AI platforms.

What you'll be doing:

- Guide development and operations of cloud services that enable external developers to easily access the latest AI models, optimizations, and serving techniques.
- Lead and directly contribute to implementation of key infrastructure features to enable product goals and improve productivity of internal engineers.
- Mentor engineers to develop their technical skills and ability to make an impact.
- Collaborate with product and engineering leads on feature roadmaps and execution planning.
- Promote and support methodologies that improve efficiency, product quality, security, and scalability.
- Identify and seize opportunities to build common infrastructure that can be shared across various AI-related services.

What we need to see:

- MS or PhD in Computer Science, Computer Engineering, or a closely related field (or Bachelor's with additional equivalent experience).
- 12+ years of relevant experience as a developer, technical lead, and/or engineering manager.
- Proven technical skills in architecting, designing, implementing, and delivering high-quality cloud services.
- Proficiency in one or more programming languages (e.g., Python, TypeScript, Go).
- Proficiency in SW development and DevOps best practices (SW development life cycle, developer workflows, continuous integration, infrastructure as code, etc.).
- Experience building applications or services that incorporate AI.
- Excellent interpersonal skills and a collaborative, pragmatic approach to solving problems.

Ways to stand out from the crowd:

- Experience building and operating publicly accessible services that incorporate AI at scale.
- Strong grasp of the latest trends in AI inference serving and performance optimization.
- Deep knowledge of GPU infrastructure management and/or CUDA applications.
- Experience with multiple major cloud platforms (AWS, Azure, GCP, OCI, etc.).

This is an opportunity to have a wide impact at NVIDIA by expanding our platform and improving development velocity for our unparalleled ecosystem of AI developers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, driven, and love a challenge, come join our team!

The base salary range is 272,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Summary

Location: US, CA, Santa Clara; US, TN, Remote; US, CO, Remote; US, CA, Remote; US, MO, Remote
Type: Full time