NVIDIA
Solutions Architect, Generative AI
NVIDIA, Santa Clara, California, us, 95053
We are looking for a AI Solution Architect Engineer with experience in Generative AI software development and deployment. As part of the Solution Architect organization, we work with the most exciting computing hardware and software, driving the latest breakthroughs in deep learning and AI with NVIDIA’s key customers. This role offers an excellent opportunity to build your career in the rapidly growing field of AI while working with the world's most successful technology companies. Primary responsibilities will be to lead software customer technical engagements with NVIDIA products and technologies. Join us in this exciting endeavor!
What you’ll be doing:
Develop and demonstrate software solutions based on NVIDIA’s ground breaking AI software and hardware technologies to customers. Develop GenAI model pipeline and perform in-depth analysis and optimization to ensure the best performance on current- and next-generation GPU architectures
Develop and debug software for NVIDIA and OSS AI frameworks and libraries
Lead and develop proof-of-concepts (PoCs) for software solutions applied to Consumer Internet industry use-cases such as NLP/LLM, retrieval, recommender, etc. by working closely with customer's AI developers. Build collateral (notebook/code) for PoCs
Work closely with business development team owning the technical relationship and enabling customer in building innovative solutions based on NVIDIA technologies
Partner with NVIDIA software engineering, product, sales teams to secure design wins at customers. Enable development and growth of NVIDIA product features through customer feedback and PoC evaluations
What we need to see:
BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields or equivalent experience
5+ years of experience as an AI/Software Engineer with proven track record coding in Python and/or C++ with popular AI software libraries and GPUs
Experience with GenAI applications and LLM training/fine-tuning, inference optimization and/or RAG pipelines
Ability to communicate your ideas/code clearly through GitHub, documentation
Great teammate who enjoys collaborating with teams across the organization such as Engineering/Research, Sales, Product, and Marketing
Effective verbal/written communication, and technical presentation skills
Self-starter with passion for growth, enthusiasm for continuous learning and sharing findings across the team
Ways to stand out from the crowd:
Experience working with enterprise developers and customer facing skills
Experience with large-scale production data pipelines and AI model training/deployment
Knowledge of MLOps technologies such as containers, Kubernetes, data center deployments etc.
Able to think creatively to debug and solve complex problems
We make extensive use of conferencing tools, but occasional travel is required for local on-site visit to customers and data science conferences. We are open to remote work location. We look forward to have you join our team!
With highly competitive salaries, a comprehensive benefits package, and an excellent engineering work culture, NVIDIA is widely considered to be one of the technology industry's most desirable employers. NVIDIA has some of the most innovative people working on meaningful problems that are defining the field of ML/DL, data science, robotics, and graphics.
The base salary range is 148,000 USD - 230,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) . NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
What you’ll be doing:
Develop and demonstrate software solutions based on NVIDIA’s ground breaking AI software and hardware technologies to customers. Develop GenAI model pipeline and perform in-depth analysis and optimization to ensure the best performance on current- and next-generation GPU architectures
Develop and debug software for NVIDIA and OSS AI frameworks and libraries
Lead and develop proof-of-concepts (PoCs) for software solutions applied to Consumer Internet industry use-cases such as NLP/LLM, retrieval, recommender, etc. by working closely with customer's AI developers. Build collateral (notebook/code) for PoCs
Work closely with business development team owning the technical relationship and enabling customer in building innovative solutions based on NVIDIA technologies
Partner with NVIDIA software engineering, product, sales teams to secure design wins at customers. Enable development and growth of NVIDIA product features through customer feedback and PoC evaluations
What we need to see:
BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields or equivalent experience
5+ years of experience as an AI/Software Engineer with proven track record coding in Python and/or C++ with popular AI software libraries and GPUs
Experience with GenAI applications and LLM training/fine-tuning, inference optimization and/or RAG pipelines
Ability to communicate your ideas/code clearly through GitHub, documentation
Great teammate who enjoys collaborating with teams across the organization such as Engineering/Research, Sales, Product, and Marketing
Effective verbal/written communication, and technical presentation skills
Self-starter with passion for growth, enthusiasm for continuous learning and sharing findings across the team
Ways to stand out from the crowd:
Experience working with enterprise developers and customer facing skills
Experience with large-scale production data pipelines and AI model training/deployment
Knowledge of MLOps technologies such as containers, Kubernetes, data center deployments etc.
Able to think creatively to debug and solve complex problems
We make extensive use of conferencing tools, but occasional travel is required for local on-site visit to customers and data science conferences. We are open to remote work location. We look forward to have you join our team!
With highly competitive salaries, a comprehensive benefits package, and an excellent engineering work culture, NVIDIA is widely considered to be one of the technology industry's most desirable employers. NVIDIA has some of the most innovative people working on meaningful problems that are defining the field of ML/DL, data science, robotics, and graphics.
The base salary range is 148,000 USD - 230,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) . NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.