Logo
Nvidia

Solutions Architect, Generative AI

Nvidia, New York, NY


We are looking for a AI Solution Architect Engineer with experience in Generative AI software development and deployment. As part of the Solution Architect organization, we work with the most exciting computing hardware and software, driving the latest breakthroughs in deep learning and AI with NVIDIA’s key customers. This role offers an excellent opportunity to build your career in the rapidly growing field of AI while working with the world's most successful technology companies. Primary responsibilities will be to lead software customer technical engagements with NVIDIA products and technologies. Join us in this exciting endeavor!What you’ll be doing:Develop and demonstrate software solutions based on NVIDIA’s ground breaking AI software and hardware technologies to customers. Develop GenAI model pipeline and perform in-depth analysis and optimization to ensure the best performance on current- and next-generation GPU architecturesDevelop and debug software for NVIDIA and OSS AI frameworks and librariesLead and develop proof-of-concepts (PoCs) for software solutions applied to Consumer Internet industry use-cases such as NLP/LLM, retrieval, recommender, etc. by working closely with customer's AI developers. Build collateral (notebook/code) for PoCsWork closely with business development team owning the technical relationship and enabling customer in building innovative solutions based on NVIDIA technologiesPartner with NVIDIA software engineering, product, sales teams to secure design wins at customers. Enable development and growth of NVIDIA product features through customer feedback and PoC evaluationsWhat we need to see:BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields or equivalent experience5+ years of experience as an AI/Software Engineer with proven track record coding in Python and/or C++ with popular AI software libraries and GPUsExperience with GenAI applications and LLM training/fine-tuning, inference optimization and/or RAG pipelinesAbility to communicate your ideas/code clearly through GitHub, documentationGreat teammate who enjoys collaborating with teams across the organization such as Engineering/Research, Sales, Product, and MarketingEffective verbal/written communication, and technical presentation skillsSelf-starter with passion for growth, enthusiasm for continuous learning and sharing findings across the teamWays to stand out from the crowd:Experience working with enterprise developers and customer facing skillsExperience with large-scale production data pipelines and AI model training/deploymentKnowledge of MLOps technologies such as containers, Kubernetes, data center deployments etc.Able to think creatively to debug and solve complex problemsWe make extensive use of conferencing tools, but occasional travel is required for local on-site visit to customers and data science conferences. We are open to remote work location. We look forward to have you join our team!With highly competitive salaries, a comprehensive benefits package, and an excellent engineering work culture, NVIDIA is widely considered to be one of the technology industry's most desirable employers. NVIDIA has some of the most innovative people working on meaningful problems that are defining the field of ML/DL, data science, robotics, and graphics.The base salary range is 148,000 USD - 230,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.SummaryLocation: US, CA, Santa Clara; US, TX, Remote; US, TN, Remote; US, CO, Boulder; US, NY, New York; US, MA, RemoteType: Full time