Advanced Micro Devices

Sr. Applied Research Scientist

Advanced Micro Devices, San Jose, California, United States, 95199

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. THE ROLE: We are looking for a Sr. Applied Research Scientist experienced with training large language models and/or large multimodal models. In this role, you will explore novel LLM/LMM architectures and large-scale training techniques to advance the state-of-the-arts. You will be part of a world-class research team working on pre-training, fine-tuning, and aligning large language and multimodal models, in addition to keeping up to date on the latest progress and trends in LLM/LMM and foundation models. THE PERSON: Do you like to design and implement novel research ideas, improve the quality of the large language and multimodal models, accelerate the training and inference speed of LLMs/LMMs, and influence future hardware and software direction? If so, this role is for you. The ideal candidate will have expertise and hands-on experience in training LLMs/LMMs, be familiar with hyper-parameter tuning techniques, data preprocessing, tokenization methods, and latest training approaches for LLMs/LMMs. A successful candidate needs to be knowledgeable with the latest transformer architectures. KEY RESPONSIBILITIES: Train and finetune LLMs/LMMs. Improve on the state-of-the-art LLMs/LMMs. Accelerate the training and inference speed of LLMs/LMMs. Research novel ML techniques and model architectures. Influence the direction of AMD AI platform. Publish your work at top-tier venues. Engage with academia and open-source ML communities. PREFERRED EXPERIENCE: Experience in developing and debugging in Python. Experienced with text-to-image / text-to-video / image-to-text or video-to-text models. Experience in ML frameworks such as PyTorch, JAX, or TensorFlow. Experience with distributed training. Expertise on LLM/LMM pretraining, finetuning, and/or RLHF. Expertise on transformer architecture. Strong publication record in top tier conferences and journals. Strong communication and problem-solving skills. ACADEMIC CREDENTIALS: A PhD degree or equivalent in machine learning, computer science, artificial intelligence, or a related field. LOCATION: San Jose or Seattle; other US locations may be considered.

#J-18808-Ljbffr