Apple Inc.
AIML - Machine Learning Engineer, Foundation Model Services
Apple Inc., Seattle, Washington, US, 98127
Do you think differently? Are you eager to break the status quo, bold and ambitious, unafraid to take risks, and passionate about building best-in-class technology? If so, what better place to do it than Apple? At Apple, we think different and we push the boundaries of computing and intelligence. We build products that bring a smile to people's faces.
The Foundation Model Services team, within the Machine Learning Platform Technologies organization, is the backbone of Apple Intelligence. It builds frameworks, services, and tools that power the largest Apple foundation models on servers. Our infrastructure powers a wide range of services at Apple, including Apple Search, Apple Music, Apple TV, App Store, iMessage, Photos & Camera, Spotlight, Safari, Siri, and exciting upcoming Apple products, serving millions of queries every day with incredibly low latencies and drawing every ounce of compute from our hardware. As part of this group, you will have a chance to bring intelligence to billions of users across the world and to make a difference in people's lives. You will work on optimizing language, vision, and speech models with billions of parameters using state-of-the-art technologies, and on making them run at the scale of Apple.
Description
* Work closely with product teams to build production-grade solutions to launch models serving millions of customers in real time.
* Work alongside the Foundation Model Research team to prototype and develop inference for cutting-edge model architectures.
* Build tools to understand inference bottlenecks across different hardware and use cases.
* Mentor and guide engineers in the organization.
Minimum Qualifications
* 8+ years of experience leading and driving complex, ambiguous projects.
* Strong industry background and experience in ML technologies (LLMs, Machine Learning, NLP, Information Retrieval, Statistics).
* Rich experience with high-throughput services, particularly at supercomputing scale.
* Proficient in running applications in the cloud (AWS, Azure, or equivalent) using Kubernetes, Docker, etc.
* Proficient in building and maintaining systems written in modern languages (e.g., Golang, Python).
Preferred Qualifications
* Familiarity with at least one popular ML framework, such as PyTorch or TensorFlow.
* Familiarity with fundamental deep learning architectures such as Transformers and encoder/decoder models.
* Familiarity with Nvidia TensorRT-LLM, vLLM, DeepSpeed, Nvidia Triton Server, etc.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.