NLP PEOPLE

AIML – ML Engineer, Machine Learning Platform & Infra

NLP PEOPLE, Cupertino, California, United States, 95014

Summary

Posted: Jun 26, 2024

Role Number: 200556947

Do you think differently? Are you eager to break the status quo, bold and ambitious, unafraid to take risks, and passionate about building best-in-class technology? If yes, what better place to do this than Apple? At Apple, we think different, we push the boundaries of computing and intelligence, and we build products that bring smiles to people’s faces. The Foundation Model Infrastructure team, within the Machine Learning Platform Technologies organization, is the backbone of Apple Intelligence. It builds frameworks, services, and tools that power the largest Apple foundation models on servers. Our infrastructure powers a wide gamut of services at Apple, including Apple Search, Apple Music, Apple TV, App Store, iMessage, Photos & Camera, Spotlight, Safari, Siri, and exciting upcoming Apple products, serving millions of queries every day with incredibly low latencies and drawing every ounce of compute from our hardware. As part of this group, you will get a chance to bring intelligence to billions of users across the world and make a difference in people’s lives. You will have a chance to work on optimizing multi-billion-parameter language, vision, and speech models using state-of-the-art technologies and make them run at the scale of Apple.

Description

Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures.

Work closely with product teams to build production-grade solutions to launch models serving millions of customers in real time.

Build tools to understand bottlenecks in inference for different hardware and use cases.

Mentor and guide engineers in the organization.

Minimum Qualifications

5+ years of experience leading and driving complex, ambiguous projects.

Experience with high throughput services, particularly at supercomputing scale.

Proficient in running applications in the cloud (AWS, Azure, or equivalent) using Kubernetes, Docker, etc.

Familiar with GPU programming concepts using CUDA.

Familiar with at least one popular ML framework, such as PyTorch or TensorFlow.

Preferred Qualifications

Proficient in building and maintaining systems written in modern languages (e.g., Golang, Python).

Familiar with fundamental deep learning architectures such as Transformers and Encoder/Decoder models.

Familiarity with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, NVIDIA Triton Inference Server, etc.

Experience writing custom GPU kernels using CUDA or OpenAI Triton.

Pay & Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $175,800 and $312,200, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become Apple shareholders through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.
