Spectrum

Staff Software Engineer, Machine Learning Systems, Accelerator Performance

Spectrum, Sunnyvale, California, United States, 94087


Minimum qualifications:

Bachelor's degree or equivalent practical experience.
8 years of experience in software development, with data structures/algorithms.
7 years of experience building and developing infrastructure, distributed systems, networks, compute technologies, storage, or hardware architecture.
5 years of experience with design and architecture, and testing/launching software products.

Preferred qualifications:

Experience with large language models (LLMs).
Experience with ML-based performance.
Knowledge of performance analysis.
Knowledge of computer architecture.

About the job

Google Cloud's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. You will anticipate our customer needs and be empowered to act like an owner, take action and innovate. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.

Google Cloud accelerates organizations’ ability to digitally transform their business with the best infrastructure, platform, industry solutions and expertise. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology – all on the cleanest cloud in the industry. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

Explore and define future ML accelerator system and chip architecture with objective.
Build and maintain system simulation infrastructure to enable system, understand hardware and software co-design and optimization.
Understand the latest business-critical production ML models (e.g., large language models, large embedding models).
Build and maintain robust AutoML infrastructure for automated hardware-friendly model optimization/enablement at scale.
