Apple
Service Reliability Engineer (SRE), Data Infrastructure
Apple, Seattle, Washington, us, 98127
Service Reliability Engineer (SRE), Data Infrastructure
Seattle, Washington, United States
Software and Services
The Apple Services Engineering team (ASE) is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. They do it at an extensive scale, meeting high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. These engineers build secure, end-to-end solutions and develop the custom software used to process all the creative work, the tools that providers use to deliver that media, all the server-side systems, and the APIs for many Apple services. Thanks to Apple’s unique integration of hardware, software, and services, engineers here partner to get behind a single unified vision that includes a deep commitment to strengthening Apple’s privacy policy, one of our core values.
Description
The Service Reliability Engineer (SRE) role in Apple Services Engineering requires a mix of strategic engineering and design along with hands-on, technical work. This SRE will configure, tune, and fix multi-tiered systems to achieve optimal application performance, stability, and availability. We manage jobs as well as applications on bare-metal and cloud computing platforms to deliver data processing for many of Apple’s global products. Our teams work with exabytes of data, petabytes of memory, and tens of thousands of jobs to enable predictable and performant data analytics for features in Apple Music, TV+, App Store, and other world-class products. If you love designing and running systems and infrastructure that will impact millions of users, then this is the place for you!
Minimum Qualifications
BS degree in computer science or equivalent field with 5+ years or MS degree with 3+ years experience, or equivalent.
At least 5 years in a Service Reliability Engineering (SRE), DevOps, or infrastructure-focused role.
5+ years of running services in a large scale *nix environment.
Understanding of SRE principles and goals along with prior on-call experience.
The ability to design, author, and release code in any language (Go, Python, Ruby, or Java would be a plus).
Deep understanding and experience in one or more of the following: Hadoop, Spark, Flink, Kubernetes, AWS.
Key Qualifications
Preferred Qualifications
Fast learner with excellent analytical problem-solving and interpersonal skills.
Experience working on supporting Java applications.
Experience using monitoring and logging solutions like Splunk, Grafana, etc.
Familiarity with DNS, HTTP, message queues, queueing theory, RPC frameworks, and datastore.
Experience working with geographically distributed teams and implementing high-level projects and migrations.
Strong communication skills and ability to deliver results on time with high quality.
#J-18808-Ljbffr
#J-18808-Ljbffr