Apple
Compute SRE - Engineer
Apple, Cupertino, California, United States, 95014
Compute SRE - Engineer
Cupertino, California, United States
Software and Services
Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Join Apple’s Cloud Service Infrastructure team as a site reliability engineer to help support and scale cloud services for thousands of development and operations engineers. This is a hands-on role to establish SRE practices for a private cloud service to accelerate our ability to reliably and consistently deliver thousands of applications.
Description
As a Site Reliability Engineer you will be responsible for providing the platform for important cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to thrive. The successful candidate will be highly self-motivated with a passion for excellence, quality and detail. The SRE will not only support operations, but also work closely with the developers and architects within the team to aid in the design and assist with the implementation to improve stability, security and scalability. AS AN SRE AT APPLE, YOU WILL:
Operate, monitor, and creatively prioritize all aspects of our production and non-production environments.
Design, build and implement innovative solutions for previous, present and future issues.
Prepare alert handling procedures, runbooks, and collaborate with the off-shore SRE teams.
Automate deployment and orchestration of services into the cloud environment as well as other routine processes.
Actively participate in capability planning, scale testing, and disaster recovery exercises.
Get along with and support partner teams, including engineering, QA, and program management.
Cultivate and manage relationships with internal and external third-party vendors.
Minimum Qualifications
8+ years in a Site Reliability Engineering, DevOps, or Infrastructure focused role
Must be an expert and have in-depth professional experience with cloud operations, with a focus on "infrastructure-as-a-service" (compute, storage, and network virtualization)
Proficient in Python and a solid understanding of GoLang
You bring experience operating large-scale multi-tenant Infrastructure as a Managed service
Familiarity with cloud infrastructure concepts (zones, regions, VPCs, etc)
Experience with Infrastructure as a Service orchestration tools (OpenStack, CloudStack, etc) is a plus
Able to solve issues across the entire infrastructure stack
Experience with Linux system virtualization (Libvirt, QEMU, KVM, etc), along with the APIs
Ability to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana, and Prometheus
Working understanding of common authentication schemes, certificates, and securely handling secrets
Outstanding interpersonal and communications skills
Preferred Qualifications
B.S. in computer science or similar field or equivalent experience.
Additional Requirements
This posting is not for a specific job opening and by submitting your resume you are expressing interest in being contacted about this type of role at Apple in the future.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.
#J-18808-Ljbffr
Cupertino, California, United States
Software and Services
Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Join Apple’s Cloud Service Infrastructure team as a site reliability engineer to help support and scale cloud services for thousands of development and operations engineers. This is a hands-on role to establish SRE practices for a private cloud service to accelerate our ability to reliably and consistently deliver thousands of applications.
Description
As a Site Reliability Engineer you will be responsible for providing the platform for important cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to thrive. The successful candidate will be highly self-motivated with a passion for excellence, quality and detail. The SRE will not only support operations, but also work closely with the developers and architects within the team to aid in the design and assist with the implementation to improve stability, security and scalability. AS AN SRE AT APPLE, YOU WILL:
Operate, monitor, and creatively prioritize all aspects of our production and non-production environments.
Design, build and implement innovative solutions for previous, present and future issues.
Prepare alert handling procedures, runbooks, and collaborate with the off-shore SRE teams.
Automate deployment and orchestration of services into the cloud environment as well as other routine processes.
Actively participate in capability planning, scale testing, and disaster recovery exercises.
Get along with and support partner teams, including engineering, QA, and program management.
Cultivate and manage relationships with internal and external third-party vendors.
Minimum Qualifications
8+ years in a Site Reliability Engineering, DevOps, or Infrastructure focused role
Must be an expert and have in-depth professional experience with cloud operations, with a focus on "infrastructure-as-a-service" (compute, storage, and network virtualization)
Proficient in Python and a solid understanding of GoLang
You bring experience operating large-scale multi-tenant Infrastructure as a Managed service
Familiarity with cloud infrastructure concepts (zones, regions, VPCs, etc)
Experience with Infrastructure as a Service orchestration tools (OpenStack, CloudStack, etc) is a plus
Able to solve issues across the entire infrastructure stack
Experience with Linux system virtualization (Libvirt, QEMU, KVM, etc), along with the APIs
Ability to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana, and Prometheus
Working understanding of common authentication schemes, certificates, and securely handling secrets
Outstanding interpersonal and communications skills
Preferred Qualifications
B.S. in computer science or similar field or equivalent experience.
Additional Requirements
This posting is not for a specific job opening and by submitting your resume you are expressing interest in being contacted about this type of role at Apple in the future.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.
#J-18808-Ljbffr