Apple Inc.
Software Engineer (Site Reliability), Retail Engineering
Apple Inc., Austin, Texas, us, 78716
Software Engineer (Site Reliability), Retail Engineering
Austin
,
Texas
,
United StatesSoftware and ServicesCarrier Services offers seamless integration of Apple Retail Stores and Apple Online store with major US Carriers for iPhone activations. We are looking for a talented Site Reliability Engineer to join our growing team.As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and services. You will work closely with our engineering and operations teams to design, build, and maintain robust infrastructure and automation solutions.If you are an SRE engineer who can thrive in a dynamic environment and can make a meaningful impact through your technical expertise and dedication to excellence, come join our team as a Site Reliability Engineer (SRE).DescriptionThis role demands extensive hands on experience of working as SRE engineer for large scale, customer facing Cloud applications. Candidate should have good understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts. Candidate should have excellent troubleshooting and problem solving skills.Candidate will be expected to represent the SRE organization in design reviews and operational readiness exercises for new and existing services. They will also be required to collaborate with technical and non technical teams and analyze statistics to come up with a clear picture on current state of our system. Having good working knowledge of Oracle and Cassandra databases will be beneficial in this regard.Candidate should have a passion to automate manual operations and to improve them through repeated iteration.They should have good understanding of networking and load balancing concepts and should be able to lead a small team and come up with innovative solutions. They should be self motivated, capable of taking business critical decisions and should be comfortable working in a dynamic, ever changing environment. Candidate should be proactive in dealing with critical production issues and take them to closure while working with required partners. Participate in an on call rotation providing hands-on technical expertise during service impacting events.Minimum Qualifications2 years of hands on experience as an SRE engineer, managing and debugging customer-reported incidents, prioritizing them based on impact, and ensuring timely resolutions.2 years of hands on experience building complex queries and dashboard using Splunk2 years of practical experience in performing root cause analysis, documenting defects, and working alongside engineering and leadership teams to prioritize resolutions2 years of promoting observability of systems for monitoring, alerting, and metrics reporting using Datadog, Prometheus and similar tools2 years proficiency with at least 1 scripting language like Python etc.2 years working on Oracle and Cassandra databases (writing complex queries to fetch data)BS in Computer Science or equivalent work experience is preferredKey Qualifications
Preferred QualificationsWillingness to participate in on-call rotations and provide weekend coverage as neededStrong problem solving skills, software development and debugging skillsProven track record of taking ownership and successfully delivering resultsShould be comfortable working in fast paced and dynamic environmentFluency in Japanese language is a plus!Education & Experience
Additional RequirementsApple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.To view your favorites, sign in with your Apple Account.
#J-18808-Ljbffr
Austin
,
Texas
,
United StatesSoftware and ServicesCarrier Services offers seamless integration of Apple Retail Stores and Apple Online store with major US Carriers for iPhone activations. We are looking for a talented Site Reliability Engineer to join our growing team.As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and services. You will work closely with our engineering and operations teams to design, build, and maintain robust infrastructure and automation solutions.If you are an SRE engineer who can thrive in a dynamic environment and can make a meaningful impact through your technical expertise and dedication to excellence, come join our team as a Site Reliability Engineer (SRE).DescriptionThis role demands extensive hands on experience of working as SRE engineer for large scale, customer facing Cloud applications. Candidate should have good understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts. Candidate should have excellent troubleshooting and problem solving skills.Candidate will be expected to represent the SRE organization in design reviews and operational readiness exercises for new and existing services. They will also be required to collaborate with technical and non technical teams and analyze statistics to come up with a clear picture on current state of our system. Having good working knowledge of Oracle and Cassandra databases will be beneficial in this regard.Candidate should have a passion to automate manual operations and to improve them through repeated iteration.They should have good understanding of networking and load balancing concepts and should be able to lead a small team and come up with innovative solutions. They should be self motivated, capable of taking business critical decisions and should be comfortable working in a dynamic, ever changing environment. Candidate should be proactive in dealing with critical production issues and take them to closure while working with required partners. Participate in an on call rotation providing hands-on technical expertise during service impacting events.Minimum Qualifications2 years of hands on experience as an SRE engineer, managing and debugging customer-reported incidents, prioritizing them based on impact, and ensuring timely resolutions.2 years of hands on experience building complex queries and dashboard using Splunk2 years of practical experience in performing root cause analysis, documenting defects, and working alongside engineering and leadership teams to prioritize resolutions2 years of promoting observability of systems for monitoring, alerting, and metrics reporting using Datadog, Prometheus and similar tools2 years proficiency with at least 1 scripting language like Python etc.2 years working on Oracle and Cassandra databases (writing complex queries to fetch data)BS in Computer Science or equivalent work experience is preferredKey Qualifications
Preferred QualificationsWillingness to participate in on-call rotations and provide weekend coverage as neededStrong problem solving skills, software development and debugging skillsProven track record of taking ownership and successfully delivering resultsShould be comfortable working in fast paced and dynamic environmentFluency in Japanese language is a plus!Education & Experience
Additional RequirementsApple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.To view your favorites, sign in with your Apple Account.
#J-18808-Ljbffr