Logo
Salesforce

Principal/Architect- Availability Engineering & SRE

Salesforce, San Francisco, CA


To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job CategorySoftware EngineeringJob DetailsAbout SalesforceWe’re Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you’ve come to the right place.Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Salesforce services have reliability, capacity, performance and the availability to deliver our customer's needs and a rate of improvement that our customers expect. Our software development focuses on enabling service owners to operate their services safely at scale, whether through paved path integrations onto observability frameworks, optimizing existing systems, designing infrastructure or eliminating work through AI/ML investments or traditional automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Salesforce, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.The SRE practice at Salesforce is evolving, and this role will shape the technical strategy for SRE and influence the strategy for the Availability Cloud as a whole.  You will embed with product owning teams, define the availability roadmap and deliver directly against it.  Most importantly, you will mature  the SRE practice, mentoring and actively developing the engineers around you.  Your success is measured by scaling the impact and delivery of your community. Responsibilities:Spearhead and enable the culture of Service Ownership to flourish and thrive.   Define healthy service ownership practices and work with embedded teams to develop the knowledge and ownership practiceEngage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement.Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.Develop full paved path observability platform  integrations and necessary automations to maintain service, system and product healthScale systems sustainably through mechanisms like automation, and evolve systems by pushing for and delivering changes that improve reliability and velocity.Practice sustainable incident response and blameless post mortems. Uphold the quality and high standards  of post mortems as part of the Architect community at SalesforceComfortable with hands on coding at least 25%Develop and grow the engineering talent around youMinimum Requirements15+ years of software development and engineering experience, 5+ years in a technical leadership roleHands-on experience designing, building and operating large scale distributed systems, identifying shortcomings and optimization opportunities, and making data driven cost performance tradeoffs to influence design decisionsDemonstrated experience of leading initiatives spanning multiple teams and leveraging deep domain expertise to influence tech roadmap planning and executionDemonstrated ability to effectively collaborate across multiple teams and stakeholders to drive business outcomesExperience, mentoring, and investing in the development of engineers and peersAbility to reverse engineer solutions via independent code and architecture review, envision, define and then contribute to delivery of availability improvement refactoring projectsMastery of one or more object oriented delivery with languages such as Java, Golang, Python, C++, CExperience in: Kubernetes, Istio, Public Cloud (AWS or other)Deep experience working with core web technologies: HTTP, JSON, REST, XMLExperience owning and operating multiple instances of a critical serviceRunning critical infrastructure services; monitoring, alerting, logging, tracing and reportingSubject matter expertise on Service ownership best practices, SLO/I/A definition, driving proactive operational awareness and experience with Incident / Problem managementThorough knowledge of Agile development methodology with experience in both Test / Behavioral Driven Development practiceExperience in fault modeling and tolerance, chaos engineering, performance and load testing.AccommodationsIf you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.Posting StatementAt Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at and explore our company benefits at .Salesforce is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce does not accept unsolicited headhunter and agency resumes. Salesforce will not pay any third-party agency or company that does not have a signed agreement with Salesforce.Salesforce welcomes all.Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.For Washington-based roles, the base salary hiring range for this position is $211,500 to $351,800.For California-based roles, the base salary hiring range for this position is $230,800 to $384,100.Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, benefits. More details about our company benefits can be found at the following link: https://www.salesforcebenefits.com.SummaryLocation: California - San Francisco; Washington - BellevueType: Full time