The Charles Schwab Corporation
Site Reliability Engineer - Quote Plant Support
The Charles Schwab Corporation, Austin, TX
Your Opportunity

At Schwab, you are empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us “challenge the status quo” and transform the finance industry together.

As a Site Reliability Engineer for Schwab's Client Trading Experience Technology, you will be responsible for a sustainable approach to reliability using SRE principles. Our team is essential in supporting the operational reliability of real-time quotes and market data for the firm. You will partner with multiple support teams to provide guidance and drive adoption of key reliability engineering practices in support of large-scale, mission-critical services. We are looking for skilled candidates who are enthusiastic about learning new and existing technologies to deliver exceptional solutions for the production resiliency of our systems. The role requires a high level of responsibility and accountability, yet comes with the support structure necessary for development and growth. It also requires time-sensitive attention, as this market data is heavily tied to real-time client trading.

The role encompasses multiple aspects, ranging from preventing and resolving production incidents to supporting application releases in our software deployment pipeline. During blameless post-mortems, you will have the opportunity to recommend improvements to monitoring and other processes in our production environment, and to work with the respective teams to design and implement those recommendations. Return-to-service activities, on-call rotation, and proactive monitoring are key aspects of this role.
You will work closely with the development team as a SAvE member on symbol updates, server restarts when appropriate, certificate and system account management, and patching coordination.

In this role, you will:
- Demonstrate a Site Reliability Engineering mindset and solve problems through automation and instrumentation.
- Partner with the Architects, Development Leads, Business Partners, and other SREs on the team to ensure implementations are architected and designed with production resiliency in mind.
- Skilled in scripting.
- Ability to work on and create monitoring tools to help support the systems and identify problems.
- Identify opportunities to build innovative tools and solve unique operations problems on large enterprise and mission-critical applications.
- Develop tools, frameworks, and instrumentation to validate and increase rollout success for applications.
- Partner within the Support organizations to build and roll out plans for enhanced telemetry and to reduce defects in software delivery to production.
- Perform real-time troubleshooting of mission-critical application workflows and incorporate feedback into product development.
- Work closely with development teams during the design phase, and build and perform infrastructure upgrades to support our applications' availability and reliability.
- Monitor the current-state solution portfolio to identify deficiencies caused by aging of the technologies used by the application or by misalignment with business requirements.
- Understand, advocate, and augment the Schwab Reliability Engineering principles, guidelines, and standards.
- Analyze the business-IT environment (run, grow, and transform the business) to detect critical deficiencies and recommend solutions for improvement.
- Assist with the evaluation and selection of software product standards and services, as well as the design of standard and custom software configurations.
- Proven track record supporting production application development and support efforts adhering to a mix of DevOps and SRE frameworks.
- Ability to grasp difficult concepts, large architectures, and sophisticated designs quickly.
- Progressive experience supporting highly available, mission-critical environments, and experience leveraging tools to instrument and automate proactive and eventually predictive availability solutions.
- Ability to understand multiple technologies and how they inter-relate and integrate.
- Proven capability to provide operational visibility on environment health to technology and business partners.
- Strong automation, innovation, and problem-solving skills.
- Receptive, approachable teammate with the ability to interact positively with business partners, technology teams, recruiting personnel, offshore, and professional services.
- Strong customer advocate with good written and verbal communication skills.
- Flexibility to participate in on-call support rotation.

What you have

Required Qualifications:
- BS in Computer Science or related technical field with at least 10 years of experience with the listed technical skills
- 8-10 years of experience with enterprise-level administration and support
- 8-10 years of experience practicing SDLC (Software Development Lifecycle) practices and process improvements
- 8-10 years of experience writing automation scripts, building application dashboards for proactive monitoring, and setting up alerts for early detection of issues
- Extensive experience in enterprise-level infrastructure orchestration with Ansible, Chef, Salt, Puppet
- Experience in high availability and distributed systems, and in Linux and Windows administration, troubleshooting, and support
- Strong understanding of and experience with Platform as a Service (PaaS) and Infrastructure as a Service (IaaS) such as Pivotal Cloud Foundry (PCF)
- Experience with Continuous Integration/Continuous Delivery (Bamboo, Harness)
- Experience with on-call support during market hours and off-hours, related to trading
- Experience with Atlassian tools (Jira, Confluence, Bamboo, Bitbucket) and Agile frameworks
- Working knowledge of monitoring tools: Splunk, Elastic, AppDynamics, Dynatrace
- Hands-on enterprise systems administration, monitoring, and deployment activities
- Knowledge of networking, including DNS, DHCP, firewalls, load balancers, and IP routing
- Familiarity with one or more databases: Oracle, SQL Server, MongoDB
- Excellent debugging skills across a variety of integrated platforms

Preferred Qualifications:
- Experience with Java and Spring Boot

In addition to the salary range, this role is also eligible for bonus or incentive opportunities.

Job Summary
Requisition ID: 2024-105671
Posted Date: 5 days ago (11/21/2024 10:14 AM)
Category: Engineering & Software Development
Salary Range: USD $140,000.00 - $165,000.00 / Year
Application deadline: 11/30/2024
Position Type: Full time