Fannie Mae
Lead Site Reliability Engineer - Production Support Services (Flexible Hybrid)
Fannie Mae, Reston, VA, United States
Job Description
As a valued colleague on our team, you will act as a team lead in the designing, producing, testing, or implementing software, technology, or processes, as well as lead processes for creating and maintaining IT architecture, large scale data stores, and cloud-based systems.
THE IMPACT YOU WILL MAKE
The Support and Tools - Software Engineering - Lead Associate role will offer you the flexibility to make each day your own, while working alongside people who care so that you can deliver on the following responsibilities:
* Independently determine the needs of the customer while identifying and resolving conflicting or complementary needs across customer groups.
* Applying advanced skill, knowledge and experience, design and develop software solutions to meet customer needs.
* Use a process-driven approach to leading design solutions.
* Implement new software technology and coordinate simultaneous implementation tasks across teams.
* May maintain or oversee the maintenance of existing software.
Qualifications
THE EXPERIENCE YOU BRING TO THE TEAM
Required Experience
* 4 plus years of experience developing enterprise applications
* 4 plus years of engineering enterprise cloud infrastructure
* Experience managing technical stakeholders
* Experience mentoring and coaching junior engineers
* Experience with Application Performance Management and Observability
Desired Experience
* Bachelor's degree in computer science, Management Information Systems (MIS), Systems Engineering, or related field
* Certification in AWS Solutions Architect Associate or Developer Associate, Splunk Certification Developer, or Sun Certified Java Developer
* Experience with application production / operations support, including incident response, problem management, runbooks, and knowledge articles
* Experience with post-mortems, root-cause analysis (RCA), and / or AWS Correction-of-Errors (CoE)
* Experience with Failure Mode Effect Analysis (FMEA) and Chaos testing / engineering
* Experienced in application monitoring / observability, including building dashboards, establishing service level indicators / objectives / agreements (SLIs / SLOs / SLAs), and logging / tracing
Skills
* Skilled in programming in Java and / or Python with an understanding J2EE frameworks, such as Spring Boot / Spring Cloud, and REST
* Skilled in AWS cloud applications and technologies, including containerization, virtualization, microservices, and server-less architecture in tools
* Understanding of error budgeting and toil reduction
* Ability to create disaster recovery plans and execute failover tests
* Skilled in capacity planning and performance testing / engineering tools, such as JMeter and / or LoadRunner
* Skilled in Scaled Agile Framework (SAFe) and Jira / Confluence
* Understanding of fault tolerant / resilience architectural design patterns, such as Bulkhead, Circuit-breaker, Retry, Timeout, etc.
* Ability to create automation solutions using tools such as BluePrism and / or Selenium
* Excellent problem-solving skills and proactivity in resolving issues / blockers
* Excellent verbal / written communication skills, relationship management skills, and ability to collaborate with multiple stakeholders
Tools
* AWS (ECS, EC2, RDS, Redshift, EMR, Lambda, Route 53, Step Functions)
* Programming using Python/Java
* DevOps - Infrastruture as Code, CICD - Jenkins, GitLab, Terraform
* ServiceNow, Moogsoft, StatusHub, and / or Blameless
* Gremlin, Chaos Monkey, Chaos Toolkit, AWS Fault Injection Service (FIS)
Additional Information
The future is what you make it to be. Discover compelling opportunities at careers.fanniemae.com. (www.fanniemae.com/careers)
Fannie Mae is an Equal Opportunity Employer, which means we are committed to fostering a diverse and inclusive workplace. All qualified applicants will receive consideration for employment without regard to race, religion, national origin, gender, gender identity, sexual orientation, personal appearance, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation in the application process, email us at careers_mailbox@fanniemae.com.
Fannie Mae is a flexible hybrid company. We embrace flexibility for our employees to work where they choose, while also providing office space for in-person work if desired. At times, business need may call for on-site collaboration, which means proximity within a reasonable commute to your designated office location is preferred unless job is noted as open to remote.
The hiring range for this role is set forth on each of our job postings located on Fannie Mae's Career Site. Final salaries will generally vary within that range based on factors that include but are not limited to, skill set, depth of experience, certifications, and other relevant qualifications. This position is eligible to participate in a Fannie Mae incentive program (subject to the terms of the program). As part of our comprehensive benefits package, Fannie Mae offers a broad range of Health, Life, Voluntary Lifestyle, and other benefits and perks that enhance an employee's physical, mental, emotional, and financial well-being. See more here. (https://www.fanniemae.com/careers/benefits) PandoLogic. Keywords: Reliability Engineer, Location: Reston, VA - 20190