Fidelity Investments
Senior DevOps Engineer
Fidelity Investments, Louisburg, North Carolina, United States, 27549
Job Description
The Role...You will be part of a tremendously talented and diverse software development team responsible for building and maintaining multiple environments! You will bring cloud management and sysadmin skills for delivering mission critical infrastructure ensuring the highest levels of availability, performance and security while assisting in system engineering activities and looking for ways to automate the end-to-end configuration and build out process. You have a passion for technology tempered by a level-headed approach to problem solving, and experience with cloud technologies.The Expertise and Skills You Bring
Ability to automate with various scripting languages (Python, Shell scripting, etc.)Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef)Solid understanding of Cloud Computing and DevOps concepts including CI/CD pipelinesHands-on Kubernetes skills and knowledge.Hands on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry, Datadog, etc.)Experienced in Instrumentation with systems skills on building and operating, monitoring, logging, alerting services of distributed systems at scaleProven experience in maintaining scalability and resiliency of complex environment.Proven experience in implementing advanced observability practices and techniques at scale.Demonstrated ability to use modern monitoring tools (DataDog, Prometheus, Splunk)Proficient communication skills with an ability to reach both technical and non-technical audienceExperience with configuration management and infrastructure management systems like Ansible, Chef, Docker, CloudFormation.Experience with container technologies like Docker.LWC and containerization orchestration tools like ECS, AKS, EKS preferred.Basic knowledge of open source platforms (Apache, Tomcat etc.).Experience with Cloud technologies with cloud providers AWS, Azure, GCP etc.Knowledge of AWS Cloud Devops services such as IAM, VPC, ECS, Lambda, RDS.Have a working knowledge of databases, SQL and NOSQL (MongoDB/CouchDB/DynamoDB).Experience with configuring and installing Mem Cached databases like Redis.Experience with streaming platforms like Kafka preferred.Responsibilities
Help define and implement a comprehensive reliability and observability strategy, ensuring that Fidelity’s systems are always available when our customers need them.Bring together technical, procedural, and financial data to reduce toil and increase efficiency.You will execute plans for technical standardization and process refinement within the engineering organization, especially for Site Reliability Engineers.Solve stack-wide engineering issues related to hardware, software, network, applications, and cloud service providers.Coach peer SREs and development teams on how to build highly available systems.Work with internal Fidelity release groups to setup and maintain Non-Prod and Production environments infrastructure and CI/CD efforts.Seek out opportunities to develop and improve existing automation processes.Monitor the health of our production applications.Troubleshoot and debug CI/CD issues, with a willingness to resolve problems.Collect and report on operational metrics for SLA reporting and capacity planning.Strong grasp of Unix-based operating processing systems (Linux).Strong containerization technologies experience in hybrid cloud platforms.Cloud and Infrastructure experience is a strong plus.Thrive in a fast-paced, results driven environment.Benefits
You can take advantage of flexible benefits that support you through every stage of your career, empowering you to thrive at work and at home.
#J-18808-Ljbffr
The Role...You will be part of a tremendously talented and diverse software development team responsible for building and maintaining multiple environments! You will bring cloud management and sysadmin skills for delivering mission critical infrastructure ensuring the highest levels of availability, performance and security while assisting in system engineering activities and looking for ways to automate the end-to-end configuration and build out process. You have a passion for technology tempered by a level-headed approach to problem solving, and experience with cloud technologies.The Expertise and Skills You Bring
Ability to automate with various scripting languages (Python, Shell scripting, etc.)Experience managing systems using infrastructure as code tools (IAM, ARM, Terraform, Chef)Solid understanding of Cloud Computing and DevOps concepts including CI/CD pipelinesHands-on Kubernetes skills and knowledge.Hands on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry, Datadog, etc.)Experienced in Instrumentation with systems skills on building and operating, monitoring, logging, alerting services of distributed systems at scaleProven experience in maintaining scalability and resiliency of complex environment.Proven experience in implementing advanced observability practices and techniques at scale.Demonstrated ability to use modern monitoring tools (DataDog, Prometheus, Splunk)Proficient communication skills with an ability to reach both technical and non-technical audienceExperience with configuration management and infrastructure management systems like Ansible, Chef, Docker, CloudFormation.Experience with container technologies like Docker.LWC and containerization orchestration tools like ECS, AKS, EKS preferred.Basic knowledge of open source platforms (Apache, Tomcat etc.).Experience with Cloud technologies with cloud providers AWS, Azure, GCP etc.Knowledge of AWS Cloud Devops services such as IAM, VPC, ECS, Lambda, RDS.Have a working knowledge of databases, SQL and NOSQL (MongoDB/CouchDB/DynamoDB).Experience with configuring and installing Mem Cached databases like Redis.Experience with streaming platforms like Kafka preferred.Responsibilities
Help define and implement a comprehensive reliability and observability strategy, ensuring that Fidelity’s systems are always available when our customers need them.Bring together technical, procedural, and financial data to reduce toil and increase efficiency.You will execute plans for technical standardization and process refinement within the engineering organization, especially for Site Reliability Engineers.Solve stack-wide engineering issues related to hardware, software, network, applications, and cloud service providers.Coach peer SREs and development teams on how to build highly available systems.Work with internal Fidelity release groups to setup and maintain Non-Prod and Production environments infrastructure and CI/CD efforts.Seek out opportunities to develop and improve existing automation processes.Monitor the health of our production applications.Troubleshoot and debug CI/CD issues, with a willingness to resolve problems.Collect and report on operational metrics for SLA reporting and capacity planning.Strong grasp of Unix-based operating processing systems (Linux).Strong containerization technologies experience in hybrid cloud platforms.Cloud and Infrastructure experience is a strong plus.Thrive in a fast-paced, results driven environment.Benefits
You can take advantage of flexible benefits that support you through every stage of your career, empowering you to thrive at work and at home.
#J-18808-Ljbffr