Triumph
AWS DevOps Engineer
Triumph, Richmond, Virginia, United States, 23214
DevOps Engineer
\nContract to Hire
\nOnsite
\n
\nTriumph is seeking a DevOps Engineer to join our client's growing team. This position is an excellent opportunity for a motivated individual with a passion for cloud technologies and a desire to develop their skills in multiple areas: Cloud Engineering, DevOps, SRE, and Security. As a DevOps Engineer, you will work alongside experienced professionals, contributing to the design, implementation, and maintenance of cloud infrastructure solutions that support the organization's digital transformation.
\n
\n
Responsibilities:
\n
\n\t
Cloud Infrastructure Management:
\n\t
\n\t\t
Assist in the deployment, configuration, and maintenance of AWS resources, including (but not limited to) containerized workloads, monitoring, storage, data, and networking components. \n\t\t Work with Infrastructure as Code tools such as CloudFormation/Terraform to automate resource provisioning. \n\t\t Recommend new tooling to creatively solve problems and meet needs as they arise \n\t \n\t\n\t CICD
\n\t
\n\t\t
Design and implement flexible CICD Pipeline features to meet development team needs \n\t\t Ensure consistency in builds, deployments, releases, versioning, etc. \n\t\t Add new features to the pipeline as they are needed in a timely and organized manner \n\t \n\t\n\t Monitoring/Reliability:
\n\t
\n\t\t
Implement and maintain industry-standard monitoring solutions to ensure the performance, availability, and security of AWS resources. \n\t\t Think critically, address reliability concerns and incidents from a business-focused mindset \n\t\t Drive cross-team resolution of incidents to completion, ensuring all necessary parties are involved with the solution and postmortem \n\t\t Ensure all necessary metrics, logs, and data points are collected, visualized, and configured for alerting \n\t \n\t\n\t Disaster Recovery and Business Continuity:
\n\t
\n\t\t
Set up and/or maintain AWS environments in a multi-region configuration \n\t\t Participate in testing of recovery environments to meet published RTO metrics. \n\t \n\t\n\t Security and Compliance:
\n\t
\n\t\t
Implement security best practices and compliance measures for AWS infrastructure. \n\t\t Wherever possible, improve our existing cloud security posture \n\t \n\t\n \n
Required Skills:
\n\n
\n\t
\n\t
\n\t\t
Monitoring: CloudWatch, Prometheus, Grafana, Etc. \n\t\t Cloud Platform: AWS Solutions Architect or equivalent experience \n\t\t CICD/Tooling: GitHub Actions, Docker, bash, Python \n\t\t IaC: Terraform, CloudFormation \n\t \n\t\n \n
Nice to have:
\n\n
\n\t
\n\t
\n\t\t
Monitoring: Loki, New Relic \n\t\t Cloud Platform: Experience with multiple cloud providers, Kubernetes \n\t\t CICD/Tooling: ArgoCD \n\t\t IaC: Helm, Kustomize \n\t \n\t\n \n \n \n#Dice
Responsibilities:
\n
\n\t
Cloud Infrastructure Management:
\n\t
\n\t\t
Assist in the deployment, configuration, and maintenance of AWS resources, including (but not limited to) containerized workloads, monitoring, storage, data, and networking components. \n\t\t Work with Infrastructure as Code tools such as CloudFormation/Terraform to automate resource provisioning. \n\t\t Recommend new tooling to creatively solve problems and meet needs as they arise \n\t \n\t\n\t CICD
\n\t
\n\t\t
Design and implement flexible CICD Pipeline features to meet development team needs \n\t\t Ensure consistency in builds, deployments, releases, versioning, etc. \n\t\t Add new features to the pipeline as they are needed in a timely and organized manner \n\t \n\t\n\t Monitoring/Reliability:
\n\t
\n\t\t
Implement and maintain industry-standard monitoring solutions to ensure the performance, availability, and security of AWS resources. \n\t\t Think critically, address reliability concerns and incidents from a business-focused mindset \n\t\t Drive cross-team resolution of incidents to completion, ensuring all necessary parties are involved with the solution and postmortem \n\t\t Ensure all necessary metrics, logs, and data points are collected, visualized, and configured for alerting \n\t \n\t\n\t Disaster Recovery and Business Continuity:
\n\t
\n\t\t
Set up and/or maintain AWS environments in a multi-region configuration \n\t\t Participate in testing of recovery environments to meet published RTO metrics. \n\t \n\t\n\t Security and Compliance:
\n\t
\n\t\t
Implement security best practices and compliance measures for AWS infrastructure. \n\t\t Wherever possible, improve our existing cloud security posture \n\t \n\t\n \n
Required Skills:
\n\n
\n\t
\n\t
\n\t\t
Monitoring: CloudWatch, Prometheus, Grafana, Etc. \n\t\t Cloud Platform: AWS Solutions Architect or equivalent experience \n\t\t CICD/Tooling: GitHub Actions, Docker, bash, Python \n\t\t IaC: Terraform, CloudFormation \n\t \n\t\n \n
Nice to have:
\n\n
\n\t
\n\t
\n\t\t
Monitoring: Loki, New Relic \n\t\t Cloud Platform: Experience with multiple cloud providers, Kubernetes \n\t\t CICD/Tooling: ArgoCD \n\t\t IaC: Helm, Kustomize \n\t \n\t\n \n \n \n#Dice