Apex Systems
Site Reliability Engineer, Senior
Apex Systems, Fairfax, Virginia, United States, 22032
Apex Systems, a World-Class Technology Solutions Provider, is seeking applicants for the below position on behalf of our client.
Please email an updated resume to Jordan at [email protected] if interested and qualified.
*Please note that only qualified candidates will be contacted.
Position: Site Reliability Engineer, Senior
Clearance Requirement:
Candidates must be US citizens able to obtain and/or maintain a Department of Defense Public Trust as a condition and continuation of employment
Location: Hybrid; Fairfax, VA
Pay Rate Range: $118,000-$177,500
Project Description:
Our client is seeking a talented Senior Site Reliability Engineer (SRE) to play a key role in defining, implementing, and growing our SRE practice to ensure the reliability, availability, and performance of our critical production environments. The Senior SRE will contribute to a culture of continuous improvement by identifying areas for enhancement and driving initiatives to improve system reliability, scalability, and efficiency. The successful candidate will have demonstrated hands-on experience designing, implementing, and maintaining solutions to ensure that systems, including infrastructure and applications, are resilient, highly available, and performant. The Senior SRE will also play a critical role in defining and measuring the Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for our solution.
Day-to-Day Responsibilities:
The Senior SRE will be responsible for setting up comprehensive logging, monitoring, and alerting solutions using the Elastic Stack and other tools as necessary to ensure the continuous performance of services. Additionally, they will respond to incidents, perform root cause analyses, and implement solutions to prevent recurrences. The Senior SRE will work in close collaboration with other SRE team members, developers, testers, infrastructure engineers, DevOps engineers, and other stakeholders to integrate reliability and observability into the software development lifecycle.
Selling Points for Candidates:
Permanent hire with our client
Requirements:
US citizenship with ability to obtain Public Trust Suitability
6+ years of experience as a Site Reliability Engineer (SRE) or equivalent
6+ years of demonstrated experience designing, implementing, and maintaining observability solutions to include logging, monitoring, and alerting
6+ years of hands-on experience with SRE tools (e.g., Elastic, Prometheus, Grafana, Splunk, etc.)
3+ years defining and measuring SLOs and SLIs
3+ years of relevant experience using cloud platforms (AWS GovCloud preferred)
3+ years of hands-on programming or scripting (e.g., Python, Bash, etc.)
Strong knowledge of microservices, containerization, and orchestration tools (Docker, Kubernetes)
Proven ability to collaborate with cross-functional teams (development, testing, and product) to integrate reliability and observability into the software development lifecycle
Strong problem-solving and analytical skills
Proactive, detail-oriented approach to identifying inefficiencies and implementing improvements
Desired Skills:
Bachelor's degree in Computer Science, Engineering, or a related field (or 4 additional years of related experience)
Experience working in an Agile/SAFe environment using ALM tools (Jira, Confluence, or similar)
Strong understanding of CI/CD principles and platforms (Jenkins, CircleCI, GitLab, GitHub Actions, Argo, Travis CI, etc.)
Expertise in configuration management tools (Ansible, Puppet, Chef)
Experience with infrastructure as code (Terraform, CloudFormation)
In-depth understanding of networking, security, and system administration of Linux operating systems
Knowledge of version control platforms and branching strategies
Knowledge of disaster recovery planning, backup strategies, and data replication
Experience supporting large Federal programs ($200M+)
EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at [email protected] or 844-463-6178.
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing® in Talent Satisfaction in the United States and Great Place to Work® in the United Kingdom and Mexico.