LCG
Site Reliability Engineer (Mid) Cloud
LCG, Bethesda, Maryland, US, 20811
This job opportunity is part of an RFP process; candidates are invited to submit their resumes detailing relevant experience.
Location: Bethesda, MD (Hybrid)
LCG is a minority-owned technology consulting firm that has been a trusted partner to more than 40 federal agencies, including 21 of the 27 Institutes and Centers (ICs) at the National Institutes of Health (NIH). For over 25 years, LCG has brought digitization and innovation to the Health and Human Services (HHS) and the NIH ecosystems. We support IT organizations by bringing precision technology and operation models that achieve mission capabilities and performance success.
Job Title: Site Reliability Engineer (Mid) Cloud
Overview: We are seeking a Site Reliability Engineer (Mid) Cloud to manage and maintain optimal platform infrastructure performance, reliability, and security. The SRE will use CI/CD tools, processes, and designs to ensure seamless infrastructure operations, providing critical support for cloud services. The ideal candidate will have a background in cloud computing, automation, and performance optimization. This role involves deploying scalable software tools, automating monitoring systems, and addressing incidents to improve system reliability.
Key Responsibilities:
Manage and optimize platform infrastructure performance, reliability, and security through CI/CD tools and practices.
Implement services for automated monitoring, providing critical information for quick response and resolution of performance issues.
Deploy standardized and scalable software tools, ensuring uninterrupted system operations at peak performance.
Troubleshoot and analyze service disruptions, identify root causes, and implement solutions to enhance reliability.
Utilize cloud and virtualization technologies like AWS to scale and automate processes.
Work with configuration management tools (e.g., Ansible, Puppet, Chef), scripting languages (e.g., Python, Bash), and containerization technologies (e.g., Docker, Kubernetes).
Implement load balancing, monitoring, and analysis tools to ensure operational stability.
Collaborate with teams to develop strategic roadmaps and architecture that align with NIH's cloud services goals.
Provide support in data management, cloud migrations, and security, ensuring adherence to industry best practices.
Qualifications:
Bachelor's degree in Computer Science or a related field, or equivalent experience.
2-4 years of relevant experience in site reliability engineering or related cloud operations.
Experience with cloud infrastructure technologies, particularly AWS, and cloud migration strategies.
Proficiency with CI/CD tools, cloud monitoring systems, and configuration management tools.
Knowledge of scripting languages (e.g., Python, Bash) and containerization technologies (e.g., Docker, Kubernetes).
Strong troubleshooting and root cause analysis skills.
Familiarity with cloud security standards and best practices, including NIST and FIPS compliance.
Preferred Qualifications:
Experience working in large-scale environments, particularly with NIH or government agencies.
Familiarity with cloud architecture and data management strategies.
Certifications in AWS, Azure, or other cloud platforms.
Compensation and Benefits
The projected compensation range for this position is $108,000 to $150,750 per year benchmarked in the Washington, D.C. metropolitan area. The target salary is $129,000. The salary range provided is a good faith estimate representative of all experience levels. Salary at LCG is determined by various factors, including but not limited to role, location, the combination of education/training, knowledge, skills, competencies, certifications, and work experience.
LCG offers a competitive, comprehensive benefits package which includes health insurance options (medical, dental, vision), life and disability insurance, retirement plan contributions, as well as paid leave, federal holidays, professional development, and lifestyle benefits.
Devoted to Fair and Inclusive Practices
All qualified applicants will receive consideration for employment without regard to sex, race, ethnicity, age, national origin, citizenship, religion, physical or mental disability, medical condition, genetic information, pregnancy, family structure, marital status, ancestry, domestic partner status, sexual orientation, gender identity or expression, veteran or military status, or any other basis prohibited by law.
If you are interested in applying for employment with LCG and need special assistance or an accommodation to apply for a posted position, contact our Human Resources department by email at .
Securing Your Data
Beware of fraudulent job offers using LCG's name. LCG will never request payment-related details or advancement of money during the application process. Legitimate communication will only come from lcginc.com or emails, not free commercial services like Gmail or WhatsApp. If you receive suspicious emails asking for payment or personal information, contact us immediately at .
If you believe you are the victim of a scam, contact your local law enforcement and report the incident to the .