Logo
McKinsey & Company

Senior Site Reliability Engineer II

McKinsey & Company, Atlanta, GA


Who You'll Work With

McKinsey & Company is a global management-consulting firm. We work with leading organizations across the private, public and social sectors. Our scale, scope, and knowledge allow us to address problems that no one else can. We have deep functional and industry expertise as well as breadth of geographical reach. We are passionate about taking on immense challenges that matter to our clients and, often, to the world. We work with our clients as we do with our colleagues. We build their capabilities and leadership skills at every level and every opportunity. We do this to help build internal support, get to real issues, and reach practical recommendations.

You'll work with our Secure Foundations - MCS team, which is part of McKinsey's Tech Ecosystem organization, developing new products/services and integrating them into our client work.

Our company is moving fast from traditional IT world to a Digital era embracing Agile principles. We look for highly skilled developers with an SRE mindset to help us with this transformation.

Your impact within our firm

You'll work in small teams (incl. product managers, developers and operations people) in a highly collaborative way, use the latest technologies and enjoy seeing the direct impact from your work.

You'll combine 'Agile' with expertise in cloud, big data and mobile to create and maintain custom solutions, in a way consistent with SRE principles, that help clients increase productivity and make timely decision. This includes but is not limited to; development, implementation and operation of IT systems and processes supporting SaaS applications and platforms, automation of provisioning, quality controls, security auditing and maintenance, and, continuous measurement and improvement of efficiency of operational activities and resources.

Your qualifications and skills
  • Proficiency in one or more programming languages, such as Python, JavaScript, Golang, or Ruby.
  • Hands-on experience implementing infrastructure as code using Terraform, or similar automation tools like Ansible and CloudFormation.
  • Experience designing and building CI/CD pipelines using tools like GitHub Actions, ArgoCD, CircleCI, or Jenkins along with package management tools like Jfrog or Nexus.
  • Experience with public cloud environments, specifically AWS and either Azure or Google Cloud Platform (GCP).
  • Expertise with container technologies and orchestration tools, including Docker, Kubernetes, Helm, and service mesh solutions such as Linkerd or Istio.
  • Experience with infrastructure and reliability testing frameworks such as Test-Kitchen, AWSpec and InSpec.
  • Experience in managing front-end and back-end workloads such as React, TypeScript, Python, Node.js, Nginx, and API management tools like Apigee and AWS API Gateway.
  • Proficiency with databases such as Neo4j, Redis, PostgreSQL, and MongoDB.
  • Familiarity with monitoring and logging tools such as Dynatrace, Splunk, CloudWatch, and other similar platforms like ELK, Prometheus, or Grafana.
  • Expertise in networking concepts, including prior experience managing CDN+WAF configurations in Akamai, Cloudflare, AWS CloudFront, and experience with VPCs, Load Balancers, and SSH tunnels.
  • Identity & Access Management: Experience with Okta, Azure AD, Ping Identity, and other OIDC/OAuth2 providers. - Access Control: Implementing and managing RBAC for least-privilege access. - Secrets Management: Proficiency with HashiCorp Vault for managing secrets and implementing token rotation. - Compliance and Vulnerability Management: Experience with SOC 2 audits, vulnerability management, and SSL certificate management.
  • Strong skills in developing technical documentation such as architecture diagrams, runbooks, and technical documents, with experience in complex platform migrations and managing multiple workstreams.