Camping World
Lead Platform Engineer (Azure Kubernetes Services)
Camping World, Lincolnshire, Illinois, United States, 60069
Camping World is seeking a seasoned Lead Platform Engineer to drive the maintenance and expansion of our Azure Kubernetes Services (AKS) environment. This critical role is central to our IT services and cloud platform strategy, overseeing the architecture, design, development, implementation, and on-call production support of our 18-component Kubernetes ecosystem and all associated CI/CD pipeline services. In this position, you will collaborate closely with digital product, software development, infrastructure, and operations teams to enhance the Developer Experience. You’ll lead the utilization of CI/CD tools, including GitHub Actions and Flux CD, and leverage monitoring tools such as Grafana to ensure optimal performance of our applications and API services. As a thought leader in Kubernetes, you will play a key role in shaping and executing our Kubernetes platform strategy, managing technical debt efficiently, and ensuring a robust, scalable, and secure platform. Additionally, as a member of the Enterprise Architecture team, you will leverage your deep expertise in Application and Cloud Platform Engineering to help drive Camping World's growth and long-term success.
What You'll Do:
Architect, design, and implement Kubernetes clusters on Azure Kubernetes Service (AKS), ensuring high availability, scalability, and reliability.
Develop, manage, and support Infrastructure as Code (IaC) components, leveraging Terraform to deploy and maintain primary and supporting infrastructures.
Design, implement, and maintain CI/CD pipelines for Kubernetes deployments, utilizing GitHub Actions and Flux CD.
Collaborate with development teams by offering guidance throughout the development and deployment phases, reviewing and modifying code within GitHub repositories to ensure smooth integration and fully automated deployment processes.
Provide on-call production support, troubleshoot, and resolve complex issues related to AKS and container orchestration, ensuring quick resolution and minimal downtime.
Optimize cluster performance, scalability, and security to meet evolving requirements and resolve technical challenges.
Monitor and manage Kubernetes resources using observability tools (Grafana, SolarWinds, Dynatrace, Datadog, New Relic, etc.) to proactively identify and resolve issues.
Troubleshoot and address malfunctioning or underperforming applications, ensuring root causes are identified and long-term solutions are implemented.
Serve as a thought leader in Kubernetes, driving the platform strategy, advocating for best practices, and fostering continuous improvement and innovation.
What You'll Need to Have for the Role:
5+ years of hands-on experience in designing, managing, and supporting complex, enterprise-grade Microsoft AKS environments.
Extensive experience with Azure cloud services, including Azure SQL Database, Storage Accounts, and Azure Container Registry.
Strong understanding and hands-on experience with Terraform for automating infrastructure deployment and management.
Deep knowledge of containerization technologies (Docker) and orchestration (Kubernetes), including Helm for managing Kubernetes applications.
Proven experience in designing, implementing, and managing CI/CD pipelines using GitHub Actions and Flux CD.
Proficient in reading, understanding, and modifying code in GitHub, supporting development teams, and ensuring smooth integration with Kubernetes platforms.
Expertise in security best practices within Kubernetes environments, ensuring secure and compliant deployments.
Hands-on experience with monitoring and observability tools, including the Grafana stack (Grafana, Loki, Mimir, Tempo), for creating dashboards and alerts.
Practical experience with Kuma/Kong Mesh service mesh technologies.
Hands-on experience managing Kong API gateways.
Exceptional problem-solving skills and strong communication abilities, capable of leading troubleshooting sessions and guiding cross-functional teams.
Experience in platform architecture (IaaS, PaaS), site reliability engineering (SRE), quality assurance (QA), system design, integrations, and end-to-end implementation.
Experience working with Enterprise Architecture (EA) teams, participating in EA processes, and engaging with Architecture Review Boards (ARB), Change Advisory Boards (CAB), and other governance bodies (GRC).
This position includes on-call rotation, triage, and incident response responsibilities.
This position can be remote, with the expectation of occasional travel to our Lincolnshire, IL or Chicago, IL offices as business needs dictate.
Why Join Us?
At Camping World, you'll thrive in a dynamic environment where your contributions drive innovation and shape the future of our infrastructure. You'll be part of a collaborative team that values continuous improvement and embraces cutting-edge technologies, offering you the opportunity to make a meaningful impact on our journey toward excellence.
General Compensation Disclosure
Compensation decisions for this role consider several factors, including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. At Camping World, it is not typical for an individual to be hired at or near the top of the range for their role, and compensation decisions depend on the factors stated. A reasonable estimate of the current range is listed below.
Pay Range:
$131,145.00-$196,665.00 Annual
In addition to competitive pay, we offer Paid Time Off, 401(k), an Employee Assistance Program, Good Sam Roadside Assistance, discounts, paid parental leave (if eligibility is met), Tuition Reimbursement (if eligibility is met), and on-the-job training opportunities. Full-time associates are offered a comprehensive benefit package including medical, dental, vision, and more! Part-time associates are offered access to dental and vision coverage! For more information, please visit: www.mycampingworldbenefits.com
We are an equal employment opportunity employer. The Company's policy is not to discriminate against any applicant or employee based on race, color, sex, sexual orientation, gender identity, religion, national origin, age (40 and over), disability, veteran or uniformed service-member status, genetic information, or any other basis protected by applicable federal, state, or local laws.