Logo
Diverse Lynx

Sr Cloud DevOps Engineer

Diverse Lynx, Helm, California, us, 93627


What are the top 3 skills required for this role?

1. Kubernetes and Helm Administration

2. At least one public cloud infrastructure (AWS/Azure/OCI) 3. Devops

Job Description/ Responsibilities

Primary Responsibilities:• Understand the product inside-out and how this product can solve customer challenges. Be proficient in the product capabilities and position as a subject matter expert.• Understand the product suite and ecosystem, such as related products in the suite and other integrated applications.• Work on multi-cloud environments like AWS, Azure, Oracle Cloud Infrastructure, and GCP using the native Kubernetes platform (like EKS, AKS, GKE, and OKE in respective cloud environments).• Understand customers' requirements and challenges, customize, and configure the product to meet the customers' needs.• Analyze the performance requirements and perform the sizing exercise to recommend appropriate compute and infra specifications to the customer. As necessary, assist the customer with the Kubernetes cluster setup.• Deploy the application and its components in the Kubernetes platform in the execution platform of the customer's infrastructure (like AWS, Azure, OCI, GCP, on-prem, or custom private cloud infra).• Perform the functional quality assurance and the performance assessment and optimize the solution to meet the performance characteristics of the customer requirements.• Implement Application Performance Monitoring using the tools of customers' choice (like Datadog, Dynatrace, AppDynamics, New Relic, Prometheus, etc.) and set up appropriate monitoring to ensure the products' responsiveness, security, resilience, and efficiency.• Provide ongoing support should the customer require assistance, the monitors go off, or the customer report any concerns/challenges. This role is expected to be on-call periodically to provide operational support post-implementation.

The ideal candidate Is:• Kubernetes certified professional or an expert administrator of Kubernetes and Helm • A self-learner, self-driven, and able to operate with minimal supervision.• Able to demonstrate expertise in at least one public cloud infrastructure (AWS/Azure/OCI).• Be proficient in APM (Application Performance Monitoring) tools like Datadog APM, Dynatrace, AppDynamics, etc.• Able to successfully communicate with business partners, management, and technical team members.• Experienced SRE with development or DevOps background, worked on enterprise-scale applications.• Proficient user of Monitoring and alerting tools. Proactive in raising problems and identifying solutions.• AWS SysOps Associate or DevOps professional certified (or equivalent in other cloud service providers).• Strong sense of customer service. Able to work in a highly collaborative team setting. Approaching work with a DevOps and continuous improvement mindset

Minimum Qualifications:• Bachelor's degree• Minimum of 5 years of experience in enterprise-level DevOps role. (Minimum 3 years with Cloud AWS/Azure and 2 years with Kubernetes Administration) • Expertise in Kubernetes administration/development, hands-on experience in Helm • Strong knowledge of infrastructure components (e.g., routers, load balancers, cloud products, container systems, compute, storage, and networks) • Expertise is required in observability and monitoring tools like Dynatrace, Datadog, AppDynamics, Splunk, etc.• A deep understanding of Application performance monitoring (APM) and user monitoring is essential.• Sound knowledge of ITSM process, SI/SLO/SLA management, incident resolution, and automation techniques • Strong IP networking fundamentals and experience with usage of standard application protocols and messages (e.g., TCP/IP, HTTP, SOAP, RESTful APIs, XML/JSON, JDBC, JMS/MQ) • Knowledge of Infrastructure as Code (IaC): Ansible, AWS Cloud Formation, etc., is preferable.• Apply standards of cloud compliance to application design to achieve reliability.• Able to analyze application and server logs and error interpretation.• Ability to code in one of the programming languages (Java, Python, Shell, etc.) • Experience in site reliability engineering in Java, Kubernetes, and Database platforms (like Postgres) • The candidate should possess excellent written and verbal communication and collaboration skills.

Years of Experience: 15.00 Years of Experience

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.