Logo
TekWissen ®

IT|DevOps Engineering - Lead II - DevOps Engineering

TekWissen ®, Atlanta, Georgia, United States, 30383


Client Job Description:

SRE with Kubernetes, Advanced Python scripting experience.Role:

SREMandatory Skills:

Kubernetes, DevOps, Reliability Engineering, PythonRate:

85Location:

Atlanta / Frisco – RemoteOptional Skills:

AWSAdditional Comments:

Role: Site Reliability EngineerUST’s telecommunications practice is looking for dynamic and driven professionals to join a rapidly growing high-performance team. Our client is a leading provider of digital Global System for Mobile Communications/wireless voice and data technology standards.Position Duties and Responsibilities:Provide consulting services for improved system stability, availability, performance, and reliability.Assist in determining the impact of operational issues and provide input into their resolution via data extraction and quantification.Work through day-to-day support issues, ensuring effective and timely resolution of issues in the production environment, troubleshoot customer impacting issues.Forecast and plan for a rapidly growing environment.Support multiple applications, specifically running in Kubernetes/Java-based systems in an enterprise environment.Apply monitoring and create complex alerts and dashboards for production systems using Grafana, Prometheus.Provide capacity analysis, tuning analysis for Cloud applications in a LINUX and container platform.Available to provide 24X7 on-call support on a rotating basis with other team members.Lead efforts in troubleshooting, recovery, and root cause investigation.Perform analysis of user requirements and problems to automate or improve systems and review system capabilities, workflow, and scheduling limitations.Facilitate DR (Disaster Recovery) exercises to ensure that the team is fully prepared in any event.Lead root cause analysis sessions to understand what causes issues in Production and come up with solutions that will prevent them from happening in the future.Ensure documentation is created and remains updated for any related work.Skill Requirements:Strong experience with infrastructure and support.Strong experience with Linux OS.Strong experience with Kubernetes.Experience with Cloud Native Applications.Experience with REST or SOAP API support.Experience with tools like: Docker, PostMan, SOAP UI, ELK, App Dynamics, CI/CD tools, and GITLab, Prometheus, Grafana.Good experience in performance measures and tuning, capacity planning, and management, contingency and disaster recovery.Strong scripting knowledge and experience, preferably in Python.Good understanding of networking and routing.Job Description: Expectations from this role:Interprets the DevOps Tool/feature/component design to develop/support the same in accordance with specifications.Adapts existing DevOps solutions and creates relevant DevOps solutions for new contexts.Codes, debugs, tests, documents, and communicates DevOps development stages/status of DevOps develop/support issues.Selects appropriate technical options for development such as reusing, improving, or reconfiguration of existing components.Optimizes efficiency, cost, and quality of DevOps process, tools, and technology development.Validates results with user representatives; integrates and commissions the overall solution.Helps Engineers troubleshoot issues that are novel/complex and are not covered by SOPs.Design, install, and troubleshoot CI/CD pipelines and software.Able to automate infrastructure provisioning on cloud/in-premises with the guidance of architects.Provides guidance to DevOps Engineers so that they can support existing components.Good understanding of Agile methodologies and is able to work with diverse teams.Knowledge of more than 1 DevOps toolstack (AWS, Azure, GCP, opensource).Typical Performance Measures:Quality of Deliverables.Error rate/completion rate at various stages of SDLC/PDLC.# of components/reused.# of domain/technology certification/product certification obtained.SLA/KPI for onboarding projects or applications.Stakeholder Management.Percentage achievement of specification/completeness/on-time delivery.Performance Areas:Automated components: Deliver components that automate parts to install components/configure software/tools in on-premises and on cloud.Configured components: Configure tools and automation framework into the overall DevOps design.Scripts: Develop/Support scripts (like Powershell/Shell/Python scripts) that automate installation/configuration/build/deployment tasks.Training/SOPs: Create Training plans/SOPs to help DevOps Engineers with DevOps activities and to onboard users.Measure Process Efficiency/Effectiveness: Deployment frequency, innovation, and technology changes.Operations:Change lead time/volume.Failed deployments.Defect volume and escape rate.Meantime to detection and recovery.Skill Examples:Experience in design, installation, and configuration to troubleshoot CI/CD pipelines and software using Jenkins/Bamboo/Ansible/Puppet/Chef/PowerShell/Docker/Kubernetes.Experience in integrating with code quality/test analysis tools like Sonarqube/Cobertura/Clover.Experience in integrating build/deploy pipelines with test automation tools like Selenium/Junit/NUnit.Experience in scripting skills (Python, Linux/Shell, Perl, Groovy, PowerShell).Experience in infrastructure automation skill (Ansible/Puppet/Chef/PowerShell).Experience in repository management/migration automation – GIT, BitBucket, GitHub, Clearcase.Experience in build automation scripts – Maven, Ant.Experience in artifact repository management – Nexus/Artifactory.Experience in dashboard management & automation - ELK/Splunk.Experience in configuration of cloud infrastructure (AWS, Azure, Google).Experience in migration of applications from on-premises to cloud infrastructures.Experience in working on Azure DevOps, ARM (Azure Resource Manager), & DSC (Desired State Configuration) & strong debugging skill in C#, C Sharp, and Dotnet.Setting and managing Jira projects and Git/Bitbucket repositories.Skilled in containerization tools like Docker & Kubernetes.Knowledge Examples:Knowledge of installation/config/build/deploy processes and tools.Knowledge of IAAS - Cloud providers (AWS, Azure, Google etc.) and their tool sets.Knowledge of the application development lifecycle.Knowledge of Quality Assurance processes.Knowledge of Quality Automation processes and tools.Knowledge of multiple tool stacks, not just one.Knowledge of build and release, branching/merging.Knowledge about containerization.Knowledge of Agile methodologies.Knowledge of software security compliance (GDPR/OWASP) and tools (Blackduck/veracode/checkmarx).Additional Skills:

Kubernetes, DevOps, Reliability Engineering, Python.

#J-18808-Ljbffr