Logo
Ernst & Young Advisory Services Sdn Bhd

TTT | Devops SRE - EY Global Delivery Service

Ernst & Young Advisory Services Sdn Bhd, Little Rock, Arkansas, United States,


Location: CABA

Other locations: Primary Location Only

Date: Sep 20, 2024

Requisition ID: 1531520

The EY Foundation teams develop systems and infrastructure for the Reporting & Analysis Platform for Tax and Other Regulations (RAPToR). Our work supports EY's software developers in creating key products for clients.We seek dedicated Site Reliability Engineers (SREs) to maintain our high service standards. Our services are designed for global scalability, continuous availability, and seamless operation.The SRE role involves managing and improving our Azure Cloud infrastructure, ensuring our applications are reliable, efficient, and scalable. Key responsibilities include system monitoring, issue resolution, process automation, and collaborating with development teams on cloud operations best practices. Proficiency in Azure, Infrastructure as Code (IaC), CI/CD pipelines, and cloud security is essential.This role is ideal for those passionate about building and managing systems that benefit thousands of customers. Join us to contribute to reliable, high-performing services.Key Qualifications

Azure Cloud Expertise:

Extensive knowledge of Azure services, including Azure Virtual Machines, Azure App Services, Azure Kubernetes Service (AKS), Azure SQL Database, and Azure Storage. Skilled in designing and managing scalable, reliable, and secure cloud infrastructure.

Infrastructure as Code (IaC):

Proficient in automating the deployment and management of Azure resources using Azure Resource Manager (ARM) templates, Terraform, and Azure Bicep.

CI/CD Pipelines:

Strong experience in building and managing CI/CD pipelines with Azure DevOps, GitHub Actions, or Jenkins.

Monitoring and Observability:

Skilled in using Azure Monitor, Application Insights, and Log Analytics to monitor application performance, identify issues, and ensure system reliability.

Automation and Scripting:

Proficiency in scripting languages such as PowerShell, Python, or Bash to automate operational tasks and enhance system efficiency.

Security and Compliance:

In-depth understanding of Azure Security best practices, including Identity and Access Management (IAM), Azure Policy, and Azure Security Center.

Disaster Recovery and Backup:

Experience in designing and implementing backup, disaster recovery, and business continuity plans using Azure Backup, Azure Site Recovery, and other relevant services.

Collaboration and Communication:

Ability to collaborate closely with development teams, architects, and stakeholders to integrate DevOps practices.

Software Development:

Proficient in designing, authoring, and releasing code in .NET and C#.

Problem-Solving:

Excellent troubleshooting and problem-solving skills.

Additional Skills:

Experience with scale testing, disaster recovery, and capacity planning.

DescriptionEY's RAPToR platform is a distributed, cloud-based ecosystem operating on Microsoft Azure, catering to a diverse client base across multiple regions. As a Site Reliability Engineer (SRE) at EY, you will address challenges through analytical problem-solving, collaborative efforts, and technical acumen. SREs oversee the RAPToR platform's production stack, encompassing application functionality and infrastructure resilience.

The RAPToR platform is built on a microservices architecture and utilizes a combination of open-source, proprietary, and custom-developed tools for provisioning, deployment, logging, and monitoring. In this role, you will master these technologies and drive enhancements.

Education & ExperienceBS/MS in Computer Science or equivalent (software development or production operations experience in a large-scale environment).

Additional RequirementsWillingness to participate in on-call rotation.

#J-18808-Ljbffr