Extreme Reach
Cloud Ops Engineer
Extreme Reach, Toronto, ON
XR is a global technology platform powering the creative economy. Its unified platform moves creative and productions forward, simplifying the fragmentation and delivering global insights that drive increased business value. XR operates in 130 countries and 45 languages, serving the top global advertisers and enabling $150 billion in video ad spend around the world. More than half a billion creative brand assets are managed in XR's enterprise platform.
Above all, we are a supportive and collaborative culture dedicated to DEI. We are caring, dedicated, positive, genuine, trustworthy, experienced, passionate and fun people with loyalty to our customers and our fellow teammates. It is our belief that the better we work together to help our clients achieve their goals, the more successful XR will be.
The Opportunity
If building and maintaining cloud scalable systems and solving complex problems is your passion, you are reliable, collegial and thrive in a fast-paced collaborative environment, this is the job for you. The Cloud Operations Engineer will use their expertise to design, develop, and document our cloud infrastructure, and cloud monitoring solutions. This individual will work with team members to gather requirements and deploy cloud technology to help scale our platform used to power the world's video advertising.
Job Responsibilities:
Requirements
Benefits
ER Culture & Why You Will Love Working Here
Above all, we are a supportive and collaborative culture dedicated to DEI. We are caring, dedicated, positive, genuine, trustworthy, experienced, passionate and fun people with loyalty to our customers and our fellow teammates. It is our belief that the better we work together to help our clients achieve their goals, the more successful XR will be.
The Opportunity
If building and maintaining cloud scalable systems and solving complex problems is your passion, you are reliable, collegial and thrive in a fast-paced collaborative environment, this is the job for you. The Cloud Operations Engineer will use their expertise to design, develop, and document our cloud infrastructure, and cloud monitoring solutions. This individual will work with team members to gather requirements and deploy cloud technology to help scale our platform used to power the world's video advertising.
Job Responsibilities:
- Automate manual ops tasks to streamline processes and reduce manual effort.
- Identify areas where systems can be improved to increase system reliability and reduce system incidents.
- Manage and support proactive monitoring solutions across the Production environment
- Look for trends and themes in issues reported in Live Applications and facilitate investigations by Developers to avoid repeated occurrences
- Perform actions on the Product codebase (backend/frontend) for real-time diagnosis of major incidents in Live systems
- Analyze and diagnose 'difficult' or tricky to reproduce problems
- Perform analysis and reporting on frequently occurring Live problems
- Assist Developers who are fixing bugs to understand the detail and user scenarios around reported bugs to accelerate triage and fixing
- Serve in IT Tier 3 support of Extreme Reach Production infrastructure, part of on-call rotations supporting infrastructure and services 24x7
- Creates and manages Automated Infrastructure solutions. Automated builds and configuration management.
- Understands the fundamentals of large scale on-prem and cloud mission critical systems; networking, security, redundancy, scalability, monitoring, & performance KPIs.
- Fast, adaptable, with a proven ability to integrate and exploit new technologies, PAAS offerings, and API's
Requirements
- 5+ years of hands-on experience with DevOps and tools including GIT, CI / CD environments; designing/writing/delivering system automation
- AWS Professional certification preferred
- Strong PowerShell experience, as well as other programming or scripting languages Python, Bash/Shell, Java, JavaScript and/or node.js.
- Experience working with Jenkins, Ansible and Terraform
- Ability to think holistically, putting the customer first for a given project or problem.
- Creative, resourceful, problem solver with an aptitude for systems thinking.
- Strong written and oral communication skills including the ability to communicate complex issues to technical and non-technical staff and management.
- Hands-on experience of AWS preferably in a large-scale enterprise system
- Understanding of Docker & Kubernetes and Container technology
- Knowledge of Monitoring and alerting tools such as Grafana and DataDog
- Understanding of general security architecture and design.
- Understanding of source control and change management.
- Ability to create and maintain technical references for team members through either Api integrations or static intranet articles including diagrams, spreadsheets, and checklists
- Ability to prioritize and multitask in a fast-paced environment
Benefits
ER Culture & Why You Will Love Working Here
- XR has 23 offices worldwide and teams spread throughout the US, EMEA and APAC, our multicultural teams work cross-departmentally and across continents and cultures towards a shared goal
- It is our belief that the better we work together to help our clients achieve their goals, the more successful XR will be
- Our leadership is provided a great deal of autonomy and freedom in their individual roles, they are encouraged to be self starters and to continuously develop their skills
- Feedback from internal Employee Engagement Surveys cites the People, Teamwork and Flexibility as the most rewarding aspects of working at XR.
- We are a supportive and collaborative culture that values multiple perspectives, fresh thinking and is dedicated to DEI
- XR celebrates diversity of ideas, people and experiences
- Generous PTO, flexible work schedules and hybrid working arrangements create a rewarding work-life balance