Logo
Dunhill Professional Search & Government Solutions

Site Reliability Engineer - Remote

Dunhill Professional Search & Government Solutions, Charlotte, NC, United States


The Site Reliability Engineer will be joining a team responsible for developing and maintaining tools, alerts, and dashboards to support the Technical Operations team in monitoring application health and performance. The engineer should be familiar with should be familiar with monitoring tools such as Splunk, AppDynamics, Dynatrace, Cloudwatch or other similar tools. The engineer will be responsible for implementing improvements to processes to improve site reliability and incident response.

  • Provide analysis of application performance and user behavior to support design, architecture and operations decisions.
  • Create and maintain alerts, dashboards and reports using Dynatrace, Splunk, AWS Cloudwatch and other monitoring tools.
  • Collaborate with Technical Operations, Technical Architecture and Development teams to develop improved logging and monitoring practices.
  • Build, improve and maintain tools to support the Technical Operations Teams.

Minimum Qualifications

  • Bachelor’s Degree in Information Technology, Computer Science or a related field or equivalent relevant experience.
  • 4-6 years of experience in information technology, systems administration or other IT related field.

Other Job Specific Skills

  • Bash, python or other scripting languages.
  • Familiarity with Operations Monitoring tools such as Dynatrace, Splunk, AppDynamics or AWS Cloudwatch.
  • Understanding of web site architecture and performance.
  • Knowledge of Agile Framework.
  • AWS Certification is a plus.
  • Exceptional customer service skills.