Logo
Iris Software Inc.

Senior Site Reliability Engineer

Iris Software Inc., Chicago, Illinois, United States


Greetings One of our direct client (Logistics) is looking to hire Sr. SRE Engineer in Naperville IL (Hybrid – 3-4 days onsite per week ). Please find below job description. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications. You will work closely with cross-functional teams to design, implement, and maintain robust infrastructure and automation solutions. Key Skills: – GCP OR Oracle Cloud Infrastructure ( OCI ), CI/CD, Oracle Database, Scripting, infrastructure as code (IaC) tools, such as Terraform or Puppet, Docker and Kubernetes. Key Responsibilities: Design, build, and maintain scalable and reliable infrastructure solutions. Implement automation tools and processes to streamline operations and improve efficiency. Monitor system performance and troubleshoot issues to ensure high availability and reliability. Collaborate with development teams to design and deploy applications in production environments. Conduct root cause analysis (RCA) and implement preventive measures to minimize downtime and outages. Develop and maintain documentation, runbooks, and playbooks for operational processes. Participate in on-call rotations and provide timely response to incidents and emergencies. Implement best practices for security, compliance, and disaster recovery. Continuously evaluate and improve system performance, reliability, and scalability. Skills and Qualifications: Bachelor's degree in Computer Science, Engineering, or related field. Proven experience as a Site Reliability Engineer or similar role. Strong knowledge of OCI cloud platforms, Oracle database and must have held SRE role for over 10 years minimum Experience with infrastructure as code (IaC) tools, such as Terraform or Puppet. Any scripting and programming languages knowledge - such as Python, Go, or Bash. Hands-on experience with monitoring and observability tools, such as NewRelic, Grafana, or Kibana. Solid understanding of containerization technologies, such as Docker and Kubernetes. Excellent troubleshooting and problem-solving skills. Strong communication and collaboration skills. Ability to work effectively in a fast-paced and dynamic environment. Best Regards,