Logo
National Black MBA Association

Site Reliability Engineer

National Black MBA Association, Atlanta, Georgia, United States, 30383


You know the moment. It’s the first notes of that song you love, the intro to your favorite movie, or simply the sound of someone you love saying “hello.” It’s in these moments that sound matters most.At Bose, we believe sound is the most powerful force on earth. We’ve dedicated ourselves to improving it for nearly 60 years. And we’re passionate down to our bones about making whatever you’re listening to a little more magical.The Information Technology team at Bose exists to deliver valuable and reliable business and technology solutions with an innovative, engaged, and collaborative team focused on contributing to our corporate vision.Job Description

Specific Responsibilities:Design, implement, and manage systems to ensure high availability and performance of production services.Develop and maintain monitoring, alerting, and logging systems to proactively identify and address issues.Create and enforce Service Level Objectives (SLOs), Service Level Agreements (SLAs), and Key Performance Indicators (KPIs).Lead the response to production incidents, including troubleshooting, resolution, and post-incident analysis.Develop and maintain incident response procedures and runbooks.Conduct root cause analysis and implement corrective actions to prevent recurrence.Automate repetitive tasks and processes to improve efficiency and reduce human error.Develop and maintain tools for deployment, configuration management, and system monitoring.Collaborate with development teams to integrate automation into the software delivery pipeline.Perform capacity planning to ensure systems can handle current and future workloads.Design and implement scaling strategies to accommodate changes in demand.Monitor resource utilization and optimize infrastructure to achieve cost efficiency.Collaborate with development teams to design scalable and reliable system architectures.Participate in architectural reviews and provide guidance on reliability and performance considerations.Evaluate and recommend new technologies and approaches to enhance system reliability and performance.Document system configurations, processes, and procedures.Create and maintain operational runbooks and knowledge base articles.Provide training and mentorship to team members and other stakeholders on reliability best practices.Work closely with software engineers, operations teams, R&D, automotive, and other stakeholders to ensure smooth deployment and operation of services.Communicate effectively about system status, incident responses, and reliability improvements.Participate in on-call rotations and be available to respond to incidents as needed.Required Competencies:Proficiency in scripting and programming languages (e.g., Python, Go, JSON, Java).Experience with monitoring and observability tools (e.g., Logic Monitor, Prometheus, New Relic, Grafana, Datadog) preferred.Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes).Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud).Experience with configuration management and infrastructure-as-code tools (e.g., Terraform, Ansible) preferred.Excellent problem-solving and analytical skills.Strong communication and collaboration abilities.Experience Requirements:Experience: 3+ years of experience in a similar role, with a strong background in systems engineering, software development, or operations.Education/Certification Requirements:Education: Bachelor’s degree in Computer Science, Information Technology, or a related field. Advanced degree or relevant certifications (e.g., AWS Certified DevOps Engineer, Google Professional DevOps Engineer) preferred.Bose is an equal opportunity employer that is committed to inclusion and diversity. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, veteran status, or any other legally protected characteristics.

#J-18808-Ljbffr