Vivint
Staff SRE (Site Reliability Engineer)
Vivint, Lehi, Utah, United States, 84043
Job Description
Welcome to the intersection of energy and home services. At NRG, we're driven by the idea of a smarter, cleaner, more connected future-and the possibilities that will bring to the world and to the 7.3 million customers we serve.Vivint Smart Home, an NRG-owned company, is a leading smart home company in the United States, dedicated to redefining the home experience with intelligent products and services. We find purpose in proactively protecting and keeping our customers connected to home, no matter where they are. Join the Smart Home team to create smarter, safer and more sustainable homes. More information is available at www.nrg.com or www.vivint.com.Primary Responsibilities:Design, implement, improve, and maintain infrastructure for containerized, microservice, and virtualization environmentsTroubleshoot and debug issues with a focus on resolving problems quickly with minimal impact on customers and developersManage processes, systems, and infrastructure, leveraging best practices and securityMonitor and manage service reliability, availability, and performanceSupport and troubleshoot Linux and ContainersBolster reliability and performance of customer-facing servicesBuild tools and systems to manage infrastructure and applicationsMeasure and optimize system performance, to push our capabilities forward, get ahead of customer needs, and innovateProvide primary operational support and engineering for multiple large, distributed software applicationsParticipate in on-call rotationRequired Skills, Experience & Education:Thorough experience with containerization and orchestration technologiesIn-depth understanding of hypervisor technology and secure implementationA strong grasp of configuration management and automation toolsExperience with code management and deployment processes (CI/CD), procedures, and toolsAutomation scripting for build and release processesProficiency in reading and authoring code using Bash, Python, or GoStrong experience with Linux operating systems such as Ubuntu and CentOS.Linux performance tuning, specifically kernel and networkingStrong understanding of networking (subnetting, routing, troubleshooting)A pragmatic approach to problem-solvingStrong teamwork and collaborative skillsLimited travel within the continental United States will be expectedBachelor's degree in an IT-related field or equivalent experienceDemonstrated understanding of industry-standard security principles/practices and a willingness to implement and follow themPreferred Skills, Experience & Education:Monitoring & Alerting (Prometheus, Grafana, Alert Manager, etc.)Configuration management (SaltStack, Ansible, etc.)CI/CD Tools (Jenkins, GitLab CI/CD, GitHub Actions, etc.)Scripting/Programming (Bash, Python, Go, Rust, etc.)Linux Networking (NetworkD, OVS, NetPlan, routing, etc.)Messaging Systems (Kafka, RabbitMQ, etc.)NoSQL Databases (MongoDB, etc.)Server Hardware Operations (PXE, Cloud-Init, Kickstart, IPMI, etc.)Cloud Operations (AWS, GCP, etc.)Server Hardware install, maintenance, and troubleshooting experience (HP, Dell similar)Here are some highlighted perks you should ask us about:Free daily lunch and drinks on-sitePaid holidays and flexible paid time awayEmployee/Friends/Family DiscountsOnsite health clinic, gym, gaming tablesMedical/dental/vision/life coverage & 24/7 Medical Hotline401(k) + Employer MatchEmployee Resource GroupsWORKING CONDITIONS:This job operates in a professional office environment. This role routinely uses standard office equipment such as computers, phones, photocopiers, filing cabinets, and fax machines.SAFETY:We enforce a safety culture whereby all employees have the responsibility for continuously developing and maintaining a safe working environment. Each new employee is responsible for completing all training requirements. Additionally, the employee must accept they have responsibility for maintaining the safety of themselves, their co-workers, and the public. Employees must adhere to all written and verbal instructions, promptly report and correct all hazards or unsafe conditions, question non-standard operations or unmitigated hazards, and provide feedback to management on all safety issues.
#J-18808-Ljbffr
Welcome to the intersection of energy and home services. At NRG, we're driven by the idea of a smarter, cleaner, more connected future-and the possibilities that will bring to the world and to the 7.3 million customers we serve.Vivint Smart Home, an NRG-owned company, is a leading smart home company in the United States, dedicated to redefining the home experience with intelligent products and services. We find purpose in proactively protecting and keeping our customers connected to home, no matter where they are. Join the Smart Home team to create smarter, safer and more sustainable homes. More information is available at www.nrg.com or www.vivint.com.Primary Responsibilities:Design, implement, improve, and maintain infrastructure for containerized, microservice, and virtualization environmentsTroubleshoot and debug issues with a focus on resolving problems quickly with minimal impact on customers and developersManage processes, systems, and infrastructure, leveraging best practices and securityMonitor and manage service reliability, availability, and performanceSupport and troubleshoot Linux and ContainersBolster reliability and performance of customer-facing servicesBuild tools and systems to manage infrastructure and applicationsMeasure and optimize system performance, to push our capabilities forward, get ahead of customer needs, and innovateProvide primary operational support and engineering for multiple large, distributed software applicationsParticipate in on-call rotationRequired Skills, Experience & Education:Thorough experience with containerization and orchestration technologiesIn-depth understanding of hypervisor technology and secure implementationA strong grasp of configuration management and automation toolsExperience with code management and deployment processes (CI/CD), procedures, and toolsAutomation scripting for build and release processesProficiency in reading and authoring code using Bash, Python, or GoStrong experience with Linux operating systems such as Ubuntu and CentOS.Linux performance tuning, specifically kernel and networkingStrong understanding of networking (subnetting, routing, troubleshooting)A pragmatic approach to problem-solvingStrong teamwork and collaborative skillsLimited travel within the continental United States will be expectedBachelor's degree in an IT-related field or equivalent experienceDemonstrated understanding of industry-standard security principles/practices and a willingness to implement and follow themPreferred Skills, Experience & Education:Monitoring & Alerting (Prometheus, Grafana, Alert Manager, etc.)Configuration management (SaltStack, Ansible, etc.)CI/CD Tools (Jenkins, GitLab CI/CD, GitHub Actions, etc.)Scripting/Programming (Bash, Python, Go, Rust, etc.)Linux Networking (NetworkD, OVS, NetPlan, routing, etc.)Messaging Systems (Kafka, RabbitMQ, etc.)NoSQL Databases (MongoDB, etc.)Server Hardware Operations (PXE, Cloud-Init, Kickstart, IPMI, etc.)Cloud Operations (AWS, GCP, etc.)Server Hardware install, maintenance, and troubleshooting experience (HP, Dell similar)Here are some highlighted perks you should ask us about:Free daily lunch and drinks on-sitePaid holidays and flexible paid time awayEmployee/Friends/Family DiscountsOnsite health clinic, gym, gaming tablesMedical/dental/vision/life coverage & 24/7 Medical Hotline401(k) + Employer MatchEmployee Resource GroupsWORKING CONDITIONS:This job operates in a professional office environment. This role routinely uses standard office equipment such as computers, phones, photocopiers, filing cabinets, and fax machines.SAFETY:We enforce a safety culture whereby all employees have the responsibility for continuously developing and maintaining a safe working environment. Each new employee is responsible for completing all training requirements. Additionally, the employee must accept they have responsibility for maintaining the safety of themselves, their co-workers, and the public. Employees must adhere to all written and verbal instructions, promptly report and correct all hazards or unsafe conditions, question non-standard operations or unmitigated hazards, and provide feedback to management on all safety issues.
#J-18808-Ljbffr