UKG
Principal Site Reliability Engineer
UKG, Alpharetta, GA
Company Overview With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And we’re only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on. At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR technology built for all. Here, we know that you’re more than your work. That’s why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose — a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If you’re passionate about our purpose — people —then we can’t wait to support whatever gives you purpose. We’re united by purpose, inspired by you. About the Team:Principal Site Reliability Engineers at UKG are critical team members that have a breadth of knowledge encompassing all aspects of service delivery and extensive experience. They are on the forefront of next gen technologies and they develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation.About the Role:Principal Site Reliability Engineers must have a passion for learning and evolving with current technology trends. They strive to innovate and are relentless in their pursuit of a flawless customer experience. They have an “automate everything” mindset, helping us bring value to our customers by deploy services with incredible speed, consistency and availability. • Engage in and improve the lifecycle of services from conception to EOL, including: system design consulting, and capacity planning • Define and implement standards and best practices related to: System Architecture, Service delivery, metrics and the automation of operational tasks • Support SRE team members, services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response • Improve system performance, application delivery and efficiency through, automation, process refinement, postmortem reviews, and in-depth configuration analysis • Collaborate closely with engineering professionals within the organization to deliver reliable services • Increase operational efficiency, effectiveness, and quality of services by treating operational challenges as a software engineering problem (reduce toil) • Guide junior team members and serve as a champion for Site Reliability Engineering • Actively participate in incident response, including on-call responsibilities • Partner with stakeholders to influence and help drive the best possible technical and business outcomesAbout You:Basic Qualifications:• 10+ years of hands-on experience working within Engineering or Cloud • Minimum 5 years' experience with public cloud platforms (e.g. GCP, AWS, Azure) • Minimum 5 years' Experience in configuration and maintenance of applications and/or systems infrastructure for large scale customer facing company • Experience with distributed system design and architecturePreferred Qualifications:• Engineering degree, or a related technical discipline, or equivalent work experience • Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java) • Knowledge of Cloud based applications & Containerization Technologies • Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing • Demonstrable fundamentals in 3 of the following: Computer Science, Cloud Architecture, Security, or Network Design fundamentals • Working experience with industry standards like Terraform, Ansible#LI-HybridWhere we’re going UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet it’s our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow! Equal Opportunity Employer Ultimate Kronos Group is proud to be an equal opportunity employer and is committed to maintaining a diverse and inclusive work environment. All qualified applicants will receive considerations for employment without regard to race, color, religion, sex, age, disability, marital status, familial status, sexual orientation, pregnancy, genetic information, gender identity, gender expression, national origin, ancestry, citizenship status, veteran status, and any other legally protected status under federal, state, or local anti-discrimination laws. View The EEO Know Your Rights poster and its supplement.View the Pay Transparency Nondiscrimination ProvisionUKG participates in E-Verify. View the E-Verify posters here. Disability Accommodation For individuals with disabilities that need additional assistance at any point in the application and interview process, please email UKGCareers@ukg.com. The pay range for this position is $137,900.00 to $198,250.00, however, base pay offered may vary depending on skills, experience, job-related knowledge and location. This position is also eligible for a short-term incentive and a long-term incentive as part of total compensation. Information about UKG’s comprehensive benefits can be reviewed on our careers site at Job ID:PRINC008579Employment Type:RegularWork Style:hybridLocation:Alpharetta,GA,United States, Atlanta,GA,United StatesTravel:25%Role:Principal Site Reliability EngineerDepartment:Software & Product Development