hp

SRE - CloudPlatform

hp, Spring, Texas, US, 77391


Description

Job Summary
The SRE - DevOps will be responsible for the reliability, scalability, and automation of the Gen AI Platform. The role will work across AWS, Azure, and GCP to ensure seamless deployment, operation, and monitoring of the platform.

This role is responsible for supporting a team of skilled DevOps engineers and collaborating closely with development, operations, and other cross-functional teams to drive the implementation and enhancement of DevOps practices throughout the software development lifecycle. The role creates real-time monitoring, alerting, response, and fault analysis to achieve the organization's solution and business performance requirements and ensures deployments meet needed security and compliance requirements. The role maintains clear and effective communication with stakeholders, including executives, users, and other departments, to provide updates, address concerns, and manage expectations.

Responsibilities
Maintains and improves continuous integration systems for efficient development and release processes.
Designs and implements secure automation solutions for development, testing, and production environments.
Develops complex solutions for operational administration, system/data backup, disaster recovery, and security/performance monitoring.
Designs and engineers the DevOps CI/CD tooling and platform roadmap, taking ownership of these platforms' lifecycle.
Collaborates with security teams to ensure best practices for application and infrastructure security, including vulnerability scanning, access controls, and compliance with relevant regulations.
Takes on a leadership role in a team that implements DevOps infrastructure projects.
Manages the organization's continuous integration and delivery pipeline to maximize efficiency.
Implements industry best practices for system hardening and configuration management.
Seeks understanding of customer/end-user requirements and project KPIs.
Demonstrates experience with large-scale, distributed systems design and architectural decisions.

Education & Experience Recommended
Four-year or Graduate Degree in Computer Science, Information Systems, or any other related discipline or commensurate work experience or demonstrated competence.
Typically has 7-10 years of work experience, preferably in software development, information technology, engineering environment, or a related field.

Preferred Experience
Prior experience in supporting AI or machine learning platforms.
Certifications in AWS, Azure, or GCP.
Experience with monitoring and logging tools like Grafana, Kibana, or Splunk.
Background in software development or system administration.

Qualification
Proficiency in cloud services across AWS, Azure, and GCP.
Strong experience with infrastructure as code tools such as Terraform or CloudFormation.
Expertise in CI/CD tools like Jenkins, GitHub Actions, or Azure DevOps.
Knowledge of containerization and orchestration technologies, including Docker and Kubernetes.
Familiarity with scripting languages for automation, such as Bash or Python.
Understanding of network and security principles in a cloud environment.
Excellent problem-solving skills and the ability to work in a fast-paced, evolving environment.

Cross-Org Skills
Effective Communication
Results Orientation
Learning Agility
Digital Fluency
Customer Centricity

Impact & Scope
Impacts function and leads and/or provides expertise to functional project teams and may participate in cross-functional initiatives.

Complexity
Works on complex problems where analysis of situations or data requires an in-depth evaluation of multiple factors.

Disclaimer
This job description describes the general nature and level of work performed in this role. It is not intended to be an exhaustive list of all duties, skills, responsibilities, knowledge, etc. These may be subject to change and additional functions may be assigned as needed by management.

Job - Software
Schedule - Full time
Shift - No shift premium (United States of America)
Travel - No
Relocation - No
