Advanced Micro Devices, Inc.
Senior Cloud Administrator
Advanced Micro Devices, Inc., San Jose, California, United States, 95199
Overview:
WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences: the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

Responsibilities:

THE ROLE:
The Cloud Administrator will be responsible for providing technical support to Engineering and Corporate organizations at AMD. This position will be required to support AMD's global cloud infrastructure in a dynamic, fast-paced environment. Furthermore, this person will collaborate globally on various IT activities related to the AMD Engineering AI/GPU - Compute Environment, in accordance with AMD Worldwide IT strategies and objectives.

THE PERSON:
You're a highly motivated team player with a strong development background, a problem-solving mentality, excellent communication skills, the ability to prioritize tasks, and a willingness to learn and adapt. You have excellent teamwork skills and are capable of working independently.

KEY RESPONSIBILITIES:
- Design, develop, deploy, monitor, maintain, and evolve cloud-native resources, tools, services, reusable modules (infrastructure-as-code practices), and frameworks to secure and automate provisioning of cloud infrastructure that empowers our users across Azure, AWS, and GCP.
- Provide customers with standards and best practices on how to deploy and consume cloud-based services.
- Proactively seek opportunities to improve the operational efficiency of teams and the usage of cloud services.
- Contribute to a strong team culture and an atmosphere of cross-functional teamwork.
- Work with internal customers in managing incident tickets to achieve operational excellence.
- Work with the global team to provide support and complete IT projects.
- Create secure hybrid deployments of virtual machines and PaaS solutions in Azure, AWS, and GCP.
- Work with project teams to understand and accommodate application architecture and each application's specific requirements for Azure, AWS, and GCP.
- Collaborate with other engineers and stakeholders to share knowledge and build expertise in IaaS, PaaS, and SaaS deployment.
- Collaborate with onshore and offshore resources.
- Implement and automate security controls, governance processes, and compliance validation by closely partnering with the Security Team to incorporate its requirements and best practices and keep our cloud environment safe and secure.
- Apply experience in migrating on-premises applications and workloads to Azure, AWS, and GCP using cloud technologies, and provide support.
- Drive identity (IAM), access, and configuration management for cloud-native tools.
- Own the recovery and continuity process for cloud environments.

PREFERRED EXPERIENCE:
General Cloud Systems Engineer experience with the fundamentals of various CSPs, including:
- Experience in Azure, AWS, and GCP.
- Terraform, YAML, Jenkins, GitHub Actions, HashiCorp, CI/CD buildout.
- Python, Golang, Shell, Java/J2EE, NodeJS, ReactJS, HTML5, PyTorch, TensorFlow.
- REST API, GraphQL, Design Patterns, NoSQL, RDBMS, Elasticsearch, Redis Cache.
- Able to build and support a full CI/CD pipeline for consistent code deployment.
- Preferred: understanding of AI frameworks, with the ability to model large datasets and build and test AI software to ensure the desired model performance.
- Preferred: experience in developing and implementing machine learning models and algorithms to solve complex business problems and enhance AI-driven applications.
- Managing GPU clusters and optimizing GPU-based services, tools, and software.
- Experience with container technologies (GKE, EKS, ECS, Docker, Kubernetes) is desirable.
- Understanding of change management and release processes.
- Strong analytical and problem-solving skills.
- Strong understanding of Agile/Scrum methodologies.
- Strong written and verbal communication skills. Ability to effectively communicate technical issues and solutions to peers and external vendors.
- Strong active listening and consensus-building skills, and a passion for learning and sharing knowledge with others.
- Infrastructure automation tools such as Ansible, Terraform, CloudFormation, Deployment Manager, and Resource Manager.
- Designing, developing, and implementing solutions that improve efficiency and reduce costs through Kubernetes/containers, virtualization, functions, and automation.
- Building and managing complex cloud environments in Azure, AWS, and GCP, including security measures for encryption, authorization, and protocols.
- Monitoring system performance, conducting capacity planning, identifying trends, and providing recommendations to improve service levels via automation.
- Working closely with software development teams to troubleshoot and resolve issues.
- Understanding of cloud networking (VPCs), load balancers, WAFs, and CDNs.
- Experience deploying, managing, administering, and migrating infrastructure platforms in a hybrid environment.
- Strong understanding of the different deployment resource types and when to deploy each type (IaaS, PaaS, SaaS).
- Knowledge of HPC environments, including cloud providers like Microsoft Azure, Google Cloud, and AWS Partners.
- Experience with cloud-native monitoring tools as well as Nagios, ELK stack, and Kibana/Prometheus.
- Proactive and empathetic mindset - you love to roll up your sleeves to fix problems for our customers.
- Skilled in working effectively with senior-level management.
- Ability to juggle multiple projects and priorities and re-prioritize as necessary to align with current business needs.
- Strong organizational ability.

ACADEMIC CREDENTIALS:
Bachelor's degree in Computer Science, Engineering, or a related field.

LOCATION:
San Jose, CA

#LI-MF2 #LI-HYBRID

Qualifications:

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role, such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.