Roush Enterprises
HPC System Administrator I
Roush Enterprises, Troy, Michigan, United States, 48083
At Roush, we fuse technology and engineering to provide product development solutions to customers in a diverse range of industries. Widely recognized for providing engineering, testing, prototype, and manufacturing services to the transportation industry, Roush also provides significant support to the aerospace, defense and theme park industries. With over 2,400 employees in facilities throughout the United States, Europe, Asia, and South America, our unique combination of creativity and tenacity activates big ideas on a global stage. We want motivated, ambitious people who put the needs of our customers first, bring creativity to their work and will do whatever it takes to achieve success. If you share our passion for providing innovative solutions to complex challenges, we want you on our team. At Roush, we work alongside the best and brightest to do incredibly cool things you wouldn't believe. At Roush, you are part of building the future. The HPC System Administrator I will be responsible for day-to-day operational support of the Roush CAE HPC and VDI hardware and software infrastructure. Day to Day operations include supporting end-users with issues, driving root cause analysis and design task automations. This role will work cross functionally on various project teams and operations based on the direction of HPC System lead engineer. This role will also be involved in developing tools and scripts for simulation tests and optimization of simulation jobs and document all the changes. This position is located in Troy, MI. Responsibilities: Responsible for the day-to-day operational support of the Roush CAE HPC Clusters, VDI and backup servers: manage and solve any hardware and software issues that may arise. (Systems Administration) Assist in hardware and software upgrade programs to implement new technologies. They will include developing cluster tools or solutions, automation of deployments, HPC job optimization, pre/post processing workflows, alerts, usage and performance metrics. Write Help documents for users, develop functional and technical designs for automated tools that can assist users with HPC job optimization following the Roush CAE HPC change management guidelines. []{styl""}