Logo
University of Tennessee

HPC System Administrator - Office of Innovative Technologies

University of Tennessee, Knoxville, Tennessee, United States, 37955


IT Administrator 3, HPC System Administrator The University of Tennessee, Knoxville Office of Information Technology, High Performance and Scientific Computing Group Market Range 13 (anticipated $70,000 to $90,000) Applicants must be legally authorized to work in the United States on a full-time basis without need now or in the future for sponsorship for employment visa status. A cover letter and resume in MS Word or PDF form must be provided as part of the application. The Office of Information Technology, High Performance & Scientific Computing group at The University of Tennessee Knoxville is seeking qualified applicants for a System Administrator position which will perform a key role in evaluating, deploying, maintaining, securing, and operating the research cyberinfrastructure resources used to support the research mission of the University. Under the guidance of the Associate CIO & Director, High Performance and Scientific Computing, this individual will have knowledge of system administration and be part of a team that manages centralized high performance computing and storage resources, a secure research computing environment, and works with faculty and researchers to make effective use of the resources for research. Major Duties/Responsibilities The successful candidate will perform analysis, troubleshooting, and provide problem solving to maintain and administer the research cyberinfrastructure resources used to support the research mission of the University. The responsibilities of the System Administrator includes, but is not limited to: coordinating the configuration and maintenance of the storage resources, including hardware and software; configure and maintain server-class, HPC resources, including compute and storage hardware, OS software, application software, and SLURM resource management system; maintaining an effective security posture including implementing the required security controls specified in the system security plans; diagnosing and resolving hardware, software, networking, and system issues when they arise; monitoring services and responding to service failures; implementing configuration management processes and procedures; providing technical support and user support as needed; documenting processes and procedures to administer and maintain the storage systems, computational systems, and infrastructure; ensuring required backups are performed regularly and successfully; and working with other OIT groups, such as, Networking Services, Help Desk, and Systems to support the continued and efficient operation of the research cyberinfrastructure. Qualifications: Qualifications Required Qualification Applicants must be legally authorized to work in the United States on a full-time basis without need now or in the future for sponsorship for employment visa status. Bachelor's degree in computer science, information technology, engineering field, or natural sciences field Three years performing information technology administration - experience can be substituted for equivalent time pursuing education related to information technology at an institution of higher education Knowledge of system administrator role, duties, and activities for Linux-based information technology system Knowledge of information technology administrator technologies, best practices, and troubleshooting technique Knowledge of installation and use of Red Hat Linux or other Linux operating system Knowledge and proficiency in configuring, operating and managing a large information technology environment Knowledge and skill in operating and maintaining Ethernet based networking equipment and system Skills in writing, presenting, and interpersonal communication - sample letter and/or paper required and providing a presentation will be part of the interview proce Preferred Qualifications: Knowledge and proficiency in high performance computing technology (hardware, software and services) Knowledge of operation and support of Lustre parallel file system technologie Knowledge of hardware, software and services in cloud computing technologie Knowledge of best practices and technologies related to research computing in a University environment Knowledge of installation and use of Windows based operating system Knowledge, skills and abilities in securing information technology systems including certifications, such as, ISC2 and SANS security certification Experience with Icinga monitoring tool Experience with the use of Microsoft productivity tools, such as, Teams, OneDrive, and Sharepoint Experience using Team Dynamics ticket system Experience with Ansible provisioning, configuration management, and application deployment