Logo
Rutgers University

Systems Administrator III

Rutgers University, New Brunswick, New Jersey, us, 08933


Position Summary:

Rutgers, the State University of New Jersey is seeking a Systems Administrator III for the Office of Advanced Research Computing (OARC). OARC is a large and diverse team working to create an outstanding environment for research computing at Rutgers. A key part of OARC's responsibility to the University is to ensure that we are seeking and supporting the best solutions for constantly evolving computational research challenges. Doing that effectively requires us to carefully address the needs of the community we serve and to ensure that our support networks are diverse, inclusive, and equitable. Diversity is welcome and encouraged here because it fuels innovation and ensures exploration of the broadest range of possible solutions to research problems. Inclusivity and respect for all points of view empower the decision making and policy development that are the framework of a productive research support organization. Prioritizing equity ensures access to resources needed to support the ideas, initiatives, and efforts of all contributors to our research support operations. Above all, we value our team members and their unique capabilities, interests, and experiences that, in concert, form OARC's mission and vision.

Key Duties:

Supports the university's Advanced Research Computing (ARC) infrastructure, including High Performance Computing (HPC), High-Throughput Computing (HTC), and Data-Intensive Computing environments.

Manages High Performance Computing services in a multiuser environment.

Helps build relationships in Rutgers' research community and gathering requirements.

Manages end-user accounts and usage utilizing queuing software (schedulers) and other tools.

Provides system services and analyze system performance for stakeholders, internal teams and intended end users.

Assists with all activities necessary to activate new operating systems or new releases of existing systems, including analysis, design, implementation, and related documentation.

Assists with all activities necessary to expand an existing operating system or significant expansion of an existing system, including analysis, design, implementation, and related documentation.

Assists analyzing systems performance and modify programs to increase the efficiency of operation.

Assists with reinstatement of integrity of system as quickly as possible following an outage in order to minimize work and data loss.

Participates in on-call rotation.

Works with the rest of the OARC team in the development of training materials and user education for internal and external use as needed.

Understands and adheres to Rutgers' compliance standards as they appear in RBHS's Corporate Compliance Policy, Code of Conduct and Conflict of Interest Policy.

Performs other related duties as assigned.

Conceives, develops, optimizes, integrates, and maintains HPC systems, technical operation and continued development of HPC, on-site cloud infrastructure and storage services.

Provides hardware, software, and end-user administration and support to a diverse group of end users that need access to ARC resources.

Operates as a member of the ARC team with focus on one of the University's campuses.

Minimum Education and Experience:

Bachelor's degree required, preferably in computer science or engineering.

Equivalent education, experience and/or training may be substituted for the degree requirements.

A minimum of four years of relevant experience, which includes three (3) years' of experience in the following:

Familiarity with Linux configuration management including Ansible and Puppet.

Familiarity with common virtualization and cloud technologies in a Linux command-line context.

Management and use of monitoring tools and frameworks using (but not limited to) Nagios, Ganglia, Elasticsearch.

Familiarity with traditional 2-layer/3-layer networking management.

Familiarity with supporting research groups using HPC or similar systems.

Linux support of clusters and infrastructure including troubleshooting, maintenance, management, and design.

Onsite hardware support of hardware used in Linux clusters.

City:

Newark

State:

NJ

Physical Demands and Work Environment:

Standing, sitting, walking, talking or hearing.

Visual acuity to perform activities such as: viewing a computer terminal, reading, analyzing written information/data, etc.

Ability to perform physical labor without restrictions, including lifting equipment up to fifty (50) pounds.

Office environment.

Moderate Noise.

Posting Number:

23ST3203

#J-18808-Ljbffr