Rutgers University
Systems Administrator III
Rutgers University, Newark, New Jersey, US, 07175
Job Category:
Staff & Executive - Information Technology
Department:
Office of Advanced Research Computing
Overview
Rutgers, The State University of New Jersey, stands among the nation’s highest-ranked, most diverse public research universities. The oldest, largest, and top-ranked public university in the New York/New Jersey metropolitan area, you’ll find us at our main locations in three New Jersey cities, and our footprint can be seen around the region. As one of the nation’s most diverse universities, Rutgers draws strength from the rich variety of perspectives and life experiences of our community. We’re an academic, health, and research powerhouse and a university of opportunity.
The Office of Information Technology (OIT) is Rutgers’ enterprise IT office. OIT provides university-wide services and support and collaborates with department and unit IT professionals on projects and initiatives for the Rutgers community. OIT’s services and systems include the Rutgers network; email and calendaring systems; IDs/passwords and identity management; data centers; computer labs; help desk support; wireless connectivity; a software portal; information security, risk, and compliance services; research computing; and many others. OIT’s staff members work closely with the broader university community to advance Rutgers’ missions of teaching, research, and service. For more information, please visit https://it.rutgers.edu.
Posting Summary
Rutgers, The State University of New Jersey, is seeking a Systems Administrator III for the Office of Advanced Research Computing (OARC). OARC is a large and diverse team working to create an outstanding environment for research computing at Rutgers. A key part of OARC’s responsibility to the University is to ensure that we are seeking and supporting the best solutions for constantly evolving computational research challenges. Doing that effectively requires us to carefully address the needs of the community we serve and to ensure that our support networks are diverse, inclusive, and equitable. Diversity is welcome and encouraged here because it fuels innovation and ensures exploration of the broadest range of possible solutions to research problems. Inclusivity and respect for all points of view empower the decision-making and policy development that are the framework of a productive research support organization. Prioritizing equity ensures access to resources needed to support the ideas, initiatives, and efforts of all contributors to our research support operations. Above all, we value our team members and their unique capabilities, interests, and experiences that, in concert, form OARC’s mission and vision.
Key Duties
Supports the university’s Advanced Research Computing (ARC) infrastructure, including High Performance Computing (HPC), High-Throughput Computing (HTC), and Data-Intensive Computing environments.
Manages High Performance Computing services in a multiuser environment.
Helps build relationships in Rutgers’ research community and gather requirements.
Manages end-user accounts and usage utilizing queuing software (schedulers) and other tools.
Provides system services and analyzes system performance for stakeholders, internal teams, and intended end users.
Assists with all activities necessary to activate new operating systems or new releases of existing systems, including analysis, design, implementation, and related documentation.
Assists with all activities necessary to expand an existing operating system or significantly expand an existing system, including analysis, design, implementation, and related documentation.
Assists in analyzing system performance and modifying programs to increase the efficiency of operation.
Assists with reinstating the integrity of the system as quickly as possible following an outage in order to minimize work and data loss.
Participates in on-call rotation.
Works with the rest of the OARC team in the development of training materials and user education for internal and external use as needed.
Understands and adheres to Rutgers’ compliance standards as they appear in RBHS’s Corporate Compliance Policy, Code of Conduct and Conflict of Interest Policy.
Performs other related duties as assigned.
Conceives, develops, optimizes, integrates, and maintains HPC systems, technical operation and continued development of HPC, on-site cloud infrastructure and storage services.
Provides hardware, software, and end-user administration and support to a diverse group of end users that need access to ARC resources.
Operates as a member of the ARC team with focus on one of the University’s campuses.
Position Status
Full Time
Work Arrangement
Consistent with the current application of Rutgers Policy 60.3.22, this position may be eligible for a hybrid work arrangement. The flexible work arrangements outlined in Rutgers Policy 60.3.22 are part of a pilot program that is effective September 1, 2022 through August 31, 2024. Therefore, there is no guarantee that this flexible work arrangement will continue beyond that date. Flexible work arrangements are not permanent, are subject to change or cancellation and contingent on the employee receiving approval in the FlexWork@RU Application System. Additional information may be found at https://futureofwork.rutgers.edu.
Minimum Education and Experience
Bachelor’s degree required, preferably in computer science or engineering.
Equivalent education, experience and/or training may be substituted for the degree requirements.
A minimum of four years of relevant experience, which includes three (3) years of experience in the following:
Familiarity with Linux configuration management including Ansible and Puppet.
Familiarity with common virtualization and cloud technologies in a Linux command-line context.
Management and use of monitoring tools and frameworks using (but not limited to) Nagios, Ganglia, Elasticsearch.
Familiarity with traditional 2-layer/3-layer networking management.
Familiarity with supporting research groups using HPC or similar systems.
Linux support of clusters and infrastructure including troubleshooting, maintenance, management, and design.
Onsite hardware support of hardware used in Linux clusters.
Required Knowledge, Skills, and Abilities
With guidance, design, install, configure, optimize, and maintain integrity of highly complex High-Performance operating system platforms.
Able to help resolve system emergencies with significant impact on the integrity of user data and systems.
Preferred Qualifications
Expert management of batch cluster resource management using Slurm.
Expert management of virtualization, cloud and containerized workflows on site using OpenStack, KVM, Docker, Kubernetes, Jupyter.
Knowledge of security policies.
Storage mesh management of low latency/high speed interconnects for parallel processing: InfiniBand, Omni-Path.
Virtual 2-layer/3-layer networking management with a Linux environment via command line tools.
Installation, maintenance and management of distributed file systems such as (but not limited to) GPFS.
Expert support of Linux clusters and infrastructure including troubleshooting, maintenance, management, design and support.
HPC hardware troubleshooting, support and maintenance.
Physical Demands and Work Environment
Standing, sitting, walking, talking or hearing.
Visual acuity to perform activities such as: viewing a computer terminal, reading, analyzing written information/data, etc.
Ability to perform physical labor without restrictions, including lifting equipment up to fifty (50) pounds.
Posting Details
Posting Number:
23ST3203
Posting Open Date:
11/28/2023
Special Instructions to Applicants
All offers of employment are contingent upon successful completion of all pre-employment screenings. Under Policy 100.3.1 Immunization Policy for Covered Individuals, if employment will commence during Flu Season, Rutgers University may require certain prospective employees to provide proof that they are vaccinated against Seasonal Influenza for the current Flu Season, unless the University has granted the individual a medical or religious exemption.