General Dynamics Information Technology

HPC System Administrator

General Dynamics Information Technology, Manassas, Virginia, United States, 22110


Req ID: RQ179611

Type of Requisition: Regular

Clearance Level Must Be Able to Obtain: None

Job Family: Systems Administration

Skills:

High-Performance Computing (HPC) Systems, HPC, Linux

Certifications:

None - N/A

Experience:

10+ years of related experience

US Citizenship Required:

Yes

Job Description:

At GDIT, people are our differentiator. Our work depends on an on-site HPC Systems Administrator joining our team to support the National Oceanic and Atmospheric Administration (NOAA) Weather and Climate Operational Supercomputer System (WCOSS). This position is on site at a data center in the Manassas, Virginia area.

WCOSS provides NOAA the operational High Performance Computing (HPC) resources essential to process sophisticated numerical models used to predict and understand atmospheric and oceanic phenomena for weather and climate operational use. Operating 24/7, the next 10-year WCOSS program will deliver significant computational capability that will evolve over time to keep pace with NOAA’s growing environmental modeling needs.

We are looking for individuals to join GDIT’s team to deploy, operate and support leading-edge technology for WCOSS. Specific technology training will be provided. CANDIDATES MUST HAVE AN ACTIVE PUBLIC TRUST CLEARANCE OR ABOVE TO BE CONSIDERED.

We think. We act. We deliver. There is no challenge we can’t turn into opportunity.

In this role, a typical day will include:

Applying current HPC systems administration skills, with a desire to learn and deploy new technologies.

Developing and deploying monitoring capabilities.

Developing and implementing tools for cluster administration.

Providing technical support with a team of HPC System and Storage Administrators to resolve operational issues.

Providing off-hour on-call support on a rotating basis.

Managing, planning, and reporting for on-site vendor/subcontractor activities.

Working on site at a Manassas data center.

Managing on-site office and access for vendors and subcontractors.

Contributing to planning for software and hardware upgrades along with future installations.

REQUIRED QUALIFICATIONS

Bachelor’s degree or equivalent and 10+ years of experience with HPC systems operations.

Experience working in a 24/7 operational environment.

DESIRED QUALIFICATIONS

Demonstrated experience deploying and managing large-scale HPC systems using OS provisioning tools (e.g., xCAT, HPCM, Bright).

Demonstrated experience using configuration management tools (e.g., Ansible, Puppet).

Linux system administration experience (e.g., SLES, Red Hat or CentOS).

Batch management/scheduling experience, PBS Pro preferred.

Parallel filesystem configuration and monitoring experience (e.g., Lustre, NFS).

Network interconnect configuration and monitoring experience (e.g., InfiniBand, Ethernet).

Programming or scripting in at least two languages (e.g., Bash, Perl, Python, C).

Strong writing skills for technical documents, system procedures, user wikis and FAQs.

Ability to work both independently and as part of a team.

Knowledge of or experience with managing subcontractors or vendors under Service Level Agreements (SLAs).

Knowledge of computer system power and cooling (air and liquid cooling).

Experience managing, maintaining and repairing HPC and server hardware.

The likely salary range for this position is $123,250 - $166,750. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.

Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre- and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, and paid parental, military, bereavement and jury duty leave. To ensure our employees are able to protect their income, other offerings such as short- and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.

We are GDIT, a global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 30 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.

We connect people with the most impactful client missions, creating an unparalleled work experience that allows them to see their impact every day. We create opportunities for our people to lead and learn simultaneously. From securing our nation’s most sensitive systems, to enabling digital transformation and cloud adoption, our people are the ones who make change real.

GDIT is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status, or any other protected class.