High Performance Computing Engineer - Mid-level Job at GDIT, Chantilly, Loudoun County, VA

  • GDIT
  • Chantilly, Loudoun County, VA

Job Description

Responsibilities for this Position

Location: USA VA Chantilly
Full Part/Time: Full time
Job Req: PRX25933

Type of Requisition:
Regular

Clearance Level Must Currently Possess:
Top Secret SCI + Polygraph

Clearance Level Must Be Able to Obtain:
Top Secret SCI + Polygraph

Public Trust/Other Required:
None

Job Family:
IT Infrastructure and Operations

Job Qualifications:

Skills:
Automation, Scripting, Tooling
Certifications:
None
Experience:
6+ years of related experience
US Citizenship Required:
Yes

Job Description:
HPC Engineer

Who You Are

You are a talented, multidiscipline engineer versed in getting the best performance out of systems. You are familiar with High Performance Computing using both CPU and GPU based systems. You understand scheduling using SLURM, computing using MPI, and operating software at scale.

What you will be doing

Playing a key role in defining and operating some of the most complex compute platforms that the client has to bring to bear against complex problems. These systems enable complex analysis, simulation and modeling, leveraging massively parallel computing and disparate holdings of very large data sets, to answer difficult questions. To do this you will assist the users in deploying jobs to these systems to harness their capabilities, producing answers in the form of analytic product, models and simulations. This mission enablement is the heart of the hardest problems to solve.

  • Responsible for the normal day-to-day HPC operations and maintenance of the HPC systems
  • Provide day-to-day systems administration duties for Nvidia GPUs, Commodity Cluster Systems and Cray HPC environments
  • Perform system monitoring, software installations, debugging, upgrades, health checks, and identification/implementation of automated business processes
  • Provide assessments, ongoing performance analysis and recommendations for future architectures
  • Responsible for operating all the host systems for the analysis
  • Works in a liaison role, linking the analysts and their specialty codes and applications to the computing systems that are focused on yielding in-depth, technically sound results
  • Oversees analytic applications running on a clustered HPC fabric including CPU and GPU systems
  • Managing job submission to clients' applications and codes using MPI/OpenMPI
  • Provide in-depth analytic results, to achieve a best-tool-for-the-job approach
  • Partners with data scientists, engineers, and analysts conducting specialized scientific and engineering analysis
  • Escalate issues and problems to hardware support and/or engineering management as necessary
  • Responsible for continuous performance analysis and tuning the HPC environment
  • Assist with the identification, troubleshooting, and repair of software problems impacting performance of implemented HPC solutions
  • Perform installation of software patches including upgrades to operating systems and firmware
  • Assist with the resolution of trouble tickets and software problems identified by system users
  • Identify and expand services and functionalities offered in HPC environment
  • Be a primary point of contact to resolve any hardware or software malfunctions, including working with service personnel as necessary
  • Review system logs to identify and resolve software and systems related issues
  • Prepare reports related to the operational efficiency of the hardware and execution of users' jobs
  • Experience with MPI/OpenMPI, SLURM, and Linux Operating Systems essential
  • Prior experience as a Systems Administrator essential, with a preference for experience working with clustered systems including GPUs in the hardware stack
  • Experience with high speed networking and CUDA preferred
  • Software integration experience a plus
  • Other duties could be required to support the customer's mission

What you will need

  • Minimum of 6 years demonstrated on-the-job experience
  • Demonstrated on-the-job experience with integrating functionality from disparate systems via scripting/tooling/automation
  • Demonstrated on-the-job experience with the Sponsor's system security environment and requirements
  • Demonstrated experience leading systems architecture, operations, maintenance and administration

The likely salary range for this position is $207,386 - $280,582. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.

Scheduled Weekly Hours:
40

Travel Required:
None

Telecommuting Options:
Onsite

Work Location:
USA VA Chantilly

Additional Work Locations:

Total Rewards at GDIT:
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.

We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.

Join our Talent Community to stay up to date on our career opportunities and events at gdit.com/tc.

Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans



PI280965546



