The role involves administering Linux HPC clusters, managing Slurm configurations, automating workflows, supporting CI/CD infrastructure, and collaborating with research teams while documenting processes.
General Description:
We are seeking a DevOps Software Engineer – HPC Specialist, where our DevOps team supports and maintains high-performance computing (HPC) environments and secure CI/CD infrastructure that support scientific research. This role demands expertise in Linux cluster administration, Slurm workload manager, and DevOps tools such as GitLab CI/CD, Python, and JFrog Artifactory, all within a highly secure, air-gapped environment. You’ll also document complex systems and processes clearly for a variety of technical and non-technical audiences.
Essential Duties:
Administer and troubleshoot Linux-based HPC clusters running Slurm.
Manage and maintain Slurm configurations and job scheduling policies.
Collaborate with researchers to support scalable and automated scientific workflows.
Monitor and optimize HPC performance, capacity, and reliability.
Develop and automate cluster management tasks, including node provisioning, software deployment, and user environment setup.
Administer and troubleshoot CI/CD infrastructure across open and air-gapped networks.
Contribute to Infrastructure-as-Code (IaC) automation and system administration.
Collaborate with developers, system administrators, and research staff to support integrated platforms.
Write and maintain high-quality technical documentation.
Participate in Agile team activities to support iterative problem-solving and project delivery.
Required Skills:
Proven ability to communicate complex technical concepts clearly in both written and verbal formats.
Hands-on experience administering Slurm in HPC environments.
Knowledge of HPC environment architecture and common challenges in scientific computing.
Strong Linux system administration skills.
Proficiency in Python programming and scripting languages (e.g., Bash or PowerShell).
Experience with software packaging and environment management (e.g., Conda) in HPC contexts.
Strong troubleshooting, analytical, and problem-solving abilities.
Familiarity with air-gapped or high-security computing environments.
Experience working in research or scientific computing environments is highly desired.
Required Education:
BS + 6 years of experience, or MS + 4 years of experience in computer science, computer engineering, or a related field. Candidates with different experience levels will be considered for other positions.
Special Requirements:
US citizenship and ability to obtain and maintain US Government security clearance
This is an on-site position due to the need to work with air-gapped networks and sensitive information.
Compensation:
The base salary range for this full-time position is $132,765 - $165,983 + bonus + benefits.
Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range during the hiring process. Please note that the compensation details listed reflect the base salary only, and do not include potential bonus or benefits.
We are proud to be an EEO/AA employer M/F/D/V. We maintain a drug-free workplace and perform pre-employment substance abuse testing.
Top Skills
Bash
Conda
Gitlab Ci/Cd
Jfrog Artifactory
Linux
Powershell
Python
Slurm
Similar Jobs at HRL Laboratories
Computer Vision • Hardware • Machine Learning • Software • Semiconductor
Design, integrate, and test radar and communication systems. Collaborate with teams to develop components and solve customer problems.
Top Skills:
C++MatlabPhased Array SystemsRf SystemsSimulinkSystemvueVss
Computer Vision • Hardware • Machine Learning • Software • Semiconductor
The Program Scheduler develops and manages integrated master schedules, ensuring schedules are aligned across projects while performing risk analysis and reporting project performance.
Top Skills:
Atlassian JiraMicrosoft ProjectMS Office
Computer Vision • Hardware • Machine Learning • Software • Semiconductor
Design PCB layouts including symbol creation, component placement, and routing. Work with vendors to ensure design integrity and performance.
Top Skills:
Cadence AllegroCadence OrcadPcb DesignPcb LayoutRf Design
What you need to know about the Charlotte Tech Scene
Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.
Key Facts About Charlotte Tech
- Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
- Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
- Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
- Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
- Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus