Senior HPC Infrastructure Engineer
Job Description:
Job Title: Senior HPC Infrastructure Engineer
Primary Location: Chicagoland, Hybrid with 2-3 days in-office.
Position Type: Contract
Overview
TalentFish is casting a line for a Sr. HPC Infrastructure Engineer! This contract position
plays a key role in the design, deployment, and optimization of high-performance
computing (HPC) infrastructure, both on-prem and in cloud environments. The role
combines deep systems expertise with hands-on administration to ensure scalable,
reliable, and secure environments for advanced scientific research and computational
workloads.
What You Bring to the Role (Ideal Experience)
- Strong background in Linux/Unix system administration
- Experience designing and supporting HPC clusters in research, academic, or scientific computing environments
- Proficiency with parallel computing frameworks such as MPI and OpenMP
- Familiarity with job scheduling/resource management systems (e.g., Slurm, Torque, PBS)
- Hands-on experience with high-speed interconnects (e.g., InfiniBand, Omni-Path)
- Strong understanding of networking, storage solutions, and system performance tuning
- Experience with backup, disaster recovery, and data integrity solutions in high-performance environments
- Fluency in scripting (e.g., Bash, Python)
- Strong troubleshooting skills and a collaborative communication style
- Bachelor's degree in Computer Science or Engineering, or equivalent experience (Master's preferred)
- Relevant technical certifications (e.g., Red Hat, CompTIA Linux+) are a plus
What You'll Do (Skills Used in this Position)
- Design, deploy, and manage scalable HPC systems across both cloud and on-prem environments
- Define system requirements and optimize Linux-based systems for performance, reliability, and scalability
- Maintain, monitor, and patch HPC environments to ensure high availability and security
- Design and manage high-performance storage systems with robust backup, replication, and archival strategies
- Conduct benchmarking and performance tuning, collaborating with HPC operations to resolve bottlenecks
- Partner with cybersecurity teams to ensure compliance and security in HPC environments
- Maintain technical documentation, SOPs, and troubleshooting guides
- Provide end-user training and technical support, and manage on-site computing technologies
- Contribute to overall operational efficiency through team collaboration and continual improvement initiatives
Compensation Information
The expected pay range for this position is $55-65 per hour, depending on experience and
qualifications. This role also offers comprehensive benefits, including health insurance, a
401(k) plan, and paid time off. TalentFish is committed to pay transparency and equal
opportunity. The pay range provided complies with applicable state and federal
regulations.
This role requires authorization to work in the U.S. without current or future visa sponsorship.
All offers are contingent upon the completion of a background check, which may include,
but is not limited to, reference checks, education verification, employment verification, drug
testing, criminal records checks, and any required certifications or compliance requirements
based on the end client's background check policies and applicable laws.
About TalentFish
TalentFish is an employee-owned company pioneering a new realm in talent acquisition. We
are redefining IT staffing by evolving AI, video screening, and our unique platform.
TalentFish focuses on providing the best employee, consultant, and client experience
possible.
At TalentFish, we are an Equal Opportunity Employer; we embrace and encourage diversity!
Required Skills:
Linux, Unix, System Administration, Infrastructure, High Availability, Scalability, Reliability, Storage, Optimization, Networking, Security, Compliance, Disaster Recovery, Data Integrity, Scheduling, Bash, Python, Troubleshooting, Technical Support, Technical Documentation, Operations, Operational Efficiency, Design, Engineering, Computer Science, Research, Training, Collaboration, Communication, Management
Salary Package:
$55.00 - $65.00 per hour (US Dollar)