Position Details
About this role
This role designs, builds, and maintains high-performance AI infrastructure platforms, with a focus on GPU-enabled servers and HPC clusters.
Key Responsibilities
- Build and upgrade HPC systems
- Automate configuration management
- Optimize system performance
- Collaborate with stakeholders
- Monitor system health
Technical Overview
The environment includes NVIDIA DGX, Cisco UCS, Linux, GPU acceleration, HPC benchmarking, and DevOps automation tools.
Ideal Candidate
The ideal candidate is an experienced infrastructure engineer with 7+ years in high-performance computing environments, proficient in Linux, Python, and GPU infrastructure, with strong troubleshooting skills.
Deal Breakers
- Less than 7 years of HPC experience
- Lack of Linux or Python expertise
- No experience with NVIDIA DGX or Cisco UCS