Position Details
About this role
This role involves developing and troubleshooting cloud-native, GPU-accelerated systems for high-performance AI and simulation workloads, focusing on Kubernetes, Slurm, and microservices.
Key Responsibilities
- Debug multi-tenant clusters
- Prototype Kubernetes/Slurm features
- Resolve scheduling issues
- Integrate GPU capabilities
- Collaborate on system improvements
Technical Overview
The technical environment includes Kubernetes, Slurm, GPU computing with CUDA, container orchestration, and cloud-native microservices architecture.
Ideal Candidate
The ideal candidate is a mid-level engineer with over 5 years of experience in cloud-native environments, GPU computing, and container orchestration. They excel in troubleshooting complex systems involving Kubernetes, Slurm, and GPU accelerators, with strong communication skills for customer engagement.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 5 years of relevant experience, Lack of experience with Kubernetes or Slurm, No experience with GPU computing or CUDA
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile