Position Details
About this role
Infrastructure-focused software engineer role at xAI to redefine AI infra with large-scale Kubernetes deployments and distributed systems. You’ll design, build, and maintain compute and data platforms for scalable AI.
Key Responsibilities
- Scale compute infrastructure on Kubernetes by building controllers, admission plugins, and supporting systems that empower teams to leverage Kubernetes effectively
- Design and maintain one of the largest traffic shaping and load balancing deployments using
Technical Overview
Role emphasizes Kubernetes-based infrastructure, controllers and admission plugins, GPU-scale Colossus cluster, and Azure DevOps CI/CD with distributed systems thinking.
Ideal Candidate
The ideal candidate is a highly capable infrastructure engineer with 3-5 years of cloud experience, strong Kubernetes expertise, and hands-on Azure DevOps CI/CD experience. They should thrive in scaling AI infrastructure and distributed systems.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Certifications
Preferred
Industry & Role
Keywords for Your Resume
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile