✦ Luna Orbit — Cloud & Infrastructure

Software Engineer, ML Infrastructure

at Cursor

📍 SF / NY Unknown Posted March 08, 2026
Type Full-Time
Experience mid
Exp. Years Not specified
Education Not specified
Category Cloud & Infrastructure

This role involves building and maintaining large-scale compute and storage infrastructure to support Cursor’s AI and coding models, working closely with ML researchers and engineers to optimize training systems and hardware utilization.

  • Improve training throughput
  • Build GPU infrastructure
  • Collaborate with ML teams
  • Automate GPU cluster management
  • Enhance system reliability

The role focuses on developing high-performance infrastructure, including GPU clusters, distributed storage, and networking, utilizing tools like Kubernetes, Slurm, and infrastructure-as-code practices across Linux environments.

The ideal candidate is a systems engineer with experience in building large-scale compute and storage infrastructure, proficient in Python, Typescript, Rust, and Golang. They have hands-on experience with distributed storage, networking, and GPU infrastructure, and can operate in Linux and cloud environments, preferably with Kubernetes and Slurm expertise.

Strong background in systems and infrastructure-focused software engineeringExperience with PythonTypescriptRustGolangExperience with distributed storage and networking infrastructureExposure to large-scale systemsProduction use of infrastructure-as-codeOperational exposure to Nvidia GPUs
Exposure to Nvidia Blackwell and Hopper hardwareExperience with RaySlurmKubernetes experience
KubernetesRaySlurmLinuxNvidia GPUsBlackwellHopper
Large-scale computeStorage infrastructureSoftware infrastructureGPU infrastructureDistributed storageNetworking infrastructureLinuxKubernetesK8sGPU clustersNvidia GPUsInfinibandRoCERaySlurmInfrastructure-as-codeConfiguration management
Large-scale computeStorage infrastructureSoftware infrastructureGPU infrastructureDistributed storageNetworking infrastructureLinuxKubernetesK8sGPU clustersNvidia GPUsInfinibandRoCERaySlurmInfrastructure-as-codeConfiguration management
Problem-solvingCollaborationSystems thinkingOperational awarenessAdaptability
Industry SaaS
Job Function Build and optimize large-scale compute and storage infrastructure for AI training
Large-scale computeStorage infrastructureSoftware infrastructureGPU infrastructureDistributed storageNetworking infrastructureLinuxKubernetesK8sGPU clustersNvidia GPUsInfinibandRoCERaySlurmInfrastructure-as-codeConfiguration management

Lack of experience with large-scale systems, No experience with Nvidia GPUs or infrastructure-as-code, Unfamiliarity with Linux or Kubernetes

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile