Position Details
About this role
Cursor is seeking a Software Engineer to join the Model Routing & Inference team. You will build the inference platform that powers every AI interaction in the product, including the inference gateway and cross-provider routing and failover.
Key Responsibilities
- Build and maintain the inference platform
- Design and evolve the inference gateway
- Implement cross-provider failover
- Design routing backpressure and admission control
- Optimize GPU utilization and cost-performance at scale
Technical Overview
The role focuses on building high-throughput, low-latency distributed systems for inference serving, with emphasis on gateway abstractions over provider APIs, routing logic, backpressure, and capacity planning to optimize GPU usage and cost/performance.
Ideal Candidate
The ideal candidate is a mid-to-senior software engineer with experience building high-throughput, low-latency distributed systems for inference serving. They should excel at making trade-offs between reliability, cost, latency, and user experience in production-scale AI workflows.
Deal Breakers
- Bachelor's degree or equivalent in IT or a related field
- Lack of experience with high-throughput distributed systems