Position Details
About this role
This role focuses on building and optimizing distributed systems for AI inference, including request routing, autoscaling, and multi-region deployment across cloud platforms.
Key Responsibilities
- Design request routing algorithms
- Autoscale the compute fleet
- Build deployment pipelines
- Integrate new AI accelerators
- Optimize inference performance
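To give a flavor of the first responsibility, here is a minimal sketch of a least-loaded request router. This is purely illustrative of the kind of problem the role involves, not an implementation used by the team; the class and replica names are hypothetical.

```python
class LeastLoadedRouter:
    """Toy request router: send each request to the replica
    with the fewest in-flight requests (least-connections policy)."""

    def __init__(self, replicas):
        # Hypothetical replica names map to their in-flight request counts.
        self.loads = {r: 0 for r in replicas}

    def route(self):
        # Pick the replica with the minimum in-flight count
        # (ties broken by insertion order).
        replica = min(self.loads, key=self.loads.get)
        self.loads[replica] += 1
        return replica

    def complete(self, replica):
        # Call when a request finishes so the replica's load drops.
        self.loads[replica] -= 1
```

A production router would also weigh factors such as replica health, queue depth, and KV-cache locality, but the core load-balancing loop looks much like this.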
Technical Overview
The role involves distributed systems engineering, Kubernetes, cloud infrastructure, ML inference optimization, and deployment pipelines.
Ideal Candidate
The ideal candidate is a results-oriented software engineer with extensive experience in distributed systems, Kubernetes, and large-scale ML inference deployment. They are flexible, impact-driven, and comfortable working across cloud platforms.
Deal Breakers
- Lack of experience with distributed systems or Kubernetes
- No background in ML inference or deployment pipelines
- Inability to work in fast-paced, impact-driven environments