✦ Luna Orbit — AI & Machine Learning

Staff ML Engineer, Inference Platform

at General Motors

📍 Sunnyvale, California, United States (Hybrid) 💰 $185K – $270K USD / year Posted March 14, 2026
Type Full-Time
Experience Senior
Exp. Years 8+ years
Education Not specified
Category AI & Machine Learning

This role involves developing and scaling AI inference platforms for autonomous vehicle applications, focusing on high-performance distributed systems and GPU utilization.

  • Design core ML platform components
  • Optimize model serving
  • Lead large-scale AI infrastructure projects
  • Collaborate with ML engineers
  • Research state-of-the-art model deployment techniques

Requires expertise in machine learning systems, distributed computing, GPU hardware, cloud infrastructure, and container orchestration tools like Kubernetes and Docker.

The ideal candidate is a senior AI/ML engineer with more than 8 years of experience building scalable, distributed machine learning systems.

  • 8+ years of industry experience
  • Expertise in machine learning systems or backend services
  • Experience with distributed systems
  • Knowledge of GPU hardware
Open source contributions, AI infrastructure, Model optimization, Scalability, Real-time inference
Kubernetes, Docker, GPU hardware (H100, A100, B200)
Machine Learning, Distributed Systems, Model Serving, Backend Software, GPU Utilization, Cloud Computing, High Performance Computing, Python, C++, Kubernetes, Docker
Problem-solving, Technical Leadership, Collaboration, Innovation, Communication
Industry Automotive
Job Function Building and scaling AI inference platforms for autonomous vehicles
Role Subtype AI & Machine Learning
Tech Domains Machine Learning, Distributed Systems, Cloud Computing, High Performance Computing
Clearance Required None
Visa Sponsorship No

Disqualifiers: less than 8 years of experience; lack of expertise in distributed systems or GPU hardware; no experience with Kubernetes or Docker; inability to work in a hybrid environment.
