
Principal Engineer - AI Platform Solutions

at Advanced Micro Devices

📍 Santa Clara, California, United States
Hybrid
Posted March 14, 2026
Type Not specified
Experience Lead
Exp. Years Not specified
Education Not specified
Category AI & Machine Learning

This role involves partnering with AI software teams and customers to enable large-scale training and inference on AMD GPUs, designing scalable Kubernetes architectures, and optimizing AI workloads.

  • Partner with teams to enable LLM training
  • Design Kubernetes architectures
  • Validate inference frameworks
  • Optimize GPU workloads
  • Collaborate with customers

The technical environment includes AI infrastructure, Kubernetes, distributed training frameworks, GPU computing, and inference frameworks like vLLM and SGLang.

The ideal candidate is a lead AI platform engineer with extensive experience in large-scale AI infrastructure, Kubernetes, and GPU-based distributed training. They should be solution-oriented, collaborative, and capable of designing scalable AI deployment architectures.

AI infrastructure, Kubernetes, Large Language Models, distributed training, GPU computing
K8s, Inference frameworks, vLLM, SGLang, Kubernetes-native
Kubernetes, SLURM, Kubeflow, MPI, Volcano, Kueue
AI platform, AI infrastructure, Large Language Models, LLM, Kubernetes, distributed training, inference frameworks, vLLM, SGLang, GPU computing
AI Platform, AI infrastructure, Large Language Models, LLM, Kubernetes, K8s, Distributed training, Inference frameworks, vLLM, SGLang, Kubernetes-native, GPU computing, Container orchestration
collaborative, solution-oriented, problem-solving, communication, teamwork
Industry AI / Data Centers / Hardware
Job Function AI platform solutions engineering for large-scale AI workloads
Role Subtype AI Infrastructure Engineer
Tech Domains Kubernetes, Linux, MPI, SLURM, Container orchestration
Clearance Required None
Visa Sponsorship No
AI infrastructure, Large Language Models, LLM, Kubernetes, distributed training, GPU, inference frameworks, vLLM, SGLang, K8s, Kubeflow, MPI, Volcano, Kueue, GPU computing

Lack of experience with Kubernetes or AI infrastructure; no experience with large language models; unwillingness to work in a hybrid environment.
