Position Details
About this role
This role involves designing and validating large-scale AI training and inference architectures on AMD GPUs, focusing on Kubernetes-based solutions for enterprise AI deployments.
Key Responsibilities
- Design reference architectures
- Validate Kubernetes training stacks
- Implement GPU placement strategies
- Collaborate with customers
- Benchmark inference frameworks
Technical Overview
The role centers on GPU-accelerated computing, Kubernetes orchestration, distributed training, inference frameworks such as vLLM and SGLang, and optimization of large language model workloads.
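To give a concrete sense of the Kubernetes side of this work: scheduling a workload onto AMD GPUs is typically done by requesting the extended resource exposed by the AMD GPU device plugin. The manifest below is a minimal sketch, not taken from the posting; the image name and namespace are illustrative, and it assumes the AMD device plugin is installed on the cluster.

```yaml
# Hypothetical pod spec: request one AMD GPU for an inference container.
# Assumes the AMD GPU device plugin is running on the cluster, which
# advertises GPUs under the resource name "amd.com/gpu".
apiVersion: v1
kind: Pod
metadata:
  name: llm-inference-example   # illustrative name
spec:
  containers:
    - name: inference
      image: example.com/vllm-rocm:latest   # placeholder image
      resources:
        limits:
          amd.com/gpu: 1   # schedule onto a node with a free AMD GPU
```

GPU placement strategies in practice layer node labels, taints/tolerations, and topology-aware scheduling on top of this basic resource request.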
Ideal Candidate
The ideal candidate is a solution-oriented AI infrastructure engineer with hands-on experience in GPU-accelerated computing, Kubernetes, and large-scale AI deployments. They should be capable of designing production-ready systems and optimizing AI workloads.
Deal Breakers
- Lack of experience with GPU-accelerated computing
- No knowledge of Kubernetes or distributed training
- Inability to develop production AI solutions
- No experience with inference frameworks