Position Details
About this role
This role involves advancing the Triton compiler and runtime for AMD GPUs, focusing on distributed execution, communication, and performance optimization for AI workloads.
Key Responsibilities
- Develop AMD GPU backend for Triton; Build distributed communication capabilities; Optimize kernel performance; Collaborate with hardware teams; Enhance compiler infrastructure
Technical Overview
The technical environment includes GPU architecture, compiler backends like MLIR and LLVM, AMD Instinct accelerators, and performance tuning tools, working within AMD's Triton ecosystem.
Ideal Candidate
The ideal candidate is a senior AI/ML software engineer with deep expertise in GPU architecture, compiler technologies, and distributed systems. They should have experience optimizing GPU kernels, working with AMD Instinct accelerators, and developing scalable AI training and inference solutions.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with GPU compiler or runtime, No background in GPU architecture or performance engineering, Unable to work in a hybrid environment, No experience with AMD GPUs
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile