Position Details
About this role
AMD is seeking a Senior ML Systems Engineer to develop and optimize ML operator kernels and dataflow pipelines for the NPU platform, with full-stack responsibilities from kernel implementation to hardware integration.
Key Responsibilities
- Drive technical innovation in NPU kernel and dataflow development; debug silicon bring-up and production issues; coordinate with compiler, runtime, and hardware teams; model performance; mentor engineers
Technical Overview
Role spans kernel development for ML workloads across ROCm stack, ONNX Runtime integration, and performance optimization for AMD GPUs/NPUs, including quantization and low-level hardware interactions.
Ideal Candidate
The ideal candidate is a senior ML/AI systems engineer with a strong background in GPU/accelerator kernel development, ROCm/ROCm stack expertise, and hands-on experience with ONNX Runtime. They should excel at cross-functional debugging across frameworks, runtimes, and hardware, and be capable of guiding large-scale AI workloads.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Must have a Masters or PhD in Electrical/Computer Engineering or related field
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile