Position Details
About this role
This role involves developing and optimizing GPU-based machine learning inference solutions, focusing on performance analysis, workload profiling, and hardware-software collaboration to meet next-generation AI performance goals.
Key Responsibilities
- Work on GPU ML workloads
- Profile inference pipelines
- Optimize transformer models
- Collaborate with compiler and hardware teams
- Develop performance strategies
Technical Overview
The position requires expertise in GPU performance profiling, ML workload development, transformer model optimization, and collaboration with hardware and compiler teams, utilizing tools like CUDA and profiling software.
Ideal Candidate
The ideal candidate is a senior AI/ML engineer with extensive experience in GPU performance analysis, workload optimization, and hardware-software collaboration, particularly in AI inference solutions.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience in GPU performance profiling, No background in machine learning workloads, No experience with hardware architecture
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile