✦ Luna Orbit — AI & Machine Learning

Sr. Engineer - Performance AI/ML Deployment Engineering

at Advanced Micro Devices

📍 Santa Clara, California, United States Hybrid Posted April 16, 2026
Type Not Specified
Experience executive
Exp. Years Not specified
Education Not specified
Category AI & Machine Learning

This senior/principal role focuses on deploying and managing AI/ML fabrics for AMD data center GPU systems. You will act as a technical interface with customers and partners, drive at-scale debug and infrastructure optimization, and benchmark Machine Learning applications across compute, network, and storage environments.

  • Collaborate with strategic customers on scalable compute, networking, and storage designs
  • Perform system-level triage and at-scale debug across hardware/firmware/software
  • Drive the ramp of Instinct-based large scale AI datacenter infrastructure
  • Interface with ROCm, DC GPU HW/FW/ASIC teams, field engineering, OEM/ODM partners, CSPs
  • Benchmark and optimize Machine Learning application performance across infrastructure

The role emphasizes large network architecture, storage environments, AI/ML network deployments, and performance tuning for Instinct-based datacenter infrastructure. You will lead post-rollout management, system triage, and at-scale debug across hardware, firmware, and software, coordinating with ROCm and DC GPU engineering teams.

The ideal candidate is a senior/principal engineer focused on AI/ML deployment engineering for data center GPU infrastructure. They have strong experience with large network architecture, storage environments, and performance tuning, and can perform disciplined system triage and at-scale debug across hardware, firmware, and software while interfacing with customers and internal engineering teams.

technical interface between customers and internal engineering groupsat-scale debug of complex issues across hardwarefirmwareand softwareoptimize computenetworkand storage and benchmark Machine Learning applicationslead/drive ramp of Instinct-based large scale AI datacenter infrastructurediscipline approach to system triage and infrastructure optimization
ROCmInstinct(TM)
AI/ML fabric deploymentsystem triageat-scale debuginfrastructure optimizationcompute networking storage environmentperformance tuningbenchmarking Machine Learning applicationslarge network architecturestorageAI/ML network deploymentsROCm interfaceInstinct(TM) deploymentdatacenter deploymenthardware firmware software troubleshooting
DC GPU AI/ML Advanced Forward Deployment and Systems EngineeringAI/ML fabric deploymentsystem triageat-scale debuginfrastructure optimizationcompute architecture optimizationnetwork architecturestorage environmentbenchmarking Machine Learning applicationslarge network architecturestorageAI/ML network deploymentsperformance tuningpost-rollout managementproduction qualification rampdatacenter deploymentcompute networking storage environmentMachine Learning applicationsROCm software developers interface
leadershipself-motivatedteam environment collaborationtechnical interface with customerscross-functional collaborationrapid resolution focus
Industry Manufacturing
Job Function Lead advanced forward deployment and systems optimization for AI/ML data center GPU infrastructure
Role Subtype Site Reliability Engineer
Sr. EngineerPerformance AI/ML Deployment EngineeringSenior/Principal EngineerDC GPU AI/ML Advanced Forward Deployment and Systems EngineeringAI/ML fabricsadvanced forward deploymentpost-rollout managementsystem triageat-scale debuginfrastructure optimizationnetwork architectureStorageAI/ML network deploymentsperformance tuningbenchmarkingMachine Learning applicationsInstinctInstinct(TM)ROCmcustomers and partnersfield application engineersOEM/ODM partnersCSPshardwarefirmwaresoftware

Must have experience performing system triage and at-scale debug across hardware, firmware, and software, Must have experience optimizing compute, network, and storage and benchmarking Machine Learning applications

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile