AI Hardware Systems Engineer, Annapurna Labs, Trainium Machine Learning Fleet Operations

at Amazon.com

📍 US, TX, Austin
Posted March 19, 2026
Type Full-Time
Experience mid
Exp. Years 2+ years
Education Not specified
Category AI & Machine Learning

This role involves debugging GPU and server hardware, developing automation software, and analyzing hardware trends within a large-scale ML hardware fleet. The engineer will work on system remediation and operational excellence for ML servers.

  • Debug hardware issues
  • Develop automation software
  • Analyze hardware trends
  • Manage system remediation
  • Optimize ML server fleet

The technical environment includes scripting in Python and Bash, hardware debugging of GPU and server systems, data infrastructure development, and automation for fleet operations.

The ideal candidate is a mid-level AI or machine learning engineer with at least 2 years of professional software development experience, plus experience scripting in Python or Bash and debugging GPU and server hardware. They are skilled in developing automation tools and analyzing hardware performance trends.

  • 2+ years of professional software development experience
  • Experience with scripting in Python or Bash
  • Experience debugging GPU and server hardware
  • Developing automation software
  • Analyzing hardware trends

  • Experience with large-scale experiments
  • Data infrastructure development
  • Hardware/software co-design
Python, Bash, GPU hardware, server hardware, data infrastructure, automation software, hardware debugging, large-scale experiments
debugging, problem-solving, collaboration, analytical thinking, communication
Industry Technology
Job Function Maintain and optimize hardware/software systems for machine learning infrastructure
Role Subtype AI & Machine Learning Engineer
Tech Domains Python, Bash
AI engineer, ML engineer, GPU hardware, server hardware, Python scripting, Bash scripting, hardware debugging, automation software, data infrastructure, large-scale experiments, hardware trends, ML systems, deep learning, AI systems, software development, Python, Bash

  • No experience with GPU or server hardware debugging
  • Less than 2 years of professional software development experience
  • Lack of scripting experience in Python or Bash
