✦ Luna Orbit — Software Engineering

Sr. Software Engineer, Annapurna Labs, ML Acceleration

at Amazon.com

📍 US, TX, Austin Unknown Posted April 14, 2026
Type Full-Time
Experience senior
Exp. Years Not specified
Education Not specified
Category Software Engineering

Senior software engineer role within Annapurna Labs focused on ML acceleration platforms. You will build software that initializes machine learning accelerators, validates firmware, monitors server health, and implements large-scale data collection and recovery mechanisms.

  • Initialize machine learning accelerators and monitor server health
  • Evaluate and optimize firmware performance
  • Develop tests to validate firmware
  • Build data collection and aggregation systems at AWS scale
  • Build error detection and recovery mitigation systems at AWS scale

The technical scope includes firmware performance evaluation and test development, systems software for accelerator initialization, and AWS-scale monitoring with sensor data, logs, and device metrics. You will also implement error detection and recovery mitigation plus automation, continuous integration, and fleet metrics-driven deployments.

The ideal candidate is a senior software engineer who has built and operated systems for machine learning acceleration, including firmware performance evaluation and validation testing. They can develop scalable monitoring and data aggregation systems at AWS scale and implement error detection and recovery mitigation for high-availability fleets.

Develop software that initializes machine learning acceleratorsMonitor server health by collecting sensor datalogsand device metricsEvaluate and optimize firmware performanceDevelop tests to validate firmwareBuild data collection and aggregation systems at AWS scaleBuild error detection and recovery mitigation systems at AWS scale
Amazon Web Services (AWS)continuous integrationfleet metrics
machine learning acceleratorsfirmware performancetests to validate firmwaresystems softwaredata collectiondata aggregationsensor datalogsdevice metricsmonitor server healtherror detectionrecovery mitigationAWS scaleautomationcontinuous integrationfleet metrics
machine learning acceleratorsfirmware performance evaluationfirmware performance optimizationtests to validate firmwaresystems softwaredata collectiondata aggregationmonitor server healthcollecting sensor datacollecting logscollecting device metricserror detectionerror detection and recovery mitigation systemscontinuous integrationfleet metricsautomationscalable software systemsAWS scale
cross-functional collaborationarchitectural abstraction thinkingcode reviewsmentorshipknowledge sharingmonitoring and operational focuscommunicationbuilding in highly cross-functional environment
Industry SaaS
Job Function Develop scalable software platforms for ML accelerator initialization, monitoring, and reliability.
Role Subtype Platform Engineer
Tech Domains Amazon Web Services
Sr. Software EngineerSenior Software EngineerAnnapurna LabsML Accelerationmachine learning acceleratorsfirmware performanceoptimize firmware performancetests to validate firmwaresystems softwaredata collectiondata aggregationsensor datalogsdevice metricsmonitor server healtherror detectionrecovery mitigationAWS scaleautomationcontinuous integrationfleet metricscross-functional environment

Must be able to evaluate and optimize firmware performance, Must have experience building monitoring/data systems using sensor data, logs, and device metrics, Must be able to build error detection and recovery mitigation systems

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile