Position Details
About this role
This is a senior software engineer role within Annapurna Labs, focused on ML acceleration platforms. You will build software that initializes machine learning accelerators, validates firmware, monitors server health, and implements large-scale data collection and recovery mechanisms.
Key Responsibilities
- Initialize machine learning accelerators and monitor server health
- Evaluate and optimize firmware performance
- Develop tests to validate firmware
- Build data collection and aggregation systems at AWS scale
- Build error detection and recovery mitigation systems at AWS scale
Technical Overview
The technical scope includes firmware performance evaluation and test development, systems software for accelerator initialization, and AWS-scale monitoring built on sensor data, logs, and device metrics. You will also implement error detection and recovery mitigation, along with automation, continuous integration, and deployments driven by fleet metrics.
Ideal Candidate
The ideal candidate is a senior software engineer who has built and operated systems for machine learning acceleration, including firmware performance evaluation and validation testing. They can develop scalable monitoring and data aggregation systems at AWS scale and implement error detection and recovery mitigation for high-availability fleets.
Deal Breakers
- Must be able to evaluate and optimize firmware performance
- Must have experience building monitoring/data systems using sensor data, logs, and device metrics
- Must be able to build error detection and recovery mitigation systems