Position Details
About this role
This role involves building and operationalizing AI/ML systems in mission-critical environments, focusing on deployment, monitoring, and reliability of models.
Key Responsibilities
- Own ML lifecycle end-to-end
- Deploy models into mission environments
- Implement monitoring for models and systems
- Build cloud-native ML infrastructure
- Establish data discipline
Technical Overview
The technical environment includes Python, ML frameworks (PyTorch, TensorFlow), containerization with Docker, orchestration with Kubernetes, and monitoring tools like Prometheus and Grafana, within cloud-native and distributed systems.
Ideal Candidate
The ideal candidate is a mid-level AI/ML engineer with experience deploying machine learning systems into production, especially within classified or mission-critical environments. They possess strong skills in Python, ML frameworks, and cloud-native infrastructure, with a focus on reliability and system stability.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Clearance & Visa
Keywords for Your Resume
Deal Breakers
Lack of experience with ML deployment in production environments, No experience with Kubernetes or Docker, No security clearance or experience working in classified environments, Lack of experience with monitoring tools (Prometheus, Grafana, OpenTelemetry)
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile