Position Details
About this role
This role involves managing Datadog’s experiment tracking platform for AI and ML models, working with research and engineering teams to define product vision, and driving adoption of scalable ML training solutions.
Key Responsibilities
- Define product vision for Model Lab
- Lead discovery with AI teams
- Design experiment tracking system
- Partner with engineering
- Drive customer adoption
Technical Overview
The work centers on ML infrastructure, experiment tracking, distributed training, hyperparameter tuning, and reproducibility workflows, using frameworks such as PyTorch, TensorFlow, and JAX.
Ideal Candidate
The ideal candidate is a product manager with at least 4 years of experience in ML infrastructure, experiment tracking, and distributed training, and familiarity with frameworks such as PyTorch, TensorFlow, or JAX. They are entrepreneurial, thrive in ambiguity, and can translate complex ML workflows into product features.
Deal Breakers
- Lack of experience in ML infrastructure or experiment tracking
- No familiarity with ML frameworks
- Inability to operate in ambiguous environments
- No experience with distributed training
- Lack of technical translation skills