Position Details
About this role
This role involves developing evaluation strategies and validation processes for AI agents, focusing on large language models, data quality, and safety standards within a SaaS life sciences environment.
Key Responsibilities
- Define evaluation strategies
- Assess LLM output quality
- Design high-quality datasets
- Develop automated evaluation pipelines
- Perform root cause analysis
Technical Overview
The technical environment includes AI evaluation, data curation, automation pipelines, and model validation, primarily using Python and related tools.
Ideal Candidate
The ideal candidate is an experienced AI Data Engineer with a focus on large language models and evaluation methodologies. They have strong analytical skills and experience with data curation, model validation, and automation pipelines, and they can communicate technical findings effectively.
Deal Breakers
- Lack of experience with AI evaluation methodologies
- No background in data curation or model validation
- Unable to work remotely within North America