Position Details
About this role
This role involves designing, developing, and maintaining scalable data pipelines supporting AI research and production, with a focus on data governance, security, and automation.
Key Responsibilities
- Design data pipelines
- Collaborate with AI scientists
- Apply data governance
- Monitor data pipelines
- Improve reliability
Technical Overview
The technical environment includes Python, SQL, NoSQL databases, cloud platforms (AWS, Azure, GCP), Kubernetes, CI/CD pipelines, data warehousing, and vector databases.
Ideal Candidate
The ideal candidate is a mid-level data engineer with 3+ years of experience in building scalable data pipelines, proficient in Python and SQL, with knowledge of cloud platforms like AWS, Azure, or GCP. They should have experience with data governance, automation, and working with NoSQL and vector databases.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with cloud platforms, No experience in data pipelines or ETL, No proficiency in Python or SQL, No data governance experience
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile