Position Details
About this role
This role involves designing and building scalable data pipelines, managing data governance, and optimizing data processing performance in a cloud environment. The candidate will work with Databricks, PySpark, and various cloud tools.
Key Responsibilities
- Build data pipelines
- Ensure data quality and governance
- Optimize data processing performance
- Collaborate with platform teams
- Manage cloud-based data solutions
Technical Overview
The technical environment includes Databricks, Azure Data Factory, Synapse, Terraform, Kafka, Kinesis, and Power BI. The focus is on data pipeline development, data modeling, and performance tuning.
Ideal Candidate
The ideal candidate is a mid-level data engineer with 3+ years of experience building data pipelines, modeling data, and applying data governance. They should have hands-on experience with Databricks, PySpark, SQL, and a cloud platform such as Azure or AWS.
Deal Breakers
- Less than 3 years of relevant experience
- Lack of experience with Databricks or PySpark
- No experience with cloud platforms (Azure, AWS, GCP)
- No knowledge of data modeling or governance