Position Details

Salary $121K – $148K USD / year

Type Not Specified

Experience mid

Exp. Years 3+ years

Education High School Diploma or GED

Category Data & Analytics

About this role

This role involves designing and building scalable data pipelines, managing data governance, and optimizing data processing performance in a cloud environment. The candidate will work with Databricks, PySpark, and various cloud tools.

Key Responsibilities

Build data pipelines
Ensure data quality and governance
Optimize data processing performance
Collaborate with platform teams
Manage cloud-based data solutions

Technical Overview

The technical environment includes Databricks, Azure Data Factory, Synapse, Terraform, Kafka, Kinesis, and Power BI. The focus is on data pipeline development, data modeling, and performance tuning.

Ideal Candidate

The ideal candidate is a mid-level data engineer with 3+ years of experience in building data pipelines, data modeling, and governance. They should have hands-on experience with Databricks, PySpark, SQL, and cloud platforms like Azure or AWS.

Must-Have Skills

3+ years in data engineering or analyticsexperience with cloud platforms (AzureAWSGCP)building pipelines in Databricksdata modeling (dimensional3NF)SQL and Python (PySpark)data governance and securityETL/ELT pipeline developmentperformance tuning in Spark/Databricks

Nice-to-Have Skills

Data Vault 2.0Delta LakeML/AI experienceCI/CD pipelinesIaC with Terraform

Tools & Platforms

DatabricksAzure Data FactorySynapseTerraformGitPower BITableauKafkaKinesisEvent Hubs

Required Skills

DatabricksPySparkSQLData Vault 2.0Delta LakeETLELTData governanceCloud platformsAzureAWSGCPPower BITableauKafkaKinesisEvent HubsTerraform

Hard Skills

DatabricksPySparkSQLData Vault 2.0Delta LakeAzure Data FactorySynapseTerraformGitPower BITableauMLData governanceData modelingETLELTSparkPerformance tuningCost optimizationKafkaKinesisEvent Hubs

Soft Skills

collaborationproblem-solvingindependent workanalytical thinkingcommunication

Industry & Role

Industry Energy

Job Function Data pipeline development and data governance

Role Subtype Data Engineer

Tech Domains Databricks, Azure Data Factory, Synapse, Terraform, Power BI, Tableau, Kafka, Kinesis, Event Hubs

Keywords for Your Resume

data engineeringdatabrickspysparksqldelta lakedata vault 2.0etleltdata governancecloud platformsazureawsgcppower bitableaukafkakinesisevent hubsterraform

Deal Breakers

Less than 3 years of relevant experience, Lack of experience with Databricks or PySpark, No experience with cloud platforms (Azure, AWS, GCP), No knowledge of data modeling or governance

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile

Advisor III, Data Engineering

Get matched to jobs like this