Position Details
About this role
This role involves designing and developing scalable data pipelines for financial data, supporting real-time analytics, and ensuring data governance using Databricks, PySpark, and Airflow.
Key Responsibilities
- Design and develop data pipelines
- Implement ETL processes
- Support real-time analytics
- Collaborate with data scientists and analysts
- Ensure data quality and compliance
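As a rough illustration only (the role's actual pipelines would run on PySpark and Databricks, and the stages, field names, and data below are invented for this sketch), an ETL process like the one described above follows an extract → transform → load pattern:

```python
from typing import Iterable

def extract(rows: Iterable[str]) -> list[dict]:
    """Extract stage: parse raw CSV-like lines into records."""
    records = []
    for line in rows:
        symbol, price = line.split(",")
        records.append({"symbol": symbol, "price": float(price)})
    return records

def transform(records: list[dict]) -> list[dict]:
    """Transform stage: drop invalid (non-positive) prices."""
    return [r for r in records if r["price"] > 0]

def load(records: list[dict], sink: list) -> None:
    """Load stage: write cleaned records to a sink (a list here,
    standing in for a table or topic)."""
    sink.extend(records)

# Run the three stages end to end on hypothetical price quotes.
sink: list = []
raw = ["AAPL,189.50", "MSFT,-1.0", "GOOG,140.10"]
load(transform(extract(raw)), sink)
# The MSFT row is filtered out by the transform stage.
```

In a production pipeline each stage would typically be a separate Airflow task, so failures can be retried per stage rather than rerunning the whole flow.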
Technical Overview
The environment includes Databricks for big data processing, PySpark for data engineering, Airflow for workflow orchestration, and Kafka for streaming data.
Ideal Candidate
The ideal candidate is a mid-level data engineer with at least 3 years of experience in building and managing large-scale data pipelines using PySpark, Databricks, and Airflow, preferably in a financial services environment. Strong collaboration and problem-solving skills are essential.
Deal Breakers
- No experience with PySpark or Databricks
- Lack of SQL or Python skills
- No experience with data pipelines or ETL
- Location outside New York