Position Details
About this role
This is a senior data engineering role focused on building and scaling high-throughput streaming pipelines and production real estate datasets. You will lead improvements in data quality and observability and contribute to AI-driven tooling that helps triage and resolve data quality issues.
Key Responsibilities
- Build and scale high-throughput streaming pipelines
- Model and deliver production-grade real estate datasets
- Strengthen data quality and observability with monitoring and alerting
- Leverage AI tooling to triage and debug data quality issues
- Drive platform architecture for AI-powered product delivery
Technical Overview
The stack includes Airflow, Spark Streaming, Kafka, and Iceberg for ingesting and operating streaming pipelines. The role also covers dataset modeling and transformation logic, plus data quality checks, monitoring, and alerting to keep data fresh and accurate while keeping costs in check.
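As a rough sketch of what this stack looks like in practice, the snippet below reads listing events from a Kafka topic with Spark Structured Streaming and appends them into an Iceberg table. The broker address, topic, schema, catalog, and table names are hypothetical placeholders, not details from the posting.

```python
# Sketch: Kafka -> Spark Structured Streaming -> Iceberg ingest.
# All names (broker, topic, catalog, table, schema) are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql.functions import col, from_json
from pyspark.sql.types import StructType, StructField, StringType, DoubleType

spark = (
    SparkSession.builder
    .appName("listings-ingest")
    # Assumes the Iceberg Spark runtime jar is on the classpath.
    .config("spark.sql.catalog.lake", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.lake.type", "hadoop")
    .config("spark.sql.catalog.lake.warehouse", "/tmp/warehouse")  # placeholder
    .getOrCreate()
)

# Hypothetical schema for a real estate listing event.
schema = StructType([
    StructField("listing_id", StringType()),
    StructField("price", DoubleType()),
    StructField("updated_at", StringType()),
])

events = (
    spark.readStream
    .format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")  # placeholder broker
    .option("subscribe", "listings")                   # hypothetical topic
    .load()
    # Kafka values arrive as bytes; parse the JSON payload into columns.
    .select(from_json(col("value").cast("string"), schema).alias("e"))
    .select("e.*")
)

# Append micro-batches to the Iceberg table; the checkpoint gives
# exactly-once delivery into the table across restarts.
query = (
    events.writeStream
    .format("iceberg")
    .outputMode("append")
    .option("checkpointLocation", "/tmp/chk/listings")  # placeholder path
    .toTable("lake.real_estate.listings")
)
query.awaitTermination()
```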
Ideal Candidate
The ideal candidate is a senior data engineer who already builds with AI daily and uses Claude Code as a core part of their workflow. They have strong experience designing and operating high-throughput streaming pipelines with Airflow, Spark Streaming, Kafka, and Iceberg, and they improve data quality through checks, monitoring, and alerting.
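For the checks, monitoring, and alerting side, a minimal sketch of a scheduled freshness check in Airflow 2.x is shown below. The table, the 30-minute threshold, and the fetch_freshness_lag_minutes helper are hypothetical illustrations; the failure callback is the standard Airflow hook point for paging or Slack alerts.

```python
# Sketch: a recurring data freshness check with an alerting hook in Airflow 2.x.
from datetime import datetime, timedelta

from airflow import DAG
from airflow.operators.python import PythonOperator


def fetch_freshness_lag_minutes() -> int:
    # Hypothetical helper: in practice, query the Iceberg table's latest
    # snapshot timestamp or a max(updated_at) watermark.
    return 0


def check_freshness(**context):
    # Fail the task when the table falls behind the (hypothetical) SLA.
    lag_minutes = fetch_freshness_lag_minutes()
    if lag_minutes > 30:
        raise ValueError(f"listings table is {lag_minutes} min stale")


def alert_on_failure(context):
    # Standard Airflow failure-callback signature; wire paging/Slack here.
    print(f"ALERT: {context['task_instance'].task_id} failed")


with DAG(
    dag_id="listings_quality_checks",
    start_date=datetime(2024, 1, 1),
    schedule="*/15 * * * *",  # run every 15 minutes (Airflow 2.4+ `schedule`)
    catchup=False,
) as dag:
    PythonOperator(
        task_id="check_freshness",
        python_callable=check_freshness,
        retries=1,
        retry_delay=timedelta(minutes=5),
        on_failure_callback=alert_on_failure,
    )
```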
Deal Breakers
- Must have hands-on experience with Airflow, Spark Streaming, Kafka, and Iceberg
- Must demonstrate experience implementing data quality checks, monitoring, and alerting