Position Details
About this role
This role involves developing and maintaining scalable data pipelines and platforms using Apache Iceberg and related technologies for government clients.
Key Responsibilities
- Develop data pipelines
- Maintain data lake infrastructure
- Implement schema evolution and data versioning
- Troubleshoot data environment issues
- Collaborate with multidisciplinary teams
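One responsibility above, schema evolution, is a first-class, metadata-only operation in Apache Iceberg. As a minimal sketch (the table and column names are hypothetical), it can be expressed as plain DDL in an Iceberg-enabled engine such as Spark SQL:

```sql
-- Add a column; existing data files are untouched (metadata-only change).
ALTER TABLE prod.db.events ADD COLUMN session_id STRING;

-- Widen a type safely (int -> bigint is an allowed promotion in Iceberg).
ALTER TABLE prod.db.events ALTER COLUMN view_count TYPE BIGINT;

-- Rename without rewriting data; Iceberg tracks columns by ID, not by name.
ALTER TABLE prod.db.events RENAME COLUMN ts TO event_ts;
```

Because Iceberg identifies columns by ID rather than by position or name, these changes do not require rewriting data files.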
Technical Overview
The technical environment includes data lake architectures; distributed storage systems such as S3, HDFS, and GCS; query engines such as Presto, Trino, Spark, and Hive; and programming in Python, Java, and Scala.
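In these engines, Iceberg's data versioning surfaces as snapshot-based time travel. A brief sketch in Spark SQL syntax (the table name and snapshot ID are hypothetical):

```sql
-- Query the table as it existed at a past point in time.
SELECT * FROM prod.db.events TIMESTAMP AS OF '2024-01-01 00:00:00';

-- Or pin the query to a specific snapshot ID from the table's history.
SELECT * FROM prod.db.events VERSION AS OF 10963874102873;
```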
Ideal Candidate
The ideal candidate is a mid-level data engineer with at least 2 years of experience in building and maintaining data pipelines, proficient with Apache Iceberg, distributed file systems, and query engines like Presto or Spark. They should have strong troubleshooting skills and a solid understanding of data lake and warehouse architectures.
Deal Breakers
- Lack of experience with Apache Iceberg or similar table formats
- No experience with distributed file systems
- No Bachelor's degree
- Inability to obtain security clearance
- Less than 2 years of relevant experience