Position Details
About this role
A Software Engineer role focused on building and optimizing data pipelines and analytics solutions in a cloud environment, using Java and PySpark.
Key Responsibilities
- Develop and maintain large-scale data processing pipelines and their supporting infrastructure
- Lead data modeling and schema updates
- Lead code reviews and mentorship
- Drive data quality and data accessibility
- Ensure governance and business alignment
Technical Overview
Work with AWS, Snowflake, Redshift, and Spark-based data processing; implement data modeling techniques and CI/CD for data products; gain exposure to both batch and streaming data processing.
Ideal Candidate
The ideal candidate is a data engineer with 2+ years of experience in Java/PySpark, cloud data warehouses, and data modeling, comfortable building scalable data pipelines in a cloud environment.
Deal Breakers
- Less than 2 years of relevant experience
- No exposure to AWS or cloud data warehouses