Position Details

Type Full-Time

Experience mid

Exp. Years 5+ years

Education Bachelor's degree in computer science, engineering, analytics, mathematics, statistics, IT or equivalent

Category Data & Analytics

About this role

Data Engineer III on the Infrastructure Reliability team designs, builds, and scales data pipelines and warehousing to feed ML models, detection systems, and real-time dashboards across Amazon's fulfillment network.

Key Responsibilities

Design, build, and maintain scalable ETL/ELT pipelines that ingest from thousands of sites to ML models and dashboards
Develop and own data models for AI-powered incident detection and remediation orchestration
Partner with data scientists, software engineers, and product managers
Build and operate large-scale data warehouse on AWS
Establish data quality frameworks, monitoring, and governance

Technical Overview

Stack includes AWS data tools (Redshift, S3, Glue, EMR), Spark, Hadoop, Hive; focuses on ETL/ELT pipelines, data modeling, data governance, and cross-team collaboration.

Ideal Candidate

The ideal candidate is a senior data engineer with 5+ years of experience building scalable ETL/ELT pipelines, data warehouses, and ML-ready data, proficient in Python/Java/Scala/NodeJS and AWS data tools.

Must-Have Skills

5+ years of data engineering experienceExperience with data modelingwarehousing and building ETL pipelinesExperience with SQLExperience in at least one modern scripting or programming languagesuch as PythonJavaScalaor NodeJSExperience mentoring team members on best practicesExperience building data products incrementally and integrating and managing data sets from multiple sourcesBachelor's degree in computer scienceengineeringanalyticsmathematicsstatisticsIT or equivalent

Nice-to-Have Skills

Experience with big data technologies such as HadoopHiveSparkEMRExperience operating large data warehousesMaster's degree in computer scienceengineeringanalyticsmathematicsstatisticsIT or equivalent

Tools & Platforms

Amazon Web ServicesRedshiftS3GlueEMRSpark

Required Skills

5+ years of data engineering experiencedata modelingwarehousing and ETL pipelinesSQLPythonJavaScalaNodeJSdata productsdata integrationAWS stack (RedshiftS3GlueEMR)SparkHadoopHive

Hard Skills

SQLPythonJavaScalaNode.jsETL/ELTData modelingData warehousingRedshiftS3GlueEMRSparkHadoopHive

Soft Skills

collaborationmentoringcommunicationproblem-solving

Industry & Role

Industry Cloud & Infrastructure

Job Function Develop scalable data infrastructure on AWS to enable AI-driven reliability and observability for the fulfillment network.

Role Subtype Data Engineer

Tech Domains Amazon Web Services, SQL / PostgreSQL, Python, Java, Scala, Node.js, Spark, Hadoop, Hive, Linux

Keywords for Your Resume

senior data engineerdata engineer iiietleltdata pipelinesdata modelsdata warehousingsqlpythonjavascalanodejssparkhadoophiveemrredshifts3glueawsamazon web servicesdata engineer

Deal Breakers

No experience with AWS data stack (Redshift, S3, Glue, EMR), No SQL experience, Lack of Bachelor's degree in CS/Engineering/Analytics/Math/IT

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile

Senior Data Engineer, Infrastructure Reliability

Get matched to jobs like this