Position Details
About this role
The Data Engineer III on the Infrastructure Reliability team designs, builds, and scales the data pipelines and warehousing that feed ML models, detection systems, and real-time dashboards across Amazon's fulfillment network.
Key Responsibilities
- Design, build, and maintain scalable ETL/ELT pipelines that ingest data from thousands of sites and deliver it to ML models and dashboards
- Develop and own data models for AI-powered incident detection and remediation orchestration
- Partner with data scientists, software engineers, and product managers
- Build and operate a large-scale data warehouse on AWS
- Establish data quality frameworks, monitoring, and governance
Technical Overview
The stack includes AWS data tools (Redshift, S3, Glue, EMR), Spark, Hadoop, and Hive. The work focuses on ETL/ELT pipelines, data modeling, data governance, and cross-team collaboration.
Ideal Candidate
The ideal candidate is a senior data engineer with 5+ years of experience building scalable ETL/ELT pipelines, data warehouses, and ML-ready data, who is proficient in Python/Java/Scala/NodeJS and AWS data tools.
Deal Breakers
- No experience with the AWS data stack (Redshift, S3, Glue, EMR)
- No SQL experience
- Lack of a Bachelor's degree in CS/Engineering/Analytics/Math/IT