Position Details
About this role
Build and maintain foundational data infrastructure and analytics tools for Amazon PeopleInsights eXperience (APIX). Design scalable data pipelines and data lake capabilities that transform complex HR Ops and employee experience data into actionable, self-serve insights.
Key Responsibilities
- Design and implement high performant and cost-efficient data lake infrastructure using AWS big data stack, Spark, Hive, SQL, Apache Airflow, AWS Glue, EMR, S3, Redshift and OLAP technologies
- Collaborate with Business Intelligence Engineers to build semantic layers and optimize SQL queries
- Follow software best practices including coding standards, code reviews, and testing
- Work directly with customers to integrate new data types, curate data profiles, perform data quality checks, and incorporate feedback
- Enable technical and non-technical customers to drive self-serve analytics and ad-hoc reporting; iterate via proof of concepts
Technical Overview
Responsible for high-performance, cost-efficient data lake infrastructure on AWS using Spark, Hive, SQL, Apache Airflow, AWS Glue, Amazon EMR, Amazon S3, and Amazon Redshift with OLAP technologies. Collaborates on semantic layers, query optimization, testing/code reviews, and data quality workflows (profiling and validation), while supporting self-serve reporting.
Ideal Candidate
The ideal candidate is a mid-level Data Engineer experienced building scalable data lake infrastructure and data pipelines in a big data environment. They have hands-on experience with AWS big data stack components (Spark, Hive, SQL, Apache Airflow, AWS Glue, EMR, S3, Redshift) and can optimize SQL for fast, cost-efficient analytics while enabling self-serve reporting and data quality workflows.
Must-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Must have experience building data lakes and data processing services, Must have experience with one or more query language (SQL, PL/SQL, HiveQL, SparkSQL), Must have experience with one or more scripting language (Python, Scala)
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile