Position Details
About this role
In this internship, you will help build and manage a secure, scalable enterprise data lake for government clients, with a focus on data ingestion, cataloging, and visualization.
Key Responsibilities
- Assist in data lake development
- Support data ingestion and cataloging
- Develop data visualizations
- Ensure data quality and security
- Document processes
Technical Overview
The technical environment includes Python, Apache Spark, AWS Glue, Amazon S3, Amazon Athena, and Amazon QuickSight, with a focus on data discovery, cataloging, and secure data processing pipelines.
Ideal Candidate
The ideal candidate is an intern or early-career data engineer with experience in Python and cloud-based data processing tools like Apache Spark and AWS services. They should be familiar with ETL processes, data cataloging, and visualization tools, and able to work securely within government environments.
Deal Breakers
- Lack of experience with Python or cloud data tools
- No ability to obtain Secret clearance
- No relevant internship or experience in data engineering
- Inability to work in a secure environment
- Not scheduled to obtain a Bachelor's degree