✦ Luna Orbit — Cloud & Infrastructure

Lead Data Engineer (Python, Spark, Glue, AWS)

at Discover Financial Services

📍 Richmond, VA 💰 $179K – $204K USD / year Posted April 17, 2026
Salary $179K – $204K USD / year
Type Full-Time
Experience Lead
Exp. Years 4 years+ (application development) and 2 years+ (big data) and 1 year+ (cloud computing)
Education Bachelor's Degree
Category Cloud & Infrastructure

Lead data engineering work at Capital One by designing, developing, testing, and supporting cloud-based data solutions. The role emphasizes Python/Spark-based engineering, big data tooling, and delivering robust analytics platforms in AWS and other public clouds.

  • Collaborate across Agile teams to build and support data engineering solutions
  • Develop using Python, Java, Scala, and database technologies
  • Use cloud data warehouses such as Redshift and Snowflake
  • Apply distributed data/computing frameworks such as Spark and Kafka
  • Write unit tests and tune performance

You will work across Agile teams to build data engineering systems using Python and Spark, leveraging big data frameworks (MapReduce, Hadoop, Hive, EMR, Kafka) and cloud warehousing (Redshift, Snowflake). Responsibilities include unit testing, performance tuning, and working on real-time/streaming data when applicable.

The ideal candidate is a lead data engineer with 4+ years of application development experience and 2+ years in big data technologies, plus hands-on cloud computing experience on Amazon Web Services. They have delivered real-time or streaming data solutions and are comfortable working across Agile teams using Python, Spark, and cloud data warehousing like Redshift or Snowflake.

  • Bachelor's Degree
  • At least 4 years of experience in application development (internship experience does not apply)
  • At least 2 years of experience in big data technologies
  • At least 1 year of experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions

  • 7+ years of experience in application development including Python, SQL, Scala, or Java
  • 4+ years of experience with a public cloud (AWS, Microsoft Azure, Google Cloud)
  • 4+ years of experience with distributed data/computing tools (MapReduce, Hadoop, Hive, EMR, Kafka, Spark, Gurobi, or MySQL)
  • 4+ years of experience working on real-time data and streaming applications
  • 4+ years of experience with NoSQL implementation (Mongo, Cassandra)
  • 4+ years of data warehousing experience (Redshift or Snowflake)
  • 4+ years of experience with UNIX/Linux including basic commands and shell scripting
  • 2+ years of experience with Agile engineering practices
  • Mentoring other members of the engineering community
Python, Java, Scala, Redshift, Snowflake, Amazon Web Services, Microsoft Azure, Google Cloud, MapReduce, Hadoop, Hive, EMR, Kafka, Spark, Gurobi, MySQL, Mongo, Cassandra, UNIX/Linux, shell scripting
Python, Spark, Glue, AWS, Amazon Web Services, Java, Scala, SQL, Redshift, Snowflake, big data technologies, NoSQL databases, Kafka, Hadoop, Hive, EMR, MapReduce, real-time data, streaming applications, UNIX/Linux, shell scripting, Agile
application development, programming languages: Java, programming languages: Scala, programming languages: Python, Open Source RDBMS, NoSQL databases, data warehousing services: Redshift, data warehousing services: Snowflake, big data technologies, cloud computing, Amazon Web Services, Microsoft Azure, Google Cloud, Distributed data/computing tools, MapReduce, Hadoop, Hive, EMR, Kafka, Spark, Gurobi, MySQL, real-time data, streaming applications, NoSQL implementation, Mongo, Cassandra, UNIX/Linux, shell scripting, unit tests, performance tuning, Agile engineering practices, Agile teams, full-stack development tools
collaboration, mentoring, communication, iteration, experimentation, learning new technologies, inclusive mindset, staying on top of tech trends
Industry Fintech
Job Function Design and deliver scalable, cloud-based data engineering platforms.
Role Subtype Data Engineer
Tech Domains Python, Amazon Web Services, Google Cloud Platform, Azure, Linux, SQL / PostgreSQL
Visa Sponsorship No
Lead Data Engineer, Data Engineer, Python, Spark, Glue, AWS, Amazon Web Services, SQL, Scala, Java, Open Source RDBMS, NoSQL databases, NoSQL implementation, Redshift, Snowflake, big data technologies, cloud computing, Microsoft Azure, Google Cloud, Agile, Agile teams, unit tests, performance tuning, distributed data, MapReduce, Hadoop, Hive, EMR, Kafka, Gurobi, MySQL, real-time data, streaming applications

Must have at least 4 years of application development experience, at least 2 years of big data technologies experience, and at least 1 year of cloud computing experience (AWS, Microsoft Azure, Google Cloud).
