Position Details
About this role
This role involves developing next-generation distributed data storage and processing systems at Databricks, focusing on performance and scalability for big data workloads.
Key Responsibilities
- Build distributed data storage systems
- Develop query optimization engines
- Enhance data pipeline performance
- Scale infrastructure across cloud platforms
- Implement ACID transactions and time travel features
Technical Overview
The technical environment includes Java, Scala, C++, Apache Spark, Hadoop, cloud storage platforms like AWS S3 and Azure Blob Storage, with a focus on distributed systems and data engineering.
Ideal Candidate
The ideal candidate is a senior software engineer with 5+ years of experience in Java, Scala, or C++, specializing in distributed data storage and processing systems. They possess strong knowledge of big data frameworks like Apache Spark and Hadoop, and have a solid foundation in algorithms and data structures.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with distributed systems, No experience with big data frameworks (Apache Spark, Hadoop), Less than 5 years of relevant experience, No background in algorithms or data structures
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile