Position Details
About this role
This role focuses on developing high-performance distributed data storage and processing systems to support large-scale data analytics at Databricks.
Key Responsibilities
- Developing distributed data storage systems
- Building data pipelines
- Implementing ACID transactions
- Optimizing data processing performance
- Supporting cloud storage integrations
Technical Overview
The technical environment includes Java, Scala, C++, Apache Spark, Hadoop, cloud storage, and data pipeline technologies, emphasizing scalable and reliable data systems.
Ideal Candidate
The ideal candidate is a senior software engineer with over 5 years of experience in distributed data systems, proficient in Java, Scala, or C++, and experienced with big data frameworks like Spark and Hadoop. They are passionate about building scalable data storage and processing solutions.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 5 years of experience, Lack of experience with distributed systems, No background in big data frameworks
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile