Position Details
About this role
This role involves developing next-generation distributed data storage and processing systems to support advanced analytics and machine learning at Databricks.
Key Responsibilities
- Developing distributed data storage systems
- Optimizing query engines
- Supporting machine learning integration
- Enhancing data processing performance
- Scaling data infrastructure
Technical Overview
The technical environment includes Java, Scala, C++, Apache Spark, cloud storage, distributed systems, and query engine optimization, focusing on scalable and high-performance data solutions.
Ideal Candidate
The ideal candidate is a senior software engineer with over 5 years of experience in distributed data systems, proficient in Java, Scala, or C++, with knowledge of machine learning and cloud infrastructure. They excel at building high-performance data processing engines.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 5 years of experience, No experience with distributed systems, Lack of knowledge in query engines
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile