Position Details
About this role
This role involves building, maintaining, and evolving a high-scale data export system at Braze, focusing on reliability, observability, and scalability.
Key Responsibilities
- Build and maintain scalable systems
- Improve system reliability
- Implement monitoring and observability standards
- Lead incident response and postmortems
- Guide junior engineers in SRE practices
Technical Overview
The technical environment includes Kafka-based event pipelines, Kubernetes, Docker, and various monitoring tools, with a focus on distributed systems and high availability.
Ideal Candidate
The ideal candidate is a senior SRE with at least 5 years of experience in building and maintaining high-scale, distributed systems. They should have strong expertise in Kubernetes, Docker, and monitoring tools, with a focus on reliability and performance.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with Kubernetes or Docker, No background in distributed systems, Less than 5 years of relevant experience, Unwillingness to work onsite in Austin
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile