Position Details
About this role
This role involves maintaining and improving the reliability, automation, and scalability of Braze's infrastructure at massive scale, ensuring high uptime and performance.
Key Responsibilities
- Partner with engineering teams on infrastructure design
- Debug reliability issues
- Develop Infrastructure as Code
- Create deployment pipelines
- Ensure SLAs are met
Technical Overview
The technical environment includes Kubernetes, Terraform, Chef, Docker, Linux, Ruby on Rails, MongoDB, Redis, Kafka, and networking protocols, focusing on automation, monitoring, and distributed systems.
Ideal Candidate
The ideal candidate is a senior SRE with at least 5 years of experience in cloud infrastructure, automation, and distributed systems. They possess strong skills in Kubernetes, Terraform, and automation tools, with a focus on reliability and scalability.
Deal Breakers
- Lack of experience with Kubernetes or Terraform
- No experience with automation tools like Chef or Docker
- Unwillingness to work onsite in Chicago