
Senior Site Reliability Engineer, Currents

at Braze

📍 New York City, Onsite
Posted March 12, 2026
Type Full-Time
Experience Senior
Exp. Years 5+ years
Education Bachelor’s in Computer Science, Software Engineering, or a related STEM field
Category DevOps & SRE

This role involves building, maintaining, and improving large-scale data export systems at Braze, focusing on reliability, scalability, and observability.

  • Build and maintain high-scale data systems
  • Improve system reliability and performance
  • Lead incident response and postmortems
  • Define monitoring standards
  • Support infrastructure and platform engineering

The environment includes Kafka-based event pipelines, Kubernetes or Docker for deployment, and monitoring tools like Sentry, Datadog, and PagerDuty.

The ideal candidate is a senior-level SRE with at least 5 years of experience in deploying and maintaining large-scale, distributed systems. They possess strong skills in Kubernetes, Docker, Kafka, and monitoring tools, with a focus on reliability and incident management.

  • Deploying and monitoring live applications on distributed systems such as Kubernetes or Docker Swarm
  • Working with alerting software (Sentry, Datadog, and/or PagerDuty)
  • Using programming languages (Java, Kotlin, and/or Ruby)

  • Experience with Kafka
  • Experience with data export systems
  • Experience with high-scale systems
Kubernetes, Docker, Sentry, Datadog, PagerDuty
Kubernetes, Docker, Kafka, Monitoring, Reliability, Observability, Incident response, Distributed systems, Data streaming, Scalability, Java, Kotlin, Ruby
Kubernetes, K8s, Docker, Docker Swarm, Monitoring tools, Sentry, Datadog, PagerDuty, Java, Kotlin, Ruby, Distributed systems, Data streaming, Kafka, Event pipeline, Scalability, Reliability, Observability, Incident response, Blameless postmortems
Teamwork, Communication, Problem-solving, Autonomy, Accountability, Collaboration, Agile project leadership
Industry SaaS
Job Function Ensure the reliability and scalability of Braze's data export systems
Site Reliability Engineer, SRE, Kubernetes, Docker, Kafka, Event pipeline, Monitoring, Reliability, Observability, Incident response, Blameless postmortems, Distributed systems, Data streaming, Scalability, Java, Kotlin, Ruby

  • Lack of experience with Kubernetes or Docker Swarm
  • Less than 5 years of relevant experience
  • No experience with monitoring tools like Datadog or PagerDuty
