✦ Luna Orbit — DevOps & SRE

Lead SRE, Site Reliability Engineering

at Klaviyo

📍 Dublin, IE Onsite Posted March 12, 2026
Type Not Specified
Experience lead
Exp. Years Not specified
Education Not specified
Category DevOps & SRE

Lead Site Reliability Engineer responsible for setting technical direction and reliability strategy for critical platforms at Klaviyo in Dublin. Focuses on automation, security, scalability, and operational excellence.

  • Set technical vision and strategy
  • Design and evolve foundational services
  • Drive adoption of SRE best practices
  • Identify systemic reliability risks
  • Lead incident response and capacity planning

Role involves deep systems thinking, building reliable infrastructure, automating operational tasks, and ensuring system security and performance at a global scale using tools like Kubernetes, Prometheus, and Terraform.

The ideal candidate is a senior-level SRE with extensive experience in automation, observability, and reliability engineering at a global scale. They possess strong leadership skills and a deep understanding of infrastructure security and performance optimization.

Site Reliability Engineeringautomationobservabilityincident responsecapacity planningperformance analysis
securityfault tolerancescalabilitylatencyfault tolerance
KubernetesPrometheusGrafanaTerraformAWSGoogle CloudAzure
Site Reliability EngineeringSREautomationobservabilitySLIsSLOsincident responsecapacity planningperformance analysisfault tolerancesecurityinfrastructuresoftware engineeringKubernetesPrometheusGrafanaTerraformAWSGoogle CloudAzure
Site Reliability EngineeringSREautomationobservabilitySLIsSLOserror budgetsincident responsecapacity planningperformance analysisfault tolerancesecurityinfrastructuresoftware engineeringautomation tools
leadershiptechnical visionproblem-solvingcollaborationcommunicationstrategic thinking
Industry SaaS
Job Function Leading reliability and operational excellence for critical SaaS platforms
Site Reliability EngineeringSREautomationobservabilitySLIsSLOserror budgetsincident responsecapacity planningperformance analysisfault tolerancesecurityinfrastructuresoftware engineeringautomation toolsKubernetesPrometheusGrafanaTerraformAWSGoogle CloudAzure

Lack of experience with automation tools, No background in reliability engineering, Unable to work onsite in Dublin

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile