Position Details

Salary $115K – $160K USD / year

Type Full-Time

Experience Senior

Exp. Years 6+ years

Education Not specified

Category DevOps & SRE

About this role

This role involves maintaining and scaling cloud infrastructure, ensuring system reliability, and automating operations across distributed systems.

Key Responsibilities

Design scalable infrastructure
Lead incident response
Implement monitoring and observability
Automate deployment pipelines
Collaborate on system improvements

Technical Overview

The technical environment includes AWS, Kubernetes, Docker, Terraform, and observability tools like Datadog, Prometheus, and Grafana, focusing on automation and incident management.

Ideal Candidate

The ideal candidate is a senior SRE with 6+ years of experience in infrastructure, proficient in Python, AWS, Kubernetes, and Terraform. They should have strong incident response skills and a focus on automation and system reliability.

Must-Have Skills

Python, AWS, Kubernetes, Terraform, CI/CD, Incident Response

Nice-to-Have Skills

Security, Healthcare Environments, Aurora, Database Performance, Legacy System Modernization

Tools & Platforms

Terraform, AWS, Kubernetes, Docker, Datadog, Prometheus, Grafana

Required Skills

Python, Linux, Unix, AWS, EC2, EKS, ECS, S3, IAM, VPC, Kubernetes, Docker, Terraform, CI/CD, Datadog, Prometheus, Grafana, Incident Response, RCA

Hard Skills

Python, Linux, Unix, AWS, Amazon Web Services, EC2, EKS, ECS, S3, IAM, VPC, Kubernetes, Docker, Terraform, CI/CD, Datadog, Prometheus, Grafana, Incident Response, Root Cause Analysis

Soft Skills

Collaboration, Ownership, Problem-solving, Technical Guidance, Mentorship

Industry & Role

Industry Technology / SaaS

Job Function Maintain and improve cloud infrastructure and system reliability

Keywords for Your Resume

Site Reliability Engineer, SRE, Python, Linux, Unix, AWS, Amazon Web Services, EC2, EKS, ECS, S3, IAM, VPC, Kubernetes, Docker, Terraform, CI/CD, Datadog, Prometheus, Grafana, Incident Response, Root Cause Analysis, Monitoring, Observability, Automation

Deal Breakers

Less than 6 years of relevant experience
Lack of AWS or Kubernetes expertise
No experience with incident response or root cause analysis

Senior Site Reliability Engineer