Position Details

Salary $141K – $230K USD / year

Type Full-Time

Experience senior

Exp. Years 5+ years

Education Bachelor’s or Master’s degree in Computer Science

Category Cloud & Infrastructure

About this role

A senior site reliability engineer role focused on maintaining and improving cloud infrastructure reliability, scalability, and performance across multiple cloud providers.

Key Responsibilities

Design and implement scalable cloud systems
Manage incident response and post-mortems
Ensure system reliability and performance
Collaborate across teams to improve infrastructure
Drive chaos engineering initiatives

Technical Overview

Environment includes AWS, Google Cloud Platform, Azure, Kubernetes, Terraform, with responsibilities in incident management, chaos engineering, and system monitoring.

Ideal Candidate

The ideal candidate is a senior SRE with 5+ years experience in cloud infrastructure, proficient in AWS, Kubernetes, Terraform, and incident management, with strong leadership and problem-solving skills in a remote setting.

Must-Have Skills

CloudAWSKubernetesTerraformIncident Management

Nice-to-Have Skills

GCPGoogle Cloud PlatformAzureChaos EngineeringMonitoring

Tools & Platforms

AWSGoogle Cloud PlatformAzureKubernetesTerraformPulumi

Required Skills

CloudAWSAmazon Web ServicesGCPGoogle Cloud PlatformAzureKubernetesTerraformincident managementmonitoring

Hard Skills

CloudAWSAmazon Web ServicesGCPGoogle Cloud PlatformAzureKubernetesTerraformPulumiMonitoringIncident ManagementSLOsSLAsChaos Engineering

Soft Skills

collaborationproblem-solvingcommunicationleadershipcontinuous improvement

Industry & Role

Industry Technology/Cloud Computing

Job Function Maintain and optimize cloud infrastructure reliability

Role Subtype Site Reliability Engineer

Tech Domains Amazon Web Services, Google Cloud Platform, Azure, Kubernetes, Terraform

Keywords for Your Resume

site reliability engineerSREcloud infrastructureAWSAmazon Web ServicesGCPGoogle Cloud PlatformAzureKubernetesTerraformincident managementSLOsSLAschaos engineeringmonitoringreliabilityperformancedistributed systems

Deal Breakers

Lack of experience with Kubernetes or Terraform, No experience with incident management or SLOs, Unwillingness to work remotely

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile

Senior Site Reliability Engineer- Remote

Get matched to jobs like this