Site Reliability Manager - Remote

at Kairos Technologies

📍 Remote, US (Remote)  💰 $70 – $80 USD / year  Posted April 10, 2026
Salary $70 – $80 USD / year
Type Contract
Experience lead
Exp. Years 8+ years
Category DevOps & SRE

Senior SRE Manager to lead a team of reliability engineers responsible for the uptime and efficiency of customer-facing platforms. The role defines SLOs, runs incident management, builds robust observability, and drives automation and security alignment.

  • Lead & grow the team
  • Own reliability strategy
  • Operate the platform
  • Incident management
  • Observability and automation

Cloud-native, multi-cloud with Kubernetes; IaC (Terraform/CloudFormation/Bicep); CI/CD pipelines; observability stack (Datadog, Dynatrace, Prometheus, Grafana, New Relic); incident response and postmortems.

The ideal candidate is a seasoned SRE/DevOps leader with 8+ years of experience who can define and enforce SLOs, manage incidents, and drive a blameless engineering culture across cloud platforms (AWS/Azure/GCP) and Kubernetes.

  • 8+ years in software/platform/reliability engineering
  • 2–4 years leading SRE/DevOps/Platform teams
  • Experience operating large-scale services on AWS/Azure/GCP with Kubernetes and containers
  • Linux fundamentals
  • IaC (Terraform/CloudFormation/Bicep)
  • CI/CD (GitHub Actions/CircleCI/Azure DevOps)
  • Observability (metrics, logs, traces) and alerting
  • On-call programs
  • Communication and stakeholder management

  • Chaos engineering
  • Load testing
  • Security/compliance partnerships
  • Cost optimization
Datadog, Dynatrace, Prometheus, Grafana, New Relic, Terraform, CloudFormation, Bicep, GitHub Actions, CircleCI, Azure DevOps, Amazon Web Services, Microsoft Azure, Google Cloud Platform, Kubernetes
8+ years in software/platform/reliability engineering; 2–4 years leading SRE/DevOps/Platform teams; AWS/Azure/GCP with Kubernetes; Linux; IaC; CI/CD; observability; on-call; security practices
SLOs, error budgets, incident management, observability, APM, Datadog, Dynatrace, Prometheus, Grafana, New Relic, IaC, Terraform, CloudFormation, Bicep, CI/CD, GitHub Actions, CircleCI, Azure DevOps, Kubernetes, Linux, AWS, Azure, GCP, least-privilege, secrets management
leadership, coaching, stakeholder management, communication, data-driven decision making
Industry Technology
Job Function Lead reliability engineering for scalable, observable cloud platforms
Role Subtype Site Reliability Engineer
Tech Domains Amazon Web Services, Kubernetes, Linux, CI/CD, Terraform, Prometheus, Datadog, Grafana, Azure, Google Cloud Platform
site reliability manager, sre manager, remote, contract, 6 months, slos, slis, error budgets, incident management, observability, apm, datadog, dynatrace, prometheus, grafana, new relic, IaC, terraform, cloudformation, bicep, ci/cd, github actions, circleci, azure devops, kubernetes, linux, aws, azure, gcp, least-privilege, secrets management, sre, sla