Position Details

Type Full-Time

Experience Mid-level

Exp. Years 3+ years

Education Not specified

Category System Administration

About this role

This role involves maintaining and improving the reliability of internal services and platforms, automating infrastructure, and troubleshooting scalability issues within a large-scale SaaS environment.

Key Responsibilities

Ensure system uptime
Automate infrastructure
Troubleshoot scalability issues
Develop monitoring systems
Collaborate with engineering teams

Technical Overview

The role covers infrastructure automation, cloud infrastructure, distributed systems, and monitoring, using tools such as Kubernetes, Terraform, Docker, and Ruby on Rails.

Ideal Candidate

The ideal candidate is a mid-level Site Reliability Engineer with at least 3 years of experience in infrastructure automation, cloud environments, and distributed systems. They are proficient with tools like Kubernetes, Terraform, and Docker, and capable of troubleshooting scalability issues.

Must-Have Skills

Experience in Site Reliability Engineering
Proficiency with infrastructure automation tools
Experience with cloud infrastructure
Ability to troubleshoot scalability issues
Experience with monitoring and alerting

Nice-to-Have Skills

Experience with Ruby on Rails
MongoDB
Redis
Kafka
Kubernetes
Docker
Linux system administration

Tools & Platforms

Chef, Terraform, Kubernetes, Docker, Ruby on Rails, MongoDB, Redis, Kafka

Required Skills

Site Reliability Engineering, SRE, Infrastructure as Code, Chef, Terraform, Kubernetes, Docker, Linux, Ruby on Rails, MongoDB

Hard Skills

Site Reliability Engineering, SRE, Infrastructure as Code, Chef, Terraform, Kubernetes, Docker, Linux, Ruby on Rails, MongoDB, Redis, Kafka, Monitoring, Alerting, Automation, Distributed Systems, Scalability, Networking

Soft Skills

Problem-solving, collaboration, analytical thinking, communication, automation mindset, adaptability

Industry & Role

Industry Technology / SaaS

Job Function Site reliability engineering and infrastructure automation

Keywords for Your Resume

Site Reliability Engineer, SRE, Infrastructure as Code, Chef, Terraform, Kubernetes, Docker, Linux, Ruby on Rails, MongoDB, Redis, Kafka, Monitoring, Alerting, Automation, Distributed Systems, Scalability, Networking

Deal Breakers

Less than 3 years of SRE experience, Lack of experience with automation tools, No familiarity with cloud infrastructure, Unwilling to work onsite in New York City
