Position Details
About this role
This role involves maintaining and improving the reliability of internal services and platforms, automating infrastructure, and troubleshooting scalability issues within a large-scale SaaS environment.
Key Responsibilities
- Ensure system uptime
- Automate infrastructure
- Troubleshoot scalability issues
- Develop monitoring systems
- Collaborate with engineering teams
Technical Overview
The role covers infrastructure automation, cloud infrastructure, distributed systems, and monitoring, using tools such as Kubernetes, Terraform, Docker, and Ruby on Rails.
Ideal Candidate
The ideal candidate is a mid-level Site Reliability Engineer with at least 3 years of experience in infrastructure automation, cloud environments, and distributed systems. They are proficient with tools like Kubernetes, Terraform, and Docker, and capable of troubleshooting scalability issues.
Deal Breakers
- Less than 3 years of SRE experience
- Lack of experience with automation tools
- No familiarity with cloud infrastructure
- Unwilling to work onsite in New York City