Position Details
About this role
A senior site reliability engineer role focused on maintaining and improving cloud infrastructure reliability, scalability, and performance across multiple cloud providers.
Key Responsibilities
- Design and implement scalable cloud systems
- Manage incident response and post-mortems
- Ensure system reliability and performance
- Collaborate across teams to improve infrastructure
- Drive chaos engineering initiatives
Technical Overview
Environment includes AWS, Google Cloud Platform, Azure, Kubernetes, Terraform, with responsibilities in incident management, chaos engineering, and system monitoring.
Ideal Candidate
The ideal candidate is a senior SRE with 5+ years experience in cloud infrastructure, proficient in AWS, Kubernetes, Terraform, and incident management, with strong leadership and problem-solving skills in a remote setting.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with Kubernetes or Terraform, No experience with incident management or SLOs, Unwillingness to work remotely
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile