Position Details
About this role
This role focuses on improving system reliability through observability, automation, and incident management within a cloud and containerized environment.
Key Responsibilities
- Implement SRE practices
- Enhance observability
- Respond to incidents
- Automate system management
- Improve operational efficiency
Technical Overview
The position involves implementing SRE methodologies, observability tools, and automation on cloud platforms like Kubernetes, with a focus on system resilience and performance.
Ideal Candidate
The ideal candidate is a mid-level SRE professional with 3+ years experience in system reliability, observability, and automation. They should be skilled in incident response, disaster recovery, and working with cloud and container platforms.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with SRE practices, No knowledge of observability tools, Inability to respond to system incidents
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile