Position Details
About this role
This role involves ensuring the reliability and scalability of GitLab's production systems through automation, incident response, and infrastructure management.
Key Responsibilities
- Design scalable infrastructure
- Respond to incidents
- Automate operational tasks
- Improve reliability
- Collaborate across teams
Technical Overview
Focuses on Kubernetes, system administration, and automation to support highly scalable, reliable SaaS infrastructure, with emphasis on incident management and operational automation.
Ideal Candidate
The ideal candidate is a mid-level Site Reliability Engineer with expertise in Kubernetes, system administration, and automation. They excel at maintaining reliable infrastructure, responding to incidents, and improving system performance in a remote setting.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of Kubernetes or system administration experience, No automation skills, Inability to work remotely
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile