Position Details
About this role
This role involves leading the development and operation of reliable, scalable cloud infrastructure and automation frameworks, supporting global customers with self-healing capabilities.
Key Responsibilities
- Design and operate cloud infrastructure
- Lead automation initiatives
- Troubleshoot production issues
- Develop monitoring and observability tools
- Drive self-healing infrastructure
Technical Overview
The position requires expertise in infrastructure as code, Kubernetes, cloud platforms (GCP, AWS), monitoring tools, and scripting languages, with a focus on automation, observability, and reliability engineering.
Ideal Candidate
The ideal candidate is a senior Site Reliability Engineer with over 7 years of experience in infrastructure, DevOps, and SRE practices. They should be proficient with Kubernetes, Terraform, and cloud platforms like GCP and AWS, with strong troubleshooting and automation skills.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 7 years of relevant experience, No experience with Kubernetes or Terraform, Lack of cloud platform knowledge, No automation or scripting skills
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile