Position Details
About this role
This role involves managing and improving cloud infrastructure on Azure, ensuring system reliability, and enhancing observability through monitoring and incident management.
Key Responsibilities
- Manage Azure cloud infrastructure
- Build and improve CI/CD pipelines
- Monitor system health
- Respond to incidents
- Collaborate on capacity planning
Technical Overview
The technical environment includes Azure cloud services, AKS, Kubernetes, Grafana, Prometheus, Pulumi, and incident response tools, focusing on scalable, reliable infrastructure.
Ideal Candidate
The ideal candidate is a mid-level SRE with 3+ years experience managing Azure cloud infrastructure, Kubernetes, and monitoring tools like Grafana and Prometheus. They should be proactive in incident response and capacity planning.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 3 years SRE experience, No experience with Azure or Kubernetes, Lack of monitoring and incident response skills, No familiarity with Pulumi or infrastructure as code
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile