Position Details
About this role
Cloud Site Reliability Engineer (SRE) to drive reliability, scalability, and performance of cloud-based infrastructure in Bangalore, focusing on automation and incident response.
Key Responsibilities
- Design and maintain fault-tolerant cloud architectures across AWS/Azure/GCP
- Deploy, manage, and optimize cloud resources using IaC (Terraform, Ansible)
- Implement monitoring, alerting, and logging
- Lead incident response for outages
- Build automation to reduce toil and improve reliability
Technical Overview
Stack includes AWS/Azure/GCP; IaC with Terraform/Ansible; monitoring with Splunk, Azure Monitor, Dynatrace, AWS CloudWatch; Linux/Windows; scripting in Python/PowerShell/Bash; capacity planning and autoscaling
Ideal Candidate
Senior cloud SRE with 10+ years, strong incident response, cloud platforms across AWS/Azure/GCP, automation via IaC, and mentoring capabilities.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Certifications
Preferred
Industry & Role
Keywords for Your Resume
Deal Breakers
10+ years experience in Cloud SRE, Hands-on experience with AWS/Azure/GCP, On-site in Bangalore
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile