Position Details

Type Not Specified

Experience Senior

Exp. Years 10+

Education Not specified

Category DevOps & SRE

About this role

Bangalore-based Cloud Site Reliability Engineer (SRE) role driving the reliability, scalability, and performance of cloud infrastructure, with a focus on automation and incident response.

Key Responsibilities

Design and maintain fault-tolerant cloud architectures across AWS/Azure/GCP
Deploy, manage, and optimize cloud resources using IaC (Terraform, Ansible)
Implement monitoring, alerting, and logging
Lead incident response for outages
Build automation to reduce toil and improve reliability

Technical Overview

Stack includes AWS/Azure/GCP; IaC with Terraform/Ansible; monitoring with Splunk, Azure Monitor, Dynatrace, AWS CloudWatch; Linux/Windows; scripting in Python/PowerShell/Bash; capacity planning and autoscaling

Ideal Candidate

Senior cloud SRE with 10+ years, strong incident response, cloud platforms across AWS/Azure/GCP, automation via IaC, and mentoring capabilities.

Must-Have Skills

Hands-on programming/scripting (Python, PowerShell, Bash)
Cloud platforms: AWS, Azure, or GCP
Kubernetes
Docker
Terraform
Ansible
Splunk or equivalent
Azure Monitor, Dynatrace, or similar
Windows and Linux/Unix
Incident response and post-incident reviews

Nice-to-Have Skills

Chaos engineering or resilience testing
Multicloud or hybrid deployments
SLOs/SLIs/error budgets
Azure DevOps Engineer or other cloud certifications

Tools & Platforms

Amazon Web Services, Microsoft Azure, Google Cloud Platform, Kubernetes, Docker, Terraform, Ansible, Splunk, Azure Monitor, Dynatrace, AWS CloudWatch

Required Skills

10+ years of cloud SRE experience; hands-on AWS/Azure/GCP; Kubernetes; Docker; Terraform; Ansible; monitoring/observability; incident response; post-incident reviews; Python; PowerShell; Bash; Windows; Linux

Hard Skills

Python, PowerShell, Bash, AWS, Microsoft Azure, Google Cloud Platform, Kubernetes, Docker, Terraform, Ansible, Splunk, Azure Monitor, Dynatrace, AWS CloudWatch, Windows, Linux/Unix, VPC, IAM, RBAC, Encryption, Post-incident reviews

Soft Skills

Strong communication, Team collaboration, Leadership, Mentoring, Problem-solving

Certifications

Preferred

AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer, Azure DevOps Engineer

Industry & Role

Industry Fintech

Job Function Design, implement, and operate reliable, scalable cloud infrastructure with strong SRE discipline across a Bangalore-based team.

Role Subtype Site Reliability Engineer

Tech Domains Amazon Web Services, Microsoft Azure, Google Cloud Platform, Kubernetes, Docker, Terraform, Ansible, Splunk, Azure Monitor, Dynatrace

Keywords for Your Resume

cloud site reliability engineer, sre, l2, bangalore, onsite, kubernetes, docker, terraform, ansible, monitoring, observability, incident management, post-incident reviews, python, powershell, bash, windows, linux, rbac, encryption

Deal Breakers

10+ years experience in Cloud SRE, Hands-on experience with AWS/Azure/GCP, On-site in Bangalore
