About this role
This role involves supporting infrastructure reliability, automating deployment and monitoring processes, and ensuring high availability of banking and payment systems.
Key Responsibilities
- Design monitoring solutions
- Automate infrastructure tasks
- Lead incident response
- Conduct capacity planning
- Collaborate on security and deployment
Technical Overview
The technical scope includes cloud platforms (AWS, Azure, GCP), infrastructure automation with Terraform, monitoring tools like Prometheus and DataDog, scripting in Python and Bash, and CI/CD pipelines.
Ideal Candidate
The ideal candidate is a proactive Site Reliability Engineer with experience in cloud environments, automation, monitoring, and incident management, capable of supporting large-scale infrastructure and ensuring application reliability.
Must-Have Skills
Site Reliability EngineeringSREMonitoringAutomationIncident responseCloud platformsTerraformPythonCI/CD
Nice-to-Have Skills
AzureGoogle CloudDataDogSplunkELK StackAnsible
Tools & Platforms
AWSAzureGoogle CloudTerraformPrometheusGrafanaDataDogSplunkELK StackJenkinsGitLab CI/CDHarnessAzure DevOps
Required Skills
Site Reliability EngineeringSREMonitoringInfrastructureAutomationIncident responseCapacity planningPerformance tuningDisaster recoveryBackup strategiesSecurityDeployment pipelinesConfiguration managementCloud platformsAWSAzureGoogle CloudTerraformPrometheusGrafanaDataDogSplunkELK StackScriptingPythonBashCI/CDHarnessJenkinsGitLab CI/CDAzure DevOps
Hard Skills
Site Reliability EngineeringSREMonitoringInfrastructureApplication performanceAutomationIncident responseCapacity planningPerformance tuningDisaster recoveryBackup strategiesSecurityDeployment pipelinesConfiguration managementCloud platformsAWSAzureGoogle CloudTerraformPrometheusGrafanaDataDogSplunkELK StackScriptingPythonBashCI/CDHarnessJenkinsGitLab CI/CDAzure DevOps
Soft Skills
collaborativetroubleshootingcommunicationnegotiationinfluencingownershipproblem-solving
Keywords for Your Resume
Site Reliability EngineerSREMonitoringInfrastructureApplication performanceAutomationIncident responseCapacity planningPerformance tuningDisaster recoveryBackup strategiesSecurityDeployment pipelinesConfiguration managementCloud platformsAWSAzureGoogle CloudTerraformPrometheusGrafanaDataDogSplunkELK StackScriptingPythonBashCI/CDHarnessJenkinsGitLab CI/CDAzure DevOpsSite Reliability Engineering
Deal Breakers
Lack of experience with cloud platforms (AWS, Azure, GCP), No scripting or automation skills, Unfamiliar with monitoring tools like Prometheus or DataDog, No experience with CI/CD pipelines
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile