Position Details
About this role
This role involves ensuring system reliability, automating infrastructure, and managing incident response within a financial services environment using cloud and container technologies.
Key Responsibilities
- Drive reliability and performance
- Automate infrastructure
- Collaborate with cross-functional teams
- Lead incident management
- Build monitoring systems
Technical Overview
The stack includes cloud platforms (Azure, AWS, GCP), Kubernetes, Terraform, and monitoring tools (Prometheus, Grafana), with scripting in Python, Go, or Java and a focus on SRE principles.
Ideal Candidate
The ideal candidate is a mid-level SRE with strong expertise in cloud infrastructure (Azure, AWS, GCP), container orchestration (Kubernetes), and observability tools. They should have experience automating infrastructure, managing incidents, and mentoring engineering teams in a fast-paced financial environment.
Deal Breakers
- Lack of experience with cloud infrastructure (Azure, AWS, GCP)
- No experience with Kubernetes or Terraform
- No familiarity with monitoring tools like Prometheus or Grafana
- Lack of scripting/programming skills in Python, Go, or Java