Position Details

Type Not Specified

Experience mid

Exp. Years 3+ years

Education Not specified

Category DevOps & SRE

About this role

Site Reliability Engineer (SRE) to join the Cloud Infrastructure team to improve reliability and scalability of self-service platforms, with a focus on Kubernetes-based workloads and modern CI/CD practices.

Key Responsibilities

Collaborate with internal customers and partners to deliver key business outcomes
Ensure cloud products are reliable, scalable, and secure
Enhance observability across cloud services
Respond to cloud incidents with root cause analysis
Drive CI/CD improvements

Technical Overview

Stack emphasizes Go and Python programming, Kubernetes and Kubernetes controllers, RESTful APIs, CI/CD, and observability to support large-scale cloud services.

Ideal Candidate

The ideal candidate is a mid-level SRE with 3+ years of cloud-native experience, strong Go/Python skills, and solid Kubernetes expertise including controllers. They should excel in observability, incident response, and CI/CD improvements within a fast-paced, open-source-friendly environment.

Must-Have Skills

Minimum of 3+ years of programming experience with Go or PythonExtensive experience with KubernetesExperience developing with Kubernetes and/or building Kubernetes controllers

Nice-to-Have Skills

Certifications in KubernetesUnderstanding application lifecycle managementCI/CD experienceOpen-source contributionsAgile development methodologies

Tools & Platforms

KubernetesGoPythonRESTCI/CD

Required Skills

GoGolangPythonKubernetesKubernetes controllersRESTful APIAPI designWeb servicesCI/CDObservabilityIncident responseRoot cause analysisOpen sourceAgile development

Hard Skills

GoGolangPythonKubernetesKubernetes controllersRESTful APIAPI designWeb servicesCI/CDObservabilityIncident responseRoot cause analysisOpen sourceAgile development

Soft Skills

CollaborationCommunicationProblem-solvingTeamworkAdaptability

Certifications

Preferred

Kubernetes Certification

Industry & Role

Industry E-commerce

Job Function Site Reliability Engineering for cloud infrastructure and Kubernetes-based platforms

Role Subtype Site Reliability Engineer

Tech Domains Kubernetes, Python, Go, RESTful API, CI/CD, Observability, Web services

Keywords for Your Resume

Site Reliability Engineer (SRE)cloud infrastructureGoGolangPythonKubernetesKubernetes controllersRESTful APIweb servicesCI/CDobservabilityincident responseroot cause analysisautomationopen sourceagile developmentKubernetes Certificationsopen-source standardssite reliability engineersrekubernetesgopythonrestful apici/cd

Deal Breakers

3+ years of Go or Python, Kubernetes experience, Experience with Kubernetes controllers

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile

Software Engineer - Cloud SRE

Get matched to jobs like this