Position Details

Type Full-Time

Experience Senior

Exp. Years 6+ years

Education Not specified

Category DevOps & SRE

About this role

A senior software engineering role in reliability engineering, focused on building and maintaining production services, with a strong emphasis on observability, automation, and on-call incident response.

Key Responsibilities

Improve observability, reliability and availability by defining and measuring key metrics
Build automation and improve systems to eliminate toil and operations work
Collaborate with core infrastructure to performance tune and optimize cloud deployments
Automate incident response and reduce service disruptions
Participate in on-call support rotation

Technical Overview

Hands-on work with containerization and cloud platforms, including Docker, Kubernetes, Terraform, and AWS/GCP/Azure; strong command of observability tooling (Datadog, Kibana) and readiness for incident response.

Ideal Candidate

The ideal candidate is a senior site reliability engineer with 6+ years building and operating production services, strong observability and debugging skills, and experience across AWS/GCP/Azure. They should be comfortable with on-call rotations and thrive in a fast-paced, regulated fintech environment.

Must-Have Skills

6+ years of software engineering experience
Designing, building, scaling and maintaining production services
Strong observability and debugging skills

Nice-to-Have Skills

Ruby, Go, and Terraform experience
AWS, GCP, or Azure cloud experience
Experience in regulated environments
Crypto-forward / on-chain familiarity

Tools & Platforms

Docker, Terraform, Kubernetes, AWS, GCP, Azure

Required Skills

6+ years of software engineering, reliability engineering, observability, debugging, Docker, Kubernetes, Terraform, AWS, Google Cloud Platform, Azure, Datadog, Kibana

Hard Skills

Observability, Debugging, Performance tuning, Docker, Terraform, Kubernetes, AWS, GCP, Azure, Cloud deployments, On-call, SRE practices, Go, Ruby

Soft Skills

Communication, Collaboration, Mentorship, Problem solving, Leadership

Industry & Role

Industry Finance / Fintech

Job Function Build reliable, observable, scalable cloud services and manage incident response

Role Subtype Site Reliability Engineer

Tech Domains Docker, Terraform, Kubernetes, Amazon Web Services, Google Cloud Platform, Azure

Keywords for Your Resume

Senior Software Engineer, SRE, Coinbase, Reliability Engineering, Docker, Kubernetes, Terraform, AWS, GCP, Azure, Datadog, Kibana, Observability, On-call, Hybrid, Remote, Crypto, on-call rotation, production systems, cloud deployments, Site Reliability Engineer

Deal Breakers

Must be able to participate in on-call rotations

Senior Software Engineer, SRE