About this role
This role is for a Site Reliability Engineer (SRE) to support a greenfield initiative within the Trade Compliance and Innovation team. You will act as the primary SRE for one squad and support a second squad, partnering with Dev and QA to build reliable infrastructure from development through production.
Key Responsibilities
- Apply software engineering practices to IT operations to maintain scalable, secure, highly available production environments
- Bridge development and operations by applying engineering rigor to system administration and infrastructure management
- Design, build, and support infrastructure across DEV, Test, and Production environments
- Develop and maintain code-based automation for log analysis, monitoring, test environments, and incident response
- Implement and manage Infrastructure as Code (IaC) using Terraform and support deployments of Java and Python microservices
Technical Overview
You will apply software engineering to IT operations, build automation for logs and monitoring, and manage Infrastructure as Code (IaC) with Terraform. The stack includes Java and Python microservices, containerized workloads, blue-green deployments, and Azure services including Azure Container Apps (ACA) and Azure Kubernetes Service (AKS), with observability using ELK and Grafana.
Ideal Candidate
The ideal candidate is a senior Site Reliability Engineer (SRE) who can own reliability for a squad in a distributed setup. They have hands-on DevOps and cloud infrastructure experience, strong automation skills with Terraform, and operational expertise with Java and Python microservices on Kubernetes and Azure services. They are highly capable in monitoring, logging, observability, and incident response.
Must-Have Skills
Infrastructure as Code (IaC), Terraform, Automation using code, log analysis, incident response, blue-green deployments, Azure Container Apps (ACA), Kubernetes platforms (AKS), monitoring, logging, observability, ELK, Grafana, Java, Python, microservices, containerized workloads
Tools & Platforms
Terraform, Azure Container Apps (ACA), Kubernetes platforms (AKS), AKS, Azure Functions, ELK, Grafana
Required Skills
software engineering practices to IT operations, system administration, infrastructure management, automation using code, log analysis, monitoring systems, incident response, Infrastructure as Code (IaC), Terraform, Java, Python, microservices, containerized workloads, blue-green deployments, scaling strategies, resiliency, security postures, Azure Container Apps (ACA), Kubernetes platforms (AKS), Azure Functions, messaging systems, webhooks, distributed integrations, monitoring, logging, observability, ELK, Grafana, DEV, Test, Production
Hard Skills
software engineering practices to IT operations, system administration, infrastructure management, scalable, secure, highly available production environments, infrastructure across DEV environments, infrastructure across Test environments, infrastructure across Production environments, automation using code, log analysis, monitoring systems, test environments, incident response, Infrastructure as Code (IaC), Terraform, Java, Python, microservices, containerized workloads, blue-green deployments, scaling strategies, horizontal scaling, vertical scaling, resiliency, security postures, Azure Container Apps (ACA), Kubernetes platforms (AKS), Kubernetes, AKS, Azure Functions, messaging systems, webhooks, distributed integrations, monitoring, logging, observability, ELK, Grafana
Soft Skills
independently research, independently design, independently implement solutions, partner closely with development teams, partner closely with QA teams, partner closely with global Dev, partner closely with global QA, partner closely with global SRE team members, ability to learn and apply new technologies, ability to resolve infrastructure and reliability issues
Keywords for Your Resume
Site Reliability Engineer, SRE, DevOps, DevOps and SRE, infrastructure as code, Infrastructure as Code (IaC), Terraform, Java, Python, microservices, containerized workloads, blue-green deployments, Azure Container Apps (ACA), AKS, Kubernetes, Azure Functions, messaging systems, webhooks, ELK, Grafana, monitoring, logging, observability, incident response, scalable, highly available production environments
Deal Breakers
- Must be able to manage Infrastructure as Code (IaC) using Terraform
- Must have experience supporting Azure Container Apps (ACA) and Kubernetes platforms (AKS)
- Must have experience with blue-green deployments