About this role
Site Reliability Engineer focusing on improving software infrastructure, monitoring, and deployment automation across cloud platforms, with emphasis on Kubernetes-based data pipelines.
Key Responsibilities
- Improve software development infrastructure and systems through monitoring, performance tuning, and cost optimization
- Guide technical direction and collaborate with development teams for deployment and operation
- Build scalable data pipeline processing and cloud infrastructure components
- Implement CI/CD strategies for consumer teams and resolve operational issues
- Onboard and support API applications for end-to-end deployment
Technical Overview
The role requires containerized deployments using Kubernetes and Docker; cloud-native tooling; logs and metrics with CloudWatch, Elasticsearch, and Splunk; and automation via Terraform, Ansible, and Airflow. It also includes Azure AKS and data services such as Cosmos DB and Data Factory.
Ideal Candidate
The ideal candidate is a senior site reliability engineer with hands-on experience in Kubernetes and cloud-native tooling, able to design and operate scalable data pipelines with strong automation and monitoring skills.
Must-Have Skills
- Containerize and deploy applications using Kubernetes and Docker
- CloudWatch, Elasticsearch, and Splunk
- Configure and deploy RedHat Linux servers
- Support Azure DevOps with IaaS, SaaS, and PaaS platforms
- Install and monitor Gradle, Kibana, Maven, Kafka, Cassandra, and ServiceNow servers
- Virtual Machines, App Services, and Azure Kubernetes Service (AKS)
- Create and maintain ConfigMaps, Secrets, Certificate manager, Ingress, and Cron Jobs using Kubernetes Clusters and Namespaces
- Helm charts, Python, Bash scripting, Terraform, Airflow, and Ansible
- Build data pipelines and analytics solutions using Azure SQL, Cosmos DB, and Data Factory
Tools & Platforms
Kubernetes, Docker, Azure DevOps, AKS, CloudWatch, Elasticsearch, Splunk, RedHat Linux, Terraform, Airflow, Ansible, Kibana, Maven, Gradle, Kafka, Cassandra, Cosmos DB, Data Factory, Azure SQL, ServiceNow
Required Skills
Kubernetes, Docker, CloudWatch, Elasticsearch, Splunk, RedHat Linux, Azure DevOps, Azure Kubernetes Service, Terraform, Airflow, Ansible, Python, Bash scripting, Data Factory, Cosmos DB, Azure SQL, Kafka, Cassandra, ServiceNow, Gradle, Maven, Kibana, Web Services, AKS, ConfigMaps, Secrets, Ingress, Cron Jobs, Helm charts, VM, App Services, Fortify
Hard Skills
Kubernetes, Docker, CloudWatch, Elasticsearch, Splunk, RedHat Linux, Azure DevOps, IaaS, SaaS, PaaS, Gradle, Kibana, Maven, Kafka, Cassandra, ServiceNow, VMs, App Services, Azure Kubernetes Service, ConfigMaps, Secrets, Certificate manager, Ingress, Cron Jobs, Helm charts, Python, Bash scripting, Terraform, Airflow, Ansible, Azure SQL, Cosmos DB, Data Factory
Soft Skills
Strong communication, Collaborative teamwork, Problem solving, Proactive, Adaptability
Keywords for Your Resume
Software Engineer (Site Reliability Engineering), Site Reliability Engineering, Kubernetes, Docker, CloudWatch, Elasticsearch, Splunk, RedHat Linux, Azure DevOps, AKS, Terraform, Airflow, Ansible, Python, Bash scripting, ConfigMaps, Secrets, Certificate manager, Ingress, Cron Jobs, Helm charts, Azure, Azure SQL, Cosmos DB, Data Factory, Kafka, Kibana, Maven, Gradle, VMs, App Services, Azure Kubernetes Service
Deal Breakers
Master’s degree required; 3 years of experience in a related occupation; visa sponsorship not specified