Position Details
About this role
This role focuses on developing and maintaining a scalable observability platform using Splunk, Grafana, and automation tools. The engineer will optimize data pipelines, dashboards, and incident workflows in a large distributed system environment.
Key Responsibilities
- Design and optimize Splunk environments
- Develop Grafana dashboards
- Automate deployment with Terraform
- Enhance telemetry data pipelines
- Participate in incident reviews
Technical Overview
The technical environment includes Splunk, Grafana, Terraform, with scripting in Go, Python, and Ruby. The focus is on automation, performance tuning, and scalable telemetry data management.
Ideal Candidate
The ideal candidate is a senior Site Reliability Engineer with extensive experience in Splunk, Grafana, and infrastructure automation using Terraform. They should possess scripting skills in Go, Python, or Ruby and have a proven track record of optimizing observability systems for large-scale distributed environments.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of hands-on experience with Splunk or Grafana, No scripting skills in Go, Python, or Ruby, Inexperience with infrastructure as code tools like Terraform
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile