Position Details
About this role
Okta is seeking a Site Reliability Engineer specializing in observability to develop and expand our monitoring ecosystem within GCP, with a focus on automation, high reliability, and scalable infrastructure.
Key Responsibilities
- Build scalable observability infrastructure
- Automate deployment of agents
- Optimize data collection in GCP
- Participate in incident response
- Develop dashboards and monitoring tools
Technical Overview
The role involves managing GCP-based observability tools, automating deployment with Terraform, coding in Python and Go, and working with Kubernetes, Grafana, and Splunk for monitoring and incident response.
Ideal Candidate
The ideal candidate is a highly technical Site Reliability Engineer with at least 3 years of experience in GCP and Kubernetes, proficient in Python and Go, with strong skills in observability tools like Grafana and Splunk, and experience automating infrastructure using Terraform.
Deal Breakers
- Less than 3 years in SRE/DevOps roles
- Lack of experience with GCP or Kubernetes
- No scripting skills in Python or Go
- No experience with observability tools