Position Details
About this role
Supports and enhances Salesforce's observability and monitoring infrastructure, ensuring system reliability through automation, proactive telemetry, and incident management.
Key Responsibilities
- Manage observability platforms
- Implement monitoring strategies
- Automate incident detection
- Support platform migrations
- Lead on-call incident resolution
Technical Overview
Focuses on managing observability platforms, implementing automation, and supporting incident response using tools like Splunk, Grafana, and OpenTelemetry, with a focus on proactive system health.
Ideal Candidate
The ideal candidate is a senior SRE with extensive experience in monitoring, observability, and automation tools. They should be proactive, capable of managing complex incident responses, and skilled in implementing predictive monitoring solutions.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with monitoring tools, No background in SRE or DevOps, Inability to work independently in critical environments, No experience with automation or incident management
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile