Position Details
About this role
Klaviyo seeks a Senior Observability Platform Engineer to develop and operate scalable monitoring systems that provide insights into product and infrastructure health.
Key Responsibilities
- Design and operate scalable observability systems
- Create developer tooling and dashboards
- Set telemetry standards and best practices
- Use data to improve system reliability
- Automate infrastructure provisioning and upgrades
Technical Overview
The role involves building observability stacks including metrics, logs, traces, and alerting systems, with a focus on automation and AI integration, using tools like Prometheus and custom tooling.
Ideal Candidate
The ideal candidate is a senior software engineer with extensive experience in observability, monitoring, and distributed systems. They should have a strong background in building scalable, reliable telemetry platforms and be familiar with AI integration for system optimization.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with Prometheus or similar tools, No background in distributed systems, Insufficient automation skills, No experience in observability platform engineering
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile