Position Details
About this role
This role involves leading projects to enhance platform reliability, managing monitoring and automation, and troubleshooting complex system issues in a healthcare IT environment.
Key Responsibilities
- Manage platform infrastructure
- Lead observability/monitoring projects
- Troubleshoot service disruptions
- Develop automated procedures
- Lead incident reviews
Technical Overview
The technical environment includes Kubernetes, Docker, CI/CD pipelines, and observability tools, with a focus on automation, performance, and incident response.
Ideal Candidate
The ideal candidate is a mid-level Site Reliability Engineer with 4+ years of experience in managing platform infrastructure, automation, and incident response. They should have strong troubleshooting skills and experience with observability tools and CI/CD pipelines.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with SRE practices, No experience with automation or monitoring tools, Bachelor's degree not in a relevant field
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile