Position Details
About this role
This role involves ensuring the reliability and performance of complex distributed systems through automation, testing, and infrastructure management. The candidate will develop and execute resilience tests, automate deployments, and monitor system health.
Key Responsibilities
- Maintain system reliability
- Develop automated tests
- Implement Infrastructure as Code
- Monitor system performance
- Simulate failure scenarios
Technical Overview
The position requires expertise in SRE practices, automation, cloud infrastructure, Infrastructure as Code, performance testing, and monitoring tools like Splunk to support scalable and resilient systems.
Ideal Candidate
The ideal candidate is a mid-level Site Reliability Engineer with 3+ years of experience in automating and maintaining complex distributed systems. They are skilled in performance testing, automation, cloud infrastructure, and have a strong understanding of system resilience and failure scenarios.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 3 years of experience, No experience with SRE or DevOps practices, Lack of automation skills, Inability to work on-site in Jacksonville, FL, No experience with cloud infrastructure
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile