Position Details
About this role
This role involves leading the development and implementation of site reliability engineering practices, focusing on monitoring, performance, and automation across cloud platforms like Azure, AWS, and GCP, to ensure system stability and scalability.
Key Responsibilities
- Build SRE practices and patterns
- Monitor and improve system performance
- Lead automation and infrastructure as code initiatives
- Mentor SRE teams
- Collaborate with architecture and engineering teams
Technical Overview
The technical environment includes Java, Python, Go, Perl, Ruby, shell scripting, Kubernetes, cloud platforms (Azure, AWS, GCP), CI/CD pipelines, and observability tools, emphasizing scalable, reliable infrastructure.
Ideal Candidate
The ideal candidate is a senior SRE with over 10 years of experience in enterprise software development, proficient in multiple programming languages and cloud platforms. They should have strong leadership skills, experience in monitoring and performance engineering, and a passion for building reliable, scalable systems.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 10 years of experience, Lack of experience with cloud platforms (Azure, AWS, GCP), No experience with Kubernetes, No background in enterprise software development
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile