Position Details
About this role
Site Reliability Engineer - AWS role in Chicago focusing on incident response, monitoring, automation, and reliability of a large-scale SaaS environment.
Key Responsibilities
- Collaborate with engineering teams to promote SRE practices across the organization
- Create and maintain operational documentation such as runbooks and playbooks
- Configure and maintain monitoring and alerting tools related to the applications
- Monitor application and infrastructure health and recommend improvements to performance and reliability
- Identify manual tasks and contribute to automation efforts
Technical Overview
Hands-on SRE with AWS, CI/CD, automation, monitoring, and runbook/playbook development. Requires collaboration across product development, QA, and infrastructure teams.
Ideal Candidate
The ideal candidate is a mid-level SRE with hands-on AWS experience, strong incident response and automation skills, and a proven ability to implement monitoring across large-scale applications.
Must-Have Skills
None listed
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile