Position Details
About this role
Lead the design, build, and maintenance of cloud infrastructure on AWS, ensuring platform reliability, security, and scalability. Drive automation, incident response, and observability for a high-growth AI startup.
Key Responsibilities
- Design and maintain cloud infrastructure
- Implement CI/CD pipelines
- Lead incident response and postmortems
- Ensure security and compliance
- Enhance platform observability
Technical Overview
Expertise in AWS cloud services, Kubernetes, Terraform, CI/CD pipelines, and monitoring tools like Datadog and Prometheus. Focus on security, operational discipline, and autonomous remediation.
Ideal Candidate
The ideal candidate is a senior Site Reliability Engineer with 5+ years of experience designing and maintaining cloud infrastructure on AWS, proficient in Kubernetes, Terraform, and CI/CD pipelines, with a strong focus on security and observability.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of experience with AWS or cloud infrastructure, No experience with Kubernetes or Terraform, Inability to work remotely, Less than 5 years of relevant experience
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile