About this role
Lead the Platform Engineering team responsible for cloud hosted applications, scalable infrastructure, and production reliability. You will own AWS platform design, Terraform-based infrastructure provisioning, CI/CD pipelines, container orchestration, and security/compliance scanning.
Key Responsibilities
- Design, build, and maintain scalable cloud infrastructure on AWS
- Develop and optimize GitHub Actions CI/CD pipelines
- Implement infrastructure as code using Terraform
- Manage and orchestrate containerized applications using managed container platforms
- Drive cost optimization and reliability with monitoring, rightsizing, automated scaling, and 99.9% uptime
Technical Overview
Build and maintain scalable AWS infrastructure and high availability containerized services using managed container platforms. Implement infrastructure as code with Terraform, automate deployments with GitHub Actions CI/CD pipelines, and use monitoring/observability tools like Splunk and Grafana to troubleshoot performance issues and drive 99.9% uptime.
Ideal Candidate
The ideal candidate is a seasoned cloud/platform engineer who has led the design and operation of scalable AWS infrastructure for business-critical applications. They have strong hands-on experience with Terraform and GitHub Actions CI/CD pipelines, plus proven reliability and production support skills targeting 99.9% uptime. They also partner effectively with security teams to implement security best practices, compliance, and automated security scanning.
Must-Have Skills
Designbuildand maintain scalable cloud infrastructure on AWSDevelop and optimize GitHub Actions CI/CD pipelinesImplement infrastructure as code using TerraformManage and orchestrate containerized applications using managed container platformsDrive cost optimization initiatives across cloud resources through monitoringrightsizingand implementing automated scaling solutionsPartner with security teams to implement and maintain security best practicescompliance requirementsand automated security scanning throughout the infrastructure lifecycleDeliver all aspects of production supportEnsure compliance to all agreed SLAs and requirementsMonitor system performancetroubleshoot issuesand implement automated solutions to ensure 99.9% uptime
Nice-to-Have Skills
Select third-party vendor products (production support)managed container platforms
Tools & Platforms
AWSAmazon Web ServicesTerraformGitHub ActionsSplunkGrafana
Required Skills
AWSAmazon Web ServicesGitHub ActionsCI/CD pipelinesinfrastructure as codeTerraformmanaged container platformscontainerized applicationscost optimizationmonitoringrightsizingautomated scalingsecurity best practicescompliance requirementsautomated security scanningproduction supportincident resolutionproactive mitigationchange and problem managementSLAssystem performance monitoringSplunkGrafana
Hard Skills
Designbuildmaintain scalable cloud infrastructureAmazon Web ServicesAWSGitHub ActionsCI/CD pipelinesinfrastructure as codeTerraformmanaged container platformscontainerized applicationshigh availabilityscalabilitycost optimizationmonitoringrightsizingautomated scalingtroubleshoot infrastructure issuesself-service capabilitiessecurity best practicescompliance requirementsautomated security scanningproduction supportincident resolutionproactive mitigationchange managementproblem managementSLAsSLA compliancesystem performance monitoringtroubleshoot issuesautomated solutions99.9% uptimeoptimal resource utilizationlogmetricsGrafanaSplunk
Soft Skills
collaborate with development teamscollaborate with security teamspartner with security teamsleadownershipproblem-solvingcommunicationcross-functional collaboration
Keywords for Your Resume
LeadPlatform EngineeringAWSAmazon Web ServicesTerraforminfrastructure as codeGitHub ActionsCI/CD pipelinescontainerized applicationsmanaged container platformshigh availabilitycost optimizationmonitoringrightsizingautomated scalingsecurity best practicescompliance requirementsautomated security scanningincident resolutionchange managementproblem managementSLAs99.9% uptimeSplunkGrafanaproduction support
Deal Breakers
Must have hands-on experience designing and maintaining scalable AWS infrastructure, Must have experience with Terraform (infrastructure as code), Must have experience optimizing GitHub Actions CI/CD pipelines, Must target 99.9% uptime and production support SLA compliance
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile