About this role
CVS Health is hiring a Staff Cloud Infrastructure Engineer to design, implement, and maintain infrastructure supporting operating systems. The role emphasizes secure cloud and on-premises architecture, automation with Infrastructure-as-Code, continuous observability, and hands-on Level 4 incident response.
Key Responsibilities
- Design and maintain secure, scalable cloud and on-premises infrastructure (Kubernetes, VMWare, OpenShift Virtualization)
- Implement security controls aligned with NIST, CIS, and HIPAA
- Oversee Infrastructure-as-Code (IaC) using Terraform or Ansible
- Provide Level 4 incident response and perform root cause analysis for critical issues
- Build monitoring and observability frameworks and drive modernization with automation and self-healing solutions
Technical Overview
This role covers infrastructure architecture and operations across Kubernetes, VMWare, and OpenShift Virtualization. It requires automation using Infrastructure-as-Code tools like Terraform and/or Ansible, plus monitoring for observability and performance tracking of Linux and cloud systems, along with incident response and root cause analysis.
Ideal Candidate
The ideal candidate is a senior cloud infrastructure engineer who designs and operates secure, scalable infrastructure using Kubernetes and virtualization platforms. They have strong Infrastructure-as-Code experience with Terraform and/or Ansible, provide Level 4 incident response with root cause analysis, and implement security controls aligned to NIST, CIS, and HIPAA.
Must-Have Skills
designingimplementingand maintaining the infrastructure and services supporting the organization's operating systemsdesign and maintain securescalable cloud and on-premises (KubernetesVMWareOpenShift Virtualization) infrastructuresImplement security controls aligned with industry standards (e.g.NISTCISHIPAA)Oversee the implementation of Infrastructure-as-Code (IaC) tools like Terraform or AnsibleProvide Level 4 incident response and lead root cause analysis for critical issuesparticipate in on-call rotationsBuild and maintain monitoring frameworks to ensure continuous observability and performance tracking of Linux and cloud systems
Tools & Platforms
KubernetesVMWareOpenShift VirtualizationTerraformAnsibleLinux
Required Skills
infrastructure architecturecloud and on-premises infrastructuresKubernetesVMWareOpenShift Virtualizationsecurity controlsNISTCISHIPAAInfrastructure-as-CodeIaCTerraformAnsibleautomationprovisioningmonitoring frameworksobservabilityperformance trackingLinuxLevel 4 incident responseincident managementroot cause analysisvulnerability managementon-call rotationsbusiness continuityhigh availabilityself-healing solutionsdocumentationmentor junior engineers
Hard Skills
infrastructure architectureoperating system infrastructure designcloud and on-premises infrastructureKubernetesVMWareOpenShift VirtualizationInfrastructure-as-CodeIaCTerraformAnsiblesecurity controlsNISTCISHIPAAautomation for provisioningautomation for infrastructure resourcesmonitoring frameworksobservabilityperformance trackingLinuxincident responseLevel 4 incident responseroot cause analysisvulnerability managementon-call rotationsmodernize infrastructureautomating software deploymentsautomating software updatesself-healing solutionsdocumentation for automated solutions and infrastructure configurationsincident managementbusiness continuityhigh availability
Soft Skills
cross-functional collaborationstrategic planningoperational excellenceeffective incident managementmentoringknowledge sharingevaluate new technologiesrecommend improvementslead root cause analysis
Keywords for Your Resume
Staff Cloud Infrastructure Engineercloud infrastructureInfrastructure Architecture & Operationoperating systemsKubernetesVMWareOpenShift VirtualizationsecurescalableNISTCISHIPAAInfrastructure-as-CodeIaCTerraformAnsibleautomationmonitoring frameworksobservabilityperformance trackingLinuxincident managementLevel 4 incident responseroot cause analysisvulnerability managementon-call rotationsbusiness continuityhigh availabilitymodernize infrastructureself-healing solutionsdocumentationmentor junior engineersincident response
Deal Breakers
Must have experience designing secure, scalable infrastructure using Kubernetes, Must have Infrastructure-as-Code experience with Terraform or Ansible, Must be able to provide Level 4 incident response and participate in on-call rotations, Must have implemented security controls aligned with NIST, CIS, and HIPAA
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile