✦ Luna Orbit — Cloud & Infrastructure

Principal Architect - Cloud and Observability

at CVS Health

Remote 💰 $144K – $288K USD / year Posted April 04, 2026
Salary $144K – $288K USD / year
Type Full-Time
Experience lead
Exp. Years 10+ years
Education Bachelor's degree in Computer Science, Engineering, or a related field. Equivalent work experience accepted.
Category Cloud & Infrastructure

Principal Architect - Cloud and Observability owns the architecture and standards for CVS Health's observability platforms and hybrid cloud posture. The role is hands-on, shaping telemetry, instrumentation, and cost optimization across multi-cloud and on-prem environments.

  • Own enterprise observability reference architecture across metrics, logs, traces, and events
  • Drive the OpenTelemetry-first instrumentation strategy
  • Build and operate telemetry pipelines on Grafana Mimir, Loki, Tempo
  • Define SLOs/SLIs and alerting frameworks
  • Own telemetry tooling integration with ServiceNow ITOM and xMatters

Responsible for OpenTelemetry-first instrumentation, telemetry pipelines on Grafana Mimir/Loki/Tempo, multi-cloud reference architectures (OpenShift on-prem, Azure, AWS, GCP), and integration with incident management. Uses IaC (Terraform, Pulumi, Helm, ArgoCD) and cloud-native IAM patterns.

The ideal candidate is a senior cloud/observability architect with 10+ years in infrastructure and cloud, 5+ years Kubernetes and OpenTelemetry experience, and proven success delivering enterprise observability across hybrid on-prem and cloud environments.

10+ years in infrastructurecloud architectureplatform engineeringor SRE8+ years of architecture work in observabilitycloud infrastructureor both at a large enterpriseSolid experience with at least two of AzureAWSor GCP -- including networkingidentitycomputeand storage5+ years with Kubernetes in production (OpenShiftEKSAKSor GKE)5+ years with OpenTelemetry or similar frameworks (collectorsSDKssemantic conventionspipeline design)5+ years with observability platforms: Grafana/Mimir/Loki/TempoPrometheusDatadogSplunkDynatraceor comparable toolsExperience defining SLOs/SLIs and building alerting strategies at an organizational levelProven track record writing architecture standards that other teams adopted and followedAble to communicate clearly with both engineers and senior leadership
On-prem / private cloud experience (OpenShift VirtualizationKVM/libvirtVMwareDell PowerFlex or similar storage)Workload identity (SPIFFE/SPIRE) and zero-trust networkingInfrastructure-as-code (TerraformPulumiHelmArgoCD)Streaming platforms such as Kafka or Confluentespecially in telemetry pipeline contextsAIOps or ML-based anomaly detection experienceFinOps background -- cloud cost optimizationchargebackunit economicsService mesh (IstioEnvoyLinkerd) or eBPF-based tools (CiliumPixie)Involvement in open-source communities (CNCFOpenTelemetryetc.)Healthcareinsuranceor financial services experience (HIPAA/SOX familiarity)Cloud certifications are a plus but not required
ServiceNow ITOMGrafanaOpenTelemetryKubernetesOpenShiftAmazon Web ServicesMicrosoft AzureGoogle Cloud PlatformTerraformPulumiHelmArgoCDKafkaConfluentSPIFFE/SPIREServiceNowxMatters
OpenTelemetryGrafanaMimirLokiTempoDatadogSplunkDynatracePrometheusKubernetesOpenShiftEKSAKSGKEAzureAmazon Web ServicesGoogle Cloud PlatformTerraformPulumiHelmArgoCDKafkaConfluentSPIFFESPIREServiceNow ITOMxMattersFinOpsAPIsCI/CD
OpenTelemetryGrafanaMimirLokiTempoDatadogSplunkDynatracePrometheusKubernetesOpenShiftEKSAKSGKEAzureAmazon Web ServicesGoogle Cloud PlatformTerraformPulumiHelmArgoCDAPIs
LeadershipCommunicationStakeholder ManagementMentoringCollaborationProblem-solvingStrategic ThinkingPublic SpeakingInfluenceChange Management

Preferred

Cloud certifications are a plus but not required
Industry Healthcare & Medical
Job Function Architect and implement enterprise observability and hybrid cloud standards for CVS Health, with hands-on delivery of telemetry and cost optimization initiatives.
OpenTelemetryGrafanaMimirLokiTempoDatadogSplunkDynatracePrometheusKubernetesOpenShiftEKSAKSGKEAzureAmazon Web ServicesGoogle Cloud PlatformTerraformPulumiHelmArgoCDAPIs

Less than 10 years in infrastructure/cloud architecture or observability, No 5+ years of Kubernetes in production, No OpenTelemetry experience or lack of familiarity with Grafana/Mimir/Loki/Tempo

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile