✦ Luna Orbit — DevOps & SRE

Technical Account Manager, Observe

at Snowflake

📍 US-CA-Menlo Park
Posted March 17, 2026
Type Not specified
Experience Senior
Exp. Years 7+ years
Education Not specified
Category DevOps & SRE

This role involves managing and improving observability and reliability of large-scale cloud-native systems, working closely with engineering teams to embed monitoring solutions and optimize system performance.

  • Lead observability strategy
  • Implement telemetry pipelines
  • Optimize system reliability
  • Coordinate incident response
  • Collaborate with engineering teams

The role focuses on SRE practices; observability tools such as Datadog, Dynatrace, and OpenTelemetry; cloud-native architectures; distributed systems; and incident management.

The ideal candidate is a senior SRE or DevOps engineer with extensive hands-on experience with observability platforms like Datadog, Dynatrace, or Splunk. They possess strong knowledge of distributed systems, cloud-native architectures, and telemetry pipelines, and can lead initiatives to improve system reliability and performance.

  • 7+ years of experience in SRE, DevOps, platform engineering, or observability engineering
  • Hands-on experience with observability platforms
  • Experience with OpenTelemetry
  • Understanding of distributed systems
  • Strong communication skills

  • Telemetry strategy development
  • Cost optimization
  • Telemetry governance
  • Incident response improvement

Datadog, Dynatrace, Splunk, New Relic, Grafana, Elastic (ELK), OpenTelemetry

SRE, Site Reliability Engineering, DevOps, observability platforms, Datadog, Dynatrace, Splunk, New Relic, Grafana, Elastic (ELK), OpenTelemetry, telemetry pipelines, distributed systems, cloud-native architectures

Communication, collaboration, problem-solving, analytical thinking, customer focus

Industry SaaS
Job Function Enhance system reliability and observability for enterprise cloud systems
Role Subtype Site Reliability Engineer
Tech Domains Active Directory, Microsoft 365, Azure, Amazon Web Services, Google Cloud Platform, Kubernetes, Docker, Python, Java, JavaScript
Site Reliability Engineer, SRE, DevOps, observability platforms, Datadog, Dynatrace, Splunk, New Relic, Grafana, Elastic, OpenTelemetry, telemetry pipelines, distributed systems, cloud-native architectures, incident response, observability, cloud-native

  • Less than 7 years of experience in SRE or DevOps
  • No hands-on experience with observability platforms
  • Lack of understanding of distributed systems
  • No experience with OpenTelemetry
  • Inability to communicate technical concepts
