✦ Luna Orbit — Software Engineering

Software Engineer, Compute Efficiency

at Anthropic

📍 San Francisco, CA | New York City, NY Hybrid Posted March 07, 2026
Type Not Specified
Experience mid
Exp. Years 6+ years
Education Not specified
Category Software Engineering

This role focuses on enhancing AI infrastructure efficiency by developing telemetry, cost attribution, and optimization frameworks across large-scale distributed systems.

  • Build telemetry systems
  • Design cost attribution frameworks
  • Identify performance bottlenecks
  • Optimize cluster configurations
  • Collaborate with cloud providers

The technical scope includes distributed systems, cloud platforms, telemetry, and performance optimization using Python, Rust, Go, and Java, bridging hardware and high-level research needs.

The ideal candidate is a senior software engineer with over 6 years of experience in distributed systems, skilled in Python, Rust, Go, or Java. They have a strong background in infrastructure optimization, telemetry, and cost management for large-scale AI systems.

6+ years of industry experienceExpertise in distributed systemsProficiency in PythonRustGoor JavaExperience with cloud infrastructureBuilding telemetry and monitoring systems
Cost attribution frameworksPerformance bottleneck resolutionCluster configuration optimizationWorkload placementInfrastructure reliability
PythonRustGoJavaCloud platforms
Distributed systemsTelemetryCost attributionPerformance optimizationCloud infrastructureCluster configurationWorkload placementInfrastructure reliabilityPythonRustGoJava
Distributed systemsCloud platformsNetworkingApplication-level performanceHardware constraintsTelemetryCost attributionOptimization frameworksPythonRustGoJava
Problem-solvingCollaborationAnalytical thinkingPerformance awarenessCost-consciousness
Industry Technology / AI & Machine Learning / Cloud & Infrastructure
Job Function Optimizing AI infrastructure performance and cost-efficiency
Distributed systemsTelemetryCost attributionPerformance optimizationCloud infrastructureCluster configurationWorkload placementInfrastructure reliabilityPythonRustGoJavaCloud platforms

Less than 6 years of experience, Lack of expertise in distributed systems, No experience with cloud infrastructure, Unfamiliar with telemetry or cost frameworks

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile