Staff+ Software Engineer, Systems

at Anthropic

📍 San Francisco, CA | New York City, NY | Seattle, WA
Hybrid
Posted March 07, 2026
Type Not specified
Experience Senior
Exp. Years 10+ years
Education Not specified
Category System Administration

This role involves owning the compute uptime and resilience of large-scale AI clusters, focusing on automation, reliability, and system optimization.

  • Own infrastructure strategy
  • Build scalable AI clusters
  • Define architecture
  • Collaborate with cloud providers
  • Establish operational practices

The technical environment includes distributed systems, cloud infrastructure, Linux kernel tuning, eBPF, and automation tools to ensure high availability and performance of AI compute clusters.

The ideal candidate is a senior systems engineer with over 10 years of experience in distributed systems, reliability, and cloud infrastructure. They possess deep expertise in Linux kernel tuning, eBPF, and systems automation, with a focus on building resilient AI infrastructure.

  • 10+ years of software engineering experience
  • Deep expertise in distributed systems and reliability
  • Experience with cloud platforms (AWS, GCP)
  • Strong systems programming skills (Python, Rust, Go, Java)
  • Experience with Linux kernel tuning and eBPF

  • Security and privacy best practices
  • Machine learning infrastructure experience
  • Networking infrastructure knowledge
Kubernetes, AWS, GCP, Linux, eBPF, Linux kernel
Distributed systems, Reliability engineering, Cloud platforms, Kubernetes, AWS, GCP, Linux, Infrastructure automation, eBPF, Kernel tuning
Leadership, Strategic thinking, Communication, Problem-solving, Team mentoring
Industry AI & Machine Learning
Job Function Manage and optimize large-scale AI infrastructure for reliability and performance
Distributed systems, Reliability engineering, Cloud platforms, Kubernetes, AWS, GCP, Linux, Infrastructure automation, eBPF, Kernel tuning, Security, Privacy, Machine learning infrastructure

  • Less than 10 years of experience
  • Lack of expertise in distributed systems or reliability
  • No experience with cloud platforms or Linux kernel tuning
