✦ Luna Orbit — AI & Machine Learning

Senior Software Engineer, Systems

at Anthropic

📍 San Francisco, CA | New York City, NY | Seattle, WA Hybrid Posted March 07, 2026
Type Not Specified
Experience senior
Exp. Years 6+ years
Education Not specified
Category AI & Machine Learning

This role involves leading infrastructure projects for large-scale AI systems, focusing on reliability, compute uptime, and collaboration with cloud providers to solve complex infrastructure challenges.

  • Lead infrastructure projects
  • Build and maintain AI clusters
  • Partner with cloud providers
  • Solve compute and reliability challenges
  • Improve operational practices

The technical environment includes distributed systems, Kubernetes, cloud platforms (AWS, GCP), systems languages (Python, Rust, Go, Java), and observability tools like eBPF, aimed at building reliable AI infrastructure.

The ideal candidate is a senior systems engineer with over 6 years of experience in distributed systems, reliability engineering, and cloud platforms like AWS and GCP, with expertise in systems languages such as Python, Rust, or Go, and familiarity with ML infrastructure.

6+ years of software engineering experienceLed technical projects end-to-endDeep knowledge of distributed systems and reliabilityExperience with cloud platforms (AWS/GCP)Strong in at least one systems language (PythonRustGoJava)
Security and privacy expertiseExperience with ML infrastructure (GPUsTPUsTrainium)Networking infrastructureLinux kernel tuningeBPF
KubernetesIaCAWSGCPLinuxeBPFGPUsTPUsTrainium
Systems EngineeringDistributed SystemsReliability EngineeringCloud PlatformsKubernetesIaCAWSGCPPythonRustGoJavaLinuxeBPF
Systems EngineeringDistributed SystemsReliability EngineeringCloud PlatformsKubernetesIaCAWSGCPPythonRustGoJavaLinuxeBPF
Technical LeadershipProblem-solvingCommunicationTeam CollaborationOperational Practices
Industry AI / Research
Job Function Infrastructure engineering for large-scale AI systems
Systems EngineerDistributed SystemsReliability EngineeringCloud PlatformsKubernetesIaCAWSGCPPythonRustGoJavaLinuxeBPFML InfrastructureGPUTPUTrainium

Less than 6 years of experience, Lack of experience with distributed systems or cloud platforms, No knowledge of systems languages (Python, Rust, Go, Java)

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile