✦ Luna Orbit — AI & Machine Learning

Machine Learning Systems Engineer, RL Engineering

at Anthropic

📍 San Francisco, CA | New York City, NY | Seattle, WA Hybrid Posted March 07, 2026
Type Not Specified
Experience mid
Exp. Years 4+ years
Education Not specified
Category AI & Machine Learning

Build and improve AI training systems for reinforcement learning models at Anthropic, focusing on large-scale distributed infrastructure and algorithms.

  • Develop training algorithms
  • Optimize training pipelines
  • Build scalable AI infrastructure
  • Support research teams
  • Improve system robustness

Developing AI/ML systems, reinforcement learning pipelines, and infrastructure for training large language models using Python and distributed computing.

The ideal candidate is a software engineer with 4+ years of experience in AI systems, specializing in reinforcement learning and large-scale distributed training. They are proficient in Python and have a strong background in building scalable AI infrastructure to support research and production models.

4+ years of software engineering experienceExperience with systems and tools for AI model trainingKnowledge of large-scale distributed systemsExperience with PythonImplementing LLM finetuning algorithms
High performance distributed systemsLarge scale LLM trainingAI research experienceBuilding AI infrastructure
Python
Machine LearningReinforcement LearningRLHFLarge Language ModelsPythonDistributed SystemsAI SystemsModel TrainingAlgorithm DevelopmentSystem Infrastructure
Machine LearningReinforcement LearningRLHFLarge Language ModelsPythonDistributed SystemsAI SystemsModel TrainingAlgorithm DevelopmentSystem Infrastructure
Results-orientedCollaborationProblem-solvingImpact-drivenLearning mindset

Required

None specified

Preferred

None specified
Industry Technology
Job Function AI Systems Engineering
Machine LearningReinforcement LearningRLHFLarge Language ModelsPythonDistributed SystemsAI SystemsModel TrainingAlgorithm DevelopmentSystem InfrastructureAIMLDeep LearningLLMLanguage Models

Lack of experience with large-scale distributed systems, No background in AI or machine learning, No proficiency in Python, Unable to work in specified locations

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile