✦ Luna Orbit — AI & Machine Learning

Research Engineer, Cybersecurity Reinforcement Learning

at Anthropic

📍 San Francisco, CA | New York City, NY Hybrid Posted March 07, 2026
Type Full-Time
Experience mid
Exp. Years 3+ years
Education Bachelor's degree in a related field or equivalent experience
Category AI & Machine Learning

This role involves developing reinforcement learning techniques to improve cybersecurity defenses and secure AI systems at Anthropic.

  • Develop RL environments
  • Implement secure AI models
  • Conduct cybersecurity experiments
  • Collaborate with research teams
  • Enhance model safety and robustness

Focus on cybersecurity research, machine learning, reinforcement learning environments, LLM training, and secure coding practices.

The ideal candidate is a research engineer with experience in cybersecurity and machine learning, capable of developing reinforcement learning models for secure AI applications. They possess strong software engineering skills and a passion for advancing safe AI systems.

Cybersecurity research experienceExperience with machine learningStrong software engineering skillsAbility to balance research and engineering
Security engineeringFuzzingDetection and responseParticipation in CTF competitionsAcademic cybersecurity researchFamiliarity with RL techniquesLLM training methodologies
RL environmentsLLMsCyber ranges
CybersecurityReinforcement LearningMachine LearningAIRL environmentsSecurity engineeringFuzzingDetection and responseCyber rangesLLM trainingSecure codingVulnerability remediation
CybersecurityReinforcement LearningMachine LearningAIArtificial IntelligenceRL environmentsSecurity engineeringFuzzingDetection and responseCyber rangesLLM trainingSecure codingVulnerability remediation
ResearchEngineeringCollaborationProblem-solvingCommunicationPassion for AIInnovation
Industry Technology
Job Function Advance cybersecurity through AI and reinforcement learning research
CybersecurityReinforcement LearningMachine LearningAIArtificial IntelligenceRL environmentsSecurity engineeringFuzzingDetection and responseCyber rangesLLM trainingSecure codingVulnerability remediation

Lack of cybersecurity research experience, No background in machine learning, Insufficient software engineering skills, No experience with RL or LLM training

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile