✦ Luna Orbit — Software Engineering

Software Engineer, Safeguards

at Anthropic

📍 San Francisco, CA | New York City, NY Hybrid Posted March 07, 2026
Type Not Specified
Experience mid
Exp. Years 5-10+ years
Education Bachelor’s degree in Computer Science, Software Engineering or comparable experience
Category Software Engineering

This role involves developing safety and oversight mechanisms for AI systems, focusing on monitoring, abuse detection, and building defenses to ensure user well-being.

  • Develop monitoring systems for AI behaviors
  • Build abuse detection mechanisms
  • Surface abuse patterns to research teams
  • Create multi-layered safety defenses
  • Enforce terms of service and policies

The technical environment includes Python, Typescript, API development, monitoring dashboards, and safety infrastructure for AI models.

The ideal candidate is a mid-level software engineer with 5+ years of experience in building safety and oversight systems for AI. They possess strong skills in Python and Typescript, with a focus on developing monitoring and abuse detection mechanisms.

PythonTypescriptsoftware engineeringmonitoring systemsabuse detection
trust and safety detection mechanismsprompt engineeringadversarial inputsinternal tooling
APIDashboards
PythonTypescriptAPI developmentmonitoring systemsabuse detectionsafety mechanismssoftware engineering
PythonTypescriptAPI developmentMonitoring systemsabuse detectionintegrityspamfraudabuse detectionsoftware engineering
communicationexplain complex technical conceptscollaborationproblem-solving
Industry Technology / SaaS
Job Function Build safety and oversight systems for AI models
Software EngineerPythonTypescriptAPImonitoring systemsabuse detectionsafety mechanismsintegrity detectionspam detectionfraud detectionAI systemssafety and oversightsoftware engineeringAPI monitoringinternal dashboardsautomated enforcementdashboards

Lack of experience with Python or Typescript, Less than 5 years of relevant experience, No background in safety or abuse detection, Unwillingness to work in hybrid or onsite environments

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile