
Senior Software Engineer - Autonomy Evaluation

at General Motors

Location Sunnyvale, California, United States of America
Posted April 14, 2026
Type Full-Time
Experience Senior
Exp. Years 3+ years
Education Bachelor's or higher degree in Computer Science, Data Science, Mechanical or Aerospace Engineering, or equivalent practical experience
Category AI & Machine Learning

Build and evolve GM’s autonomy evaluation platforms, including algorithms and dashboards that aggregate simulation metrics and deliver explainable insights. Apply VLMs and LLMs to classify performance and prioritize validation work, partnering across Autonomy, Simulation, Systems, and Safety teams.

  • Design and implement analysis algorithms to summarize, aggregate, and cluster simulation metrics
  • Build and maintain autonomy evaluation dashboards and reports with trend analysis, drift detection, and scenario coverage
  • Leverage VLMs and LLMs to classify autonomy performance and critical scenarios
  • Maintain high technical standards via architecture, code reviews, and software-engineering best practices
  • Interface with cross-org partners to articulate requirements and resolve handoff issues

The role focuses on data analysis and ML evaluation for autonomy verification using large-scale datasets and statistical methods. You will develop evaluation dashboards with trend analysis and drift detection, and combine VLM/LLM approaches with human-in-the-loop review to identify critical scenarios and improve scenario coverage.

The ideal candidate is a data/ML evaluation engineer with 3+ years of applied experience in data analysis, ML evaluation, or autonomy analytics using large-scale datasets and statistical methods. They can build evaluation dashboards with trend analysis, drift detection, and scenario coverage, and they have experience applying VLMs and LLMs for classifying autonomy performance and critical scenarios.

data analysis; ML evaluation; autonomy analytics; large-scale datasets; statistical methods; Pandas; NumPy; SciPy; plotting/visualization libraries; Design and implement analysis algorithms; Build and maintain autonomy evaluation dashboards and reports; Bachelor's or higher degree in Computer Science, Data Science, Mechanical or Aerospace Engineering, or equivalent practical experience
visualize quantitative information effectively and transparently; decompose a multi-dimensional space into something consumable; evaluating robotics systems; evaluating autonomous vehicles; sensor data (camera, lidar)
Pandas; NumPy; SciPy; plotting/visualization libraries; vision-language models (VLMs); large language models (LLMs)
analysis algorithms; data analysis; ML evaluation; autonomy analytics; large-scale datasets; statistical methods; Pandas; NumPy; SciPy; plotting/visualization libraries; trend analysis; drift detection; scenario coverage; vision-language models (VLMs); large language models (LLMs); human-in-the-loop; architectural design; code reviews; software-engineering best practices
analysis algorithms; data analysis; ML evaluation; autonomy analytics; large-scale datasets; statistical methods; Pandas; NumPy; SciPy; plotting/visualization libraries; trend analysis; drift detection; scenario coverage; vision-language models (VLMs); large language models (LLMs); classify autonomy performance; critical scenarios; prioritize validation efforts; human-in-the-loop; architectural design; code reviews; software-engineering best practices; data mining; training; metrics; risk assessment; release gating
cross-org communication; requirements articulation; handoff issue resolution; sharing best practices; technical leadership; collaboration; code review collaboration
Industry Aerospace
Job Function Develop and maintain GM’s autonomy evaluation algorithms and dashboards to enable data-driven AV quality decisions at scale
Role Subtype Data Scientist
Tech Domains Python
Senior Software Engineer; Autonomy Evaluation; evaluation ecosystem; analysis algorithms; data analysis; ML evaluation; autonomy analytics; large-scale datasets; statistical methods; Pandas; NumPy; SciPy; plotting/visualization libraries; autonomy evaluation dashboards; trend analysis; drift detection; scenario coverage; vision-language models (VLMs); large language models (LLMs); human-in-the-loop; risk assessment; release gating

Must have 3+ years of applied experience in data analysis, ML evaluation, or autonomy analytics. Must demonstrate proficiency with Pandas, NumPy, and SciPy.
