Position Details

Type Full-Time

Experience senior

Exp. Years 3+ years

Education Bachelor's or higher degree in Computer Science, Data Science, Mechanical or Aerospace Engineering, or equivalent practical experience

Category AI & Machine Learning

About this role

Build and evolve GM’s autonomy evaluation platforms, including algorithms and dashboards that aggregate simulation metrics and deliver explainable insights. Apply VLMs and LLMs to classify performance and prioritize validation work, partnering across Autonomy, Simulation, Systems, and Safety teams.

Key Responsibilities

Design and implement analysis algorithms to summarize, aggregate, and cluster simulation metrics
Build and maintain autonomy evaluation dashboards and reports with trend analysis, drift detection, and scenario coverage
Leverage VLMs and LLMs to classify autonomy performance and critical scenarios
Maintain high technical standards via architecture, code reviews, and software-engineering best practices
Interface with cross-org partners to articulate requirements and resolve handoff issues

Technical Overview

The role focuses on data analysis and ML evaluation for autonomy verification using large-scale datasets and statistical methods. You will develop evaluation dashboards with trend analysis and drift detection, and integrate VLM/LLM approaches with human-in-the-loop to identify critical scenarios and improve scenario coverage.

Ideal Candidate

The ideal candidate is a data/ML evaluation engineer with 3+ years of applied experience in data analysis, ML evaluation, or autonomy analytics using large-scale datasets and statistical methods. They can build evaluation dashboards with trend analysis, drift detection, and scenario coverage, and they have experience applying VLMs and LLMs for classifying autonomy performance and critical scenarios.

Must-Have Skills

data analysisML evaluationautonomy analyticslarge-scale datasetsstatistical methodsPandasNumPySciPyplotting/visualization librariesDesign and implement analysis algorithmsBuild and maintain autonomy evaluation dashboards and reportsBachelor's or higher degree in Computer ScienceData ScienceMechanical or Aerospace Engineeringor equivalent practical experience

Nice-to-Have Skills

visualize quantitative information effectively and transparentlydecompose a multi-dimensional space into something consumableevaluating robotics systemsevaluating autonomous vehiclessensor data (cameralidar)

Tools & Platforms

PandasNumPySciPyplotting/visualization librariesvision-language modelsVLMslarge language modelsLLMs

Required Skills

analysis algorithmsdata analysisML evaluationautonomy analyticslarge-scale datasetsstatistical methodsPandasNumPySciPyplotting/visualization librariestrend analysisdrift detectionscenario coveragevision-language models (VLMs)large language models (LLMs)human-in-the-looparchitectural designcode reviewssoftware-engineering best practices

Hard Skills

analysis algorithmsdata analysisML evaluationautonomy analyticslarge-scale datasetsstatistical methodsPandasNumPySciPyplotting/visualization librariestrend analysisdrift detectionscenario coveragevision-language modelsVLMslarge language modelsLLMsclassify autonomy performancecritical scenariosprioritize validation effortshuman-in-the-looparchitectural designcode reviewssoftware-engineering best practicesdata miningtrainingmetricsrisk assessmentrelease gating

Soft Skills

cross-org communicationrequirements articulationhandoff issue resolutionsharing best practicestechnical leadershipcollaborationcode review collaboration

Industry & Role

Industry Aerospace

Job Function Develop and maintain GM’s autonomy evaluation algorithms and dashboards to enable data-driven AV quality decisions at scale

Role Subtype Data Scientist

Tech Domains Python

Keywords for Your Resume

Senior Software EngineerAutonomy Evaluationevaluation ecosystemanalysis algorithmsdata analysisML evaluationautonomy analyticslarge-scale datasetsstatistical methodsPandasNumPySciPyplotting/visualization librariesautonomy evaluation dashboardstrend analysisdrift detectionscenario coveragevision-language modelsVLMslarge language modelsLLMshuman-in-the-looprisk assessmentrelease gating

Deal Breakers

Must have 3+ years applied experience in data analysis, ML evaluation, or autonomy analytics, Must demonstrate proficiency with Pandas, NumPy, and SciPy

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile

Senior Software Engineer - Autonomy Evaluation

Get matched to jobs like this