Staff Data Scientist - Entity Resolution, IDGraph

at Socure

📍 Remote - US, Remote
Posted March 31, 2026
Type Full-Time
Experience lead
Exp. Years 5+ years
Education Master’s or PhD in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field (preferred)
Category AI & Machine Learning

The Staff Data Scientist leads the ID Graph program, driving graph modeling, evaluation, and ML signal development at scale across Socure’s product ecosystem.

  • Entity resolution & graph evaluation
  • Data quality & modeling frameworks
  • Signal discovery & graph intelligence
  • Cross-functional collaboration & technical leadership
  • Leadership competencies

Strong Python and PySpark expertise; production ML systems experience; hands-on with graph databases and frameworks (NeptuneDB, GraphFrames, OpenCypher); exploring LLMs and knowledge graphs.

The ideal candidate is a senior data scientist with 5+ years of experience in graph-based ML, strong Python/PySpark skills, and a track record of delivering production-grade ML systems at scale. They should be comfortable leading cross-functional teams, mentoring peers, and communicating complex graph-based insights to technical and business stakeholders.

  • 5+ years of experience in applied data science, machine learning, or artificial intelligence, with a focus on graph-based modeling and large-scale data systems
  • Strong proficiency in Python and PySpark
  • Experience building production-grade ML systems at scale
  • Hands-on experience with Databricks
  • Familiarity with graph databases and query languages such as NeptuneDB and OpenCypher
  • Experience with graph processing frameworks (GraphFrames)
  • Excellent communication skills, with cross-functional collaboration

  • Experience applying LLMs for evaluation, automation, or signal discovery
  • Familiarity with Knowledge Graphs and Graph Neural Networks (GNNs)
  • Master’s or PhD in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field
  • 5+ years of building cross-functional, cross-product ML initiatives
Python, PySpark, Databricks, GraphFrames, Graph databases, NeptuneDB, OpenCypher, Knowledge Graphs, Graph Neural Networks, Learning-to-Rank, Link prediction, Semi-supervised learning, Anomaly Detection, Production-grade ML systems, Large Language Models (LLMs)
Leadership, Collaboration, Communication, Mentoring peers, Strategic thinking, Executive communication
Industry Fintech
Job Function Lead advanced graph-based data science and ML efforts for Socure's ID Graph platform
Role Subtype data scientist
Tech Domains Python, Databricks, GraphFrames, Graph databases, NeptuneDB, OpenCypher
Visa Sponsorship No

5+ years of graph-based ML experience, Inability to lead cross-functional projects, No Databricks experience
