About this role
Staff Data Scientist leads the ID Graph program, driving graph modeling, evaluation, and ML signal development at scale across Socure’s product ecosystem.
Key Responsibilities
- Entity resolution & Graph evaluation; Data quality & modeling frameworks; Signal discovery & graph intelligence; Cross-functional collaboration & technical leadership; Leadership competencies
Technical Overview
Strong Python and PySpark expertise; production ML systems experience; hands-on with graph databases and frameworks (NeptuneDB, GraphFrames, OpenCypher); exploring LLMs and knowledge graphs.
Ideal Candidate
The ideal candidate is a senior data scientist with 5+ years of experience in graph-based ML, strong Python/PySpark skills, and a track record of delivering production-grade ML systems at scale. They should be comfortable leading cross-functional teams, mentoring peers, and communicating complex graph-based insights to technical and business stakeholders.
Must-Have Skills
5+ years of experience in applied data sciencemachine learningor artificial intelligencewith a focus on graph-based modeling and large-scale data systemsStrong proficiency in Python and PySparkExperience building production-grade ML systems at scaleHands-on experience with DatabricksFamiliarity with graph databases and query languages such as NeptuneDB and OpenCypherExperience with graph processing frameworks (GraphFrames)Excellent communication skillswith cross-functional collaboration
Nice-to-Have Skills
Experience applying LLMs for evaluationautomationor signal discoveryFamiliarity with Knowledge Graphs and Graph Neural Networks (GNNs)Master’s or PhD in Computer ScienceData ScienceMachine LearningStatisticsMathematicsor a related field5+ years of building cross-functionalcross-product ML initiatives
Tools & Platforms
PythonPySparkDatabricksGraphFramesNeptuneDBOpenCypherGraph Neural NetworksKnowledge GraphsLLMsLarge Language Models
Required Skills
PythonPySparkDatabricksGraphFramesNeptuneDBOpenCypherGraph databasesLearning-to-RankLink predictionSemi-supervised learningGraph Neural NetworksKnowledge GraphsAnomaly DetectionProduction-grade ML systems
Hard Skills
PythonPySparkDatabricksGraphFramesGraph databasesNeptuneDBOpenCypherKnowledge GraphsGraph Neural NetworksLearning-to-RankLink predictionSemi-supervised learningAnomaly DetectionProduction-grade ML systemsLLMsLarge Language Models
Soft Skills
LeadershipCollaborationCommunicationMentoring peersStrategic thinkingExecutive communication
Keywords for Your Resume
Staff Data ScientistPythonPySparkDatabricksGraph databasesNeptuneDBOpenCypherGraphFramesGraph Neural NetworksKnowledge GraphsLearning-to-RankLink predictionSemi-supervised learningProduction-grade ML systemsLLMsLarge Language ModelsAnomaly DetectionEntity resolutionstaff data scientistpythonpysparkdatabricksneptunedbopencyphergraphframesgraph databasesknowledge graphsllms
Deal Breakers
5+ years of graph-based ML experience, Inability to lead cross-functional projects, No Databricks experience
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile