About this role
This role leads Data Engineering delivery and governance for healthcare analytics and operational data products across multi-cloud environments. The manager will own end-to-end SDLC, testing and release management, CI/CD workflows, and Generative AI data foundation architecture using Vector and Graph databases.
Key Responsibilities
- Direct end-to-end SDLC for high-scale data infrastructure and program delivery commitments
- Govern design and optimization of distributed data pipelines (GCP preferred) using DataForm or DataFlow
- Architect and validate functional, integration, and performance testing; own release management and Change Approval Board (CAB) processes
- Optimize operational velocity through forensic audits of estimations and story points and by mitigating technical dependencies
- Provide technical leadership and evolve the data engineering ecosystem using JavaScript, Python, Spark, CI/CD (Airflow, Jenkins), and AI/ML infrastructure for LLM orchestration
Technical Overview
You will govern distributed data pipelines using DataForm/DataFlow and deliver robust functional, integration, and performance testing to protect production stability. The technical scope includes Generative AI-ready data foundations with Pinecone and Weaviate, LLM orchestration, and healthcare data standards including HL7 and FHIR for EHR systems, alongside CI/CD automation using Airflow and Jenkins.
Ideal Candidate
The ideal candidate is a data engineering leader with strong experience owning end-to-end SDLC for high-scale data infrastructure and governing distributed data pipelines in multi-cloud environments, preferably Google Cloud Platform. They also have hands-on experience with Generative AI data foundations, including Vector and Graph databases (Pinecone, Weaviate) and LLM orchestration, plus healthcare domain knowledge (HL7, FHIR, EHR systems).
Must-Have Skills
- Enterprise Data Lifecycle Orchestration
- Advanced AI/ML Infrastructure: Hands-on experience architecting data foundations for Generative AI applications, specifically involving Vector & Graph Databases (e.g., Pinecone, Weaviate) and LLM orchestration
- Healthcare Domain Mastery: Deep understanding of clinical data standards (HL7, FHIR) and experience with EHR systems
- FinOps & Resource Optimization: Proven tra
Nice-to-Have Skills
3 years of leadership or management experience preferred
Tools & Platforms
Jira, DataForm, DataFlow, Google Cloud Platform (GCP), Airflow, Jenkins, Pinecone, Weaviate, HL7, FHIR, EHR systems, Stored Procedures, SQL scripts
Required Skills
enterprise data lifecycle orchestration, end-to-end SDLC, multi-cloud architecture, pipeline governance, DataForm, DataFlow, testing frameworks, functional testing, integration testing, performance testing, release management, Change Approval Board (CAB), CI/CD workflows, Airflow, Jenkins, JavaScript, Python, Spark, SQL scripts, Stored Procedures, Generative AI, vector databases, graph databases, Pinecone, Weaviate, LLM orchestration, HL7, FHIR, EHR systems, FinOps
Hard Skills
end-to-end SDLC, enterprise data lifecycle orchestration, data infrastructure, multi-cloud architecture, pipeline governance, distributed data pipelines, DataForm, DataFlow, automation, code quality, security compliance, testing frameworks, functional testing, integration testing, performance testing, release management, Change Approval Board (CAB) process, CI/CD workflows, Airflow, Jenkins, data ingestion concepts, data foundations for Generative AI applications, vector databases, graph databases, Pinecone, Weaviate, LLM orchestration, JavaScript, Python, Spark, SQL scripts, Stored Procedures, forensic audits, estimations, story points, Jira, healthcare analytical and operational data products, HL7, FHIR, EHR systems, Generative AI applications
Soft Skills
technical leadership, mentoring technical leads, guiding and leading teams, knowledge transfer, program-level delivery, cross-functional dependency mitigation, oversight of iteration goals, communication, culture building, workgroup throughput optimization, stakeholder alignment
Keywords for Your Resume
Manager, Data Engineering, Data Engineering manager, end-to-end SDLC, enterprise data lifecycle orchestration, multi-cloud architecture, pipeline governance, distributed data pipelines, GCP preferred, Google Cloud Platform, DataForm, DataFlow, automation, code quality, security compliance, functional testing, integration testing, performance testing, release management, Change Approval Board (CAB), CI/CD workflows, Airflow, Jenkins, JavaScript, Python, Spark, SQL scripts, Stored Procedures, vector databases, graph databases, Pinecone, Weaviate, LLM orchestration, HL7, FHIR, EHR systems, Generative AI applications, Jira, FinOps
Deal Breakers
- Hands-on experience architecting data foundations for Generative AI applications involving Vector & Graph Databases (Pinecone, Weaviate) and LLM orchestration
- Deep understanding of clinical data standards (HL7, FHIR) and experience with EHR systems