About this role
Direct and govern end-to-end SDLC for high-scale data infrastructure, including GenAI data foundations and multi-cloud pipelines, with a healthcare focus on HL7/FHIR standards.
Key Responsibilities
- Enterprise Data Lifecycle Orchestration: Direct end-to-end SDLC for high-scale data infrastructure
- Multi-Cloud Architecture & Pipeline Governance: Govern design and optimization of data pipelines on Cloud environments (GCP preferred)
- Performance Engineering, QA Validation & Release management
- Operational Velocity & Dependency Mitigation
- Technical Leadership & Ecosystem Evolution
Technical Overview
Hands-on data engineering leadership with GCP expertise, data pipelines, AI/ML infrastructure, vector databases, and CI/CD tooling; experience with BigQuery, DataFlow, DataForm, Airflow, and Python/JavaScript.
Ideal Candidate
The ideal candidate is a data engineering leader with 3+ years of leadership experience, hands-on expertise in Generative AI data foundations, and strong GCP skills (BigQuery, DataFlow, DataForm). They understand HL7/FHIR healthcare data standards and can manage multi-cloud data pipelines.
Must-Have Skills
3 years of leadership or management experience preferred3 years of cumulative leadership/management experience preferredHands-on experience architecting data foundations for Generative AI applicationsExperience with Vector & Graph Databases (e.g.PineconeWeaviate) and LLM orchestrationDeep understanding of HL7FHIR standardsExperience with EHR systemsHealthcare regulatory requirements for data engineeringCost-optimization strategies for cloud data warehousingMonitoring tools to reduce egress costs and compute wasteGCP BigQuery or equivalent cloud database environments
Nice-to-Have Skills
Advanced AI/ML infrastructureVector databasesExperience with DataForm or DataFlowWeaviatePinecone
Tools & Platforms
Google Cloud PlatformBigQueryDataFlowDataFormAirflowJenkinsPineconeWeaviateHL7FHIREHR
Required Skills
3 years of leadership or management experience preferred; Hands-on experience architecting data foundations for Generative AI applications; Vector & Graph Databases (e.g.PineconeWeaviate) and LLM orchestration; HL7FHIR; EHR systems; Healthcare regulatory requirements; Cost optimization for cloud data warehousing; BigQuery or similar; Data pipelines; DataForm; DataFlow; Airflow; Jenkins; Python; JavaScript; Spark; SQL; Stored Procedures; CI/CD
Hard Skills
GCPGoogle Cloud PlatformDataFormDataFlowAirflowJenkinsPythonJavaScriptSparkSQL / PostgreSQLBigQueryPineconeWeaviateHL7FHIREHRCI/CDCloud cost optimization
Soft Skills
LeadershipMentoringCommunicationStrategic thinkingProblem-solvingCollaboration
Keywords for Your Resume
manager data engineeringremotegcpgoogle cloud platformdata pipelinesdataformdataflowairflowjenkinspythonjavascriptsparksqlbigquerypineconeweaviatehl7fhirehrhealthcarecare datagenerative ai
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile