✦ Luna Orbit — AI & Machine Learning

Distinguished AI Engineer

at Capital One Financial

📍 3 Locations Unknown 💰 $269K – $307K USD / year Posted April 15, 2026
Salary $269K – $307K USD / year
Type Full-Time
Experience executive
Exp. Years Not specified
Education Not specified
Category AI & Machine Learning

Distinguished AI Engineer role focused on architecting and launching production conversational experiences powered by state-of-the-art Generative AI. You will design, develop, deploy, and support foundational AI components and drive performance optimization for large-scale systems.

  • Architect and launch conversational experiences powered by Generative AI
  • Design, develop, test, deploy, and support AI software components (foundation model training, LLM inference, similarity search, guardrails)
  • Leverage open source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch
  • Invent LLM optimization techniques to improve scalability, cost, latency, and throughput
  • Contribute to the technical vision and long-term roadmap for foundational AI systems

Build AI software components covering foundation model training and large language model inference, plus similarity search, guardrails, evaluation, experimentation, governance, and observability. Work with open source and SaaS AI technologies including AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch, and apply LLM optimization techniques for scalability, cost, latency, and throughput.

The ideal candidate is a senior AI engineer focused on Generative AI who can architect conversational experiences for large-scale customer delivery. They have deep hands-on experience with foundation model training, large language model inference, guardrails, evaluation, and LLM optimization, and are proficient with open source AI tooling such as Huggingface and PyTorch as well as AWS Ultraclusters and VectorDBs.

  • Ability to architect and launch conversational experiences powered by state-of-the-art Generative AI capabilities
  • Design, develop, test, deploy, and support AI software components including foundation model training and large language model inference
  • Leverage open source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch
  • Invent and introduce state-of-the-art LLM optimization techniques for scalability, cost, latency, and throughput
  • Contribute to technical vision and long-term roadmap for foundational AI systems
AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch
conversational experiences, Generative AI, foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, observability, AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, LLM optimization, scalability, cost, latency, throughput
architect and launch conversational experiences, Generative AI, large language model inference, foundation model training, similarity search, guardrails, model evaluation, experimentation, governance, observability, Open Source AI technologies, AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, LLM optimization techniques, scalability, cost optimization, latency optimization, throughput optimization
cross-functional collaboration, partnership with engineers and research scientists, technical vision contribution, communication, quality focus, ethical responsibility (responsible AI), taking pride in work, invent and introduce new techniques
Industry Banking
Job Function Architect and engineer production-grade LLM and Generative AI systems for customer-facing conversational experiences.
Role Subtype LLM Engineer
Tech Domains Python, Azure, Amazon Web Services, Kubernetes, Linux, AI & Machine Learning
Distinguished AI Engineer, AI Engineer, Generative AI, conversational experiences, large language model inference, foundation model training, similarity search, guardrails, model evaluation, experimentation, governance, observability, AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, LLM optimization techniques, scalability, cost, latency, throughput, responsible and reliable AI systems, machine learning, Intelligent Foundations and Experiences (IFX)

  • Demonstrated ability to design and deploy AI systems, including foundation model training and large language model inference
  • Hands-on experience with Generative AI tooling such as Huggingface and PyTorch
  • Experience implementing guardrails, model evaluation, and observability
