Position Details

Salary $314K – $359K USD / year

Type Not Specified

Experience senior

Exp. Years Not specified

Education Not specified

Category AI & Machine Learning

About this role

This role focuses on building and deploying production AI capabilities, especially foundation models and large language model systems. You will optimize LLM performance, implement guardrails and evaluation, and help shape the long-term technical vision for foundational AI systems.

Key Responsibilities

Partner with cross-functional teams to deliver AI-powered products
Design, develop, test, deploy, and support AI software components
Leverage AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch
Invent state-of-the-art LLM optimization techniques
Contribute to technical vision and long term roadmap

Technical Overview

You will design and operate AI software components spanning foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The work uses AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch, with a focus on LLM optimization for scalability, cost, latency, and throughput.

Ideal Candidate

The ideal candidate is a senior AI engineer focused on production large language model systems, including foundation model training and large language model inference. They bring hands-on experience with guardrails, model evaluation, experimentation, governance, and observability, and they are strong in LLM optimization for scalability, cost, latency, and throughput.

Must-Have Skills

Designdeveloptestdeployand support AI software components including foundation model traininglarge language model inferencesimilarity searchguardrailsmodel evaluationexperimentationgovernanceand observabilityInvent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalabilitycostlatencythroughput - of large scale production AI systemsContribute to the technical vision and the long term roadmap of foundational AI systems at Capital One

Tools & Platforms

AWS UltraclustersHuggingfaceVectorDBsNemo GuardrailsPyTorch

Required Skills

foundation model traininglarge language model inferencesimilarity searchguardrailsmodel evaluationexperimentationgovernanceobservabilityLLM optimizationAWS UltraclustersHuggingfaceVectorDBsNemo GuardrailsPyTorch

Hard Skills

foundation model traininglarge language model inferencesimilarity searchguardrailsmodel evaluationexperimentationgovernanceobservabilityLLM optimization techniquesperformance scalabilityperformance costperformance latencyperformance throughputAWS UltraclustersHuggingfaceVectorDBsNemo GuardrailsPyTorchresponsible AIscalable AIhigh-performance AI infrastructureproduction AI systemstechnical roadmaptechnical vision

Soft Skills

cross-functional collaborationshare passion to do the right thingstaying abreast of latest researchability to understand scientific publicationsjudiciously apply novel techniques in productionadaptabilitythriving on bringing claritytechnical vision and long term roadmap contributionquality pride

Industry & Role

Industry Banking

Job Function Develop and optimize production foundation and LLM capabilities for responsible, scalable enterprise AI.

Role Subtype LLM Engineer

Tech Domains Amazon Web Services, Python, AI & Machine Learning, Kubernetes, Docker

Keywords for Your Resume

Sr. Distinguished AI EngineerDistinguished AI EngineerAI EngineerLLM optimizationlarge language model inferencefoundation model trainingsimilarity searchguardrailsmodel evaluationexperimentationgovernanceobservabilityAWS UltraclustersHuggingfaceVectorDBsNemo GuardrailsPyTorchresponsible AIscalable AI infrastructureproduction AI systems

Deal Breakers

Must be able to design, develop, test, deploy, and support AI software components including foundation model training and large language model inference, Must have experience with LLM optimization to improve scalability, cost, latency, and throughput

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile