Position Details
About this role
This Senior Distinguished AI Engineer role focuses on building and deploying AI software components within the Intelligent Foundations and Experiences (IFX) team. The position emphasizes production LLM capabilities, including guardrails, evaluation, experimentation, governance, and observability, as well as LLM optimization for scalability, cost, latency, and throughput.
Key Responsibilities
- Develop and deploy AI software components (foundation model training, LLM inference)
- Implement similarity search, guardrails, evaluation, experimentation, governance, and observability
- Leverage AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, and PyTorch
- Invent LLM optimization techniques to improve scalability, cost, latency, and throughput
- Partner with cross-functional teams to deliver AI-powered products
Technical Overview
The engineer designs, develops, tests, deploys, and supports AI components such as foundation model training and large language model inference, along with similarity search and guardrails. The work involves open-source and SaaS AI technologies, including AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, and PyTorch, and applies state-of-the-art LLM optimization techniques to large-scale production systems.
Ideal Candidate
The ideal candidate is a senior AI engineer focused on production LLM systems, with deep experience in foundation model training and large language model inference. They have hands-on experience implementing guardrails, model evaluation, experimentation, governance, and observability, and they optimize large-scale AI systems for scalability, cost, latency, and throughput. They are proficient with AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, and PyTorch, and can partner effectively with cross-functional teams to deliver AI-powered products.
Deal Breakers
- Hands-on experience with foundation model training and large language model inference
- Ability to implement guardrails and conduct model evaluation and experimentation
- Experience leveraging AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, and PyTorch
- Experience inventing LLM optimization techniques for production systems