Position Details
About this role
This role involves designing, developing, and deploying large language model applications on AWS, focusing on performance, scalability, and cost-efficiency in a production environment.
Key Responsibilities
- Design and deploy LLM applications
- Optimize inference costs
- Build scalable GenAI systems
- Implement observability and monitoring
- Mentor engineering teams
Technical Overview
The technical environment includes AWS cloud services such as SageMaker, Lambda, ECS/EKS, and monitoring tools like CloudWatch and Datadog. The stack emphasizes AI/ML model optimization, prompt engineering, and scalable infrastructure.
Ideal Candidate
The ideal candidate is a senior AI/ML engineer with extensive hands-on experience designing and deploying large language models on AWS cloud infrastructure. They possess strong expertise in prompt engineering, model optimization, and scalable system architecture, with a proven track record of leading AI projects in production environments.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Lack of AWS cloud services experience, No experience with LLM architectures, No background in AI/ML development, Absence of cloud security knowledge
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile