Position Details
About this role
This role involves developing and optimizing large-scale ML systems, focusing on large language models and inference performance within AWS infrastructure.
Key Responsibilities
- Develop ML systems end-to-end
- Optimize inference performance
- Build reliable ML infrastructure
- Collaborate with scientists and engineers
- Manage GPU and accelerator hardware
Technical Overview
The environment includes AWS cloud infrastructure, GPU hardware, custom accelerators, and distributed ML systems, emphasizing inference optimization and scalable deployment.
Ideal Candidate
The ideal candidate is a mid-level machine learning engineer with experience in large language models, inference optimization, and distributed systems. They are proficient with GPU hardware and cloud environments, especially AWS.
Deal Breakers
- No experience with large language models
- Lack of inference optimization skills
- No background in distributed systems