Position Details
About this role
This role involves developing and optimizing generative AI models, focusing on inference performance and architecture improvements using frameworks like OpenVINO and deep learning libraries.
Key Responsibilities
- Develop generative AI models
- Optimize inference performance
- Implement model pruning and quantization
- Collaborate on architecture design
- Enhance deep learning workflows
Technical Overview
The technical environment includes Python, C++, PyTorch, TensorFlow, Hugging Face Transformers, OpenVINO, ONNX Runtime, with emphasis on model optimization, multithreading, and inference acceleration.
Ideal Candidate
The ideal candidate is a senior AI engineer with extensive experience in deep learning architectures, generative AI models, and proficiency with frameworks like PyTorch, TensorFlow, and OpenVINO, with strong skills in model optimization and multithreading.
Must-Have Skills
Nice-to-Have Skills
Tools & Platforms
Required Skills
Hard Skills
Soft Skills
Industry & Role
Keywords for Your Resume
Deal Breakers
Less than 3 years of experience with deep learning frameworks, No experience with OpenVINO or model optimization, Lack of experience in C/C++, Python
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile