✦ Luna Orbit — AI & Machine Learning

Principal Software Development Engineer, AI Open-Source Software

at Advanced Micro Devices

📍 Austin, Texas, United States Hybrid Posted March 14, 2026
Type Not Specified
Experience senior
Exp. Years Post college experience
Education Post college experience
Category AI & Machine Learning

This role involves working closely with strategic partners to develop and optimize AI solutions on AMD GPUs, requiring deep technical expertise in GPU programming and AI frameworks.

  • Collaborate with customers to deploy AI solutions
  • Develop GPU kernel code
  • Optimize AI model performance
  • Support AI inference and training frameworks
  • Provide technical expertise on AMD hardware

Environment includes C/C++, Python, CUDA, HIP, OpenCL, AI/ML frameworks like PyTorch and TensorFlow, with a focus on GPU kernel development, performance optimization, and distributed training.

The ideal candidate is a senior software engineer with expertise in GPU kernel programming, AI/ML frameworks, and distributed training, with strong collaboration and customer engagement skills, based in or willing to work in Austin, Texas.

Strong programming in C/C++ and PythonExperience with GPU kernel programmingExperience with AI/ML frameworksCustomer engagement experienceExperience with distributed training
Experience with HIP or OpenCLPerformance analysis and optimizationContainerization and orchestration
CUDAHIPOpenCLPyTorchTensorFlowJAXSingularity
CC++PythonGPU kernel programmingCUDAHIPOpenCLAI frameworksPyTorchTensorFlowJAXdistributed traininginference frameworkscontainerizationSingularity
CC++PythonCUDAHIPOpenCLGPU kernel programmingAI frameworksPyTorchTensorFlowJAXdistributed traininginference frameworkscontainerizationSingularity
CollaborationProblem-solvingTechnical communicationAutonomyCustomer engagement
Industry Technology
Job Function Develop and optimize AI software solutions on AMD GPUs
Role Subtype AI & Machine Learning
Tech Domains C, C++, Python, CUDA, HIP, OpenCL, AI frameworks, PyTorch, TensorFlow, JAX
CC++PythonGPU kernel programmingCUDAHIPOpenCLAI frameworksPyTorchTensorFlowJAXdistributed traininginference frameworkscontainerizationSingularity

Lack of experience with GPU programming, No experience with AI frameworks, No customer engagement experience

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile