✦ Luna Orbit — Software Engineering

Sr. Staff Software Engineer - GPU Network Software, RCCL, MPI, SHMEM

at Advanced Micro Devices

📍 Austin, Texas, United States Hybrid Posted March 14, 2026
Type Full-Time
Experience senior
Exp. Years Not specified
Education Not specified
Category Software Engineering

This role involves developing and optimizing GPU communication libraries for high-performance computing systems, focusing on multi-node GPU networking and open-source contributions.

  • Support AMD's RCCL; Design, implement, and test networking features; Benchmark, profile, and optimize code; Collaborate with technical experts; Deploy libraries on large clusters

The technical environment includes C, C++, Python, Linux Kernel, GPU networking protocols like RoCE, Libfabric, InfiniBand, and HPC benchmarking tools, primarily working within AMD's ROCm ecosystem.

The ideal candidate is a senior software engineer with extensive experience in GPU communication libraries, proficient in C, C++, and Python, with a strong background in high-performance computing and open-source development. They should have hands-on experience with GPU networking technologies like RoCE, Libfabric, and InfiniBand, and be capable of developing and optimizing multi-node GPU communication systems.

CC++PythonExperience with GPU communication librariesLinux KernelOpen source contributions
RoCELibfabricInfiniBandGPU Network DesignMPISHMEM
ROCmGitLinux
CC++PythonRoCELibfabricInfiniBandLinux KernelGPU NetworksMPISHMEMBenchmarkingProfilingDebuggingVersion Control
CC++PythonRoCELibfabricInfiniBandLinux KernelGPU NetworksMPISHMEMNetworkingOpen Source Software DevelopmentBenchmarkingProfilingDebuggingVersion Control
CommunicationCollaborationAttention to DetailResults-OrientedProblem-SolvingAdaptabilityTeamwork
Industry Semiconductors & Electronics
Job Function Developing GPU communication libraries for high-performance computing
Role Subtype Networking & Telecom
Tech Domains Linux, Open Source Software Development, Networking / TCP-IP, High Performance Computing
GPU communication librariesCC++PythonRoCELibfabricInfiniBandLinux KernelOpen Source Software DevelopmentBenchmarkingProfilingDebuggingVersion ControlMPISHMEMNetworkingGPU NetworksHigh Performance ComputingExascaleOpen SourceOpen source

Lack of experience with GPU communication libraries, No experience with Linux Kernel or open-source contributions, No background in high-performance computing, Unable to work in a hybrid environment

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile