About this role
NVIDIA seeks a Distinguished Engineer to own the end-to-end architecture of data center system software products, including firmware, kernel drivers, and user-mode drivers. You will guide collaborations with hyperscalers, align roadmaps, and drive adoption of new technologies and protocols.
Key Responsibilities
- Serve as the primary technical point of contact for major customers
- Lead technical innovation with hyperscalers to architect next-generation data center products
- Align NVIDIA's roadmap with customer requirements through direct engagement
- Develop and drive adoption of new technologies and protocols
- Make critical technical decisions in ambiguous situations, mitigating risks with left-shift strategies
Technical Overview
This is a senior system software architect role focused on firmware, kernel drivers, and user-mode drivers for accelerators, with emphasis on Linux, OpenBMC, Redfish/IPMI, MCTP/PLDM/SPDM, and high-performance networking (InfiniBand). It also calls for experience with CUDA/cuDNN/DOCA and HPC stacks, strong cross-functional leadership, and risk mitigation skills.
Ideal Candidate
The ideal candidate is a senior system software architect with 20+ years in system architecture for server software, firmware, and hardware interconnects. They should excel at cross-functional leadership, be comfortable with industry standards (OCP/DMTF), and have deep experience with Linux, firmware, and high-performance networking.
Must-Have Skills
- BS or MS degree in Computer Science, Electrical Engineering, or related field (or equivalent experience).
- 20+ years in the area of system architecture and design.
- Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces.
- Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs).
- Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals.
- Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE), and system management protocols (Redfish, IPMI).
- Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, and advanced switching/routing concepts.
- Experience collaborating with platform security experts to balance security and usability.
- Demonstrated success leading cross-functional projects without direct authority.
Nice-to-Have Skills
- Cloud and cluster-level deployment and management experience
- Participation in standards bodies such as OCP and DMTF
- Knowledge of NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA)
- Knowledge of enterprise storage architectures and distributed parallel processing
Tools & Platforms
CUDA, cuDNN, DOCA
Required Skills
- BS or MS in Computer Science, Electrical Engineering, or related field (or equivalent experience)
- 20+ years in system architecture and design
- Deep expertise in scalable server system architecture
- SW/HW interfaces
- Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs)
- Mastery of system firmware (SBIOS, OpenBMC)
- Embedded systems
- Linux kernel internals
- Proficiency in Out-of-Band and In-Band management architectures (MCTP, PLDM, SPDM, RDE)
- System management protocols (Redfish, IPMI)
- Networking (TCP/IP, Ethernet, InfiniBand)
- Security collaboration
- Leadership in cross-functional projects
- Left-shift risk strategies
Hard Skills
Linux, Networking / TCP-IP, InfiniBand, System firmware, SBIOS, OpenBMC, Embedded systems, Linux kernel internals, Out-of-Band management, In-Band management, MCTP, PLDM, SPDM, RDE, Redfish, IPMI, TCP/IP, Ethernet
Soft Skills
Leadership, Strategic thinking, Cross-functional collaboration, Decision making, Mentoring
Keywords for Your Resume
Distinguished Engineer, Data Center System Software Architect, NVIDIA, system software, Linux kernel internals, OpenBMC, SBIOS, firmware, GPU, DPU, FPGA, MCTP, PLDM, SPDM, RDE, Redfish, IPMI, TCP/IP, Ethernet, InfiniBand, OCP, DMTF, CUDA, cuDNN, DOCA, hyperscaler, HPC, cloud deployment, linux kernel, embedded systems, Networking / TCP-IP