NVIDIA Jobs

Senior Systems Software Engineer, Windows and Linux Enablement - DGX Station

NVIDIA

Senior Systems Software Engineer, Windows and Linux Enablement - DGX Station

Reposted 4 Days Ago

Be an Early Applicant

In-Office or Remote

Hiring Remotely in Santa Clara, CA

224K-357K Annually

Senior level

In-Office or Remote

Hiring Remotely in Santa Clara, CA

224K-357K Annually

Senior level

Own full-stack OS enablement for DGX Station including Windows platform ownership, Linux bring-up, firmware and driver enablement, and application validation across both systems.

The summary above was generated by AI

DGX Station is NVIDIA’s next-generation personal AI supercomputer—a deskside workstation built on the NVIDIA Grace Blackwell GB300 Superchip with massive coherent CPU+GPU memory, designed to bring data-center-class AI capabilities directly to the desks of researchers, developers, and AI engineers. As NVIDIA brings DGX Station to a broad set of customers, we need an engineer who can own full-stack OS enablement—from firmware and drivers through OS integration to ensuring AI applications run seamlessly on day one, with a primary focus on Windows and strong coverage of Linux.

This is a hands-on, technically deep role where you will be the go-to engineer for making DGX Station a first-class Windows platform while also driving its Linux bring-up and validation. You will work across NVIDIA’s GPU driver, CUDA, firmware, BMC, and AI software teams, collaborate closely with Microsoft and ODM/OEM partners, and ensure that developers and enterprise customers have a polished, production-ready experience on DGX Station across both operating systems.

What you’ll be doing:

Windows Platform Ownership (primary): Own end-to-end Windows enablement for DGX Station—driving the platform from initial bring-up on Windows through WHQL certification to customer-ready shipping quality. You are the single point of accountability for “DGX Station works on Windows.”
Linux Bring-up & Enablement: Drive Linux bring-up and continuous enablement for DGX Station on DGX OS / Ubuntu, including kernel module integration, device tree and ACPI configuration, systemd services, initramfs, and dkms packaging. Partner with the DGX OS and kernel teams to land platform support upstream and in NVIDIA’s distribution.
Firmware & Driver Enablement: Enable and validate BIOS/UEFI, BMC, and system-level firmware for Windows and Linux on the Grace (Arm) + Blackwell GB300 architecture. Work with firmware teams to ensure ACPI tables, SMBIOS, Secure Boot, measured boot, power management, and hardware abstraction layers are correct on both OSes.
GPU Driver Integration: Coordinate GPU driver, display driver, and compute driver bring-up and validation on Windows (WDDM, MCDM) and Linux (open-gpu-kernel-modules, DRM/KMS). Work with the NVIDIA driver team and Microsoft to resolve compatibility issues, achieve WHQL certification, and ensure driver stability across Windows Update and Linux kernel revisions.
CUDA & AI Stack Readiness: Ensure the CUDA toolkit, cuDNN, TensorRT, NCCL, and NVIDIA’s AI SDK stack are fully functional on DGX Station on both Windows and Linux. Validate AI/DL workload performance—training, fine-tuning, and inference—and work with the CUDA team to resolve gaps on the Arm + GB300 platform.
Application Validation: Validate that NVIDIA AI applications—NIM microservices, NemoClaw, AI Workbench, and developer tools—run correctly on DGX Station across Windows and Linux. Define and implement test plans covering single-user and multi-user scenarios, container runtimes, application installation flows, and developer workflows.
System Validation & Quality: Drive the overall test strategy for DGX Station on Windows and Linux: functional testing, stress testing, power/thermal validation, sleep/resume and S-state cycles, Windows Update and Linux kernel-upgrade compatibility, and long-duration reliability. Own bug triage and resolution across firmware, BMC, driver, and OS layers.
Partner Engagement: Be the primary technical interface with Microsoft (Windows on Arm, WHQL, driver signing) and ODM/OEM partners shipping DGX Station. Coordinate schedules, resolve cross-company technical blockers, and represent NVIDIA’s platform requirements on both OSes.
Performance Optimization: Profile and optimize system performance—boot time, GPU compute throughput, NVLink-C2C and memory bandwidth utilization, power efficiency, and thermal behavior. Identify bottlenecks across the stack on Windows and Linux and drive fixes with the appropriate teams.
Documentation & Enablement: Create and maintain platform documentation for DGX Station on Windows and Linux: bring-up guides, known issues, driver compatibility matrices, recovery and re-imaging procedures, and developer setup instructions. Enable field and support teams for customer deployments.

What we need to see:

BS or MS in Computer Science, Electrical Engineering, or related field (or equivalent experience) and 12+ yrs of confirmed experience in systems software engineering with deep expertise in Windows platform enablement, driver development, or OS integration, and proven hands-on experience bringing up Linux on new hardware platforms.
Strong hands-on experience with Windows internals: kernel-mode drivers, ACPI, power management, Secure Boot, UEFI, WDM/WDF driver frameworks, and the WHQL certification process.
Solid understanding of Linux platform enablement: kernel modules, device tree / ACPI on Arm, systemd, initramfs, dkms, and packaging for Ubuntu / DGX OS.
Experience with GPU driver stack, display drivers, or compute drivers on Windows and/or Linux. Familiarity with DirectX, WDDM, DRM/KMS, and GPU compute APIs is a strong plus.
Experience enabling hardware platforms—bring-up, driver integration, validation, and certification for shipping products on Windows and Linux.
Strong debugging and root-cause analysis skills across firmware, driver, and OS boundaries. Comfortable with WinDbg, kernel debugging (kd, kgdb/crash), crash dump analysis, ftrace/ETW, and performance profiling tools.
Ability to work across organizational boundaries—coordinating with GPU driver, CUDA, firmware, BMC, and AI software teams as well as external partners (Microsoft, ODM/OEMs).
Proficiency in C/C++ and Python. Experience with Arm architecture is a plus.

Ways to stand out from the crowd:

Experience with Windows on Arm platforms—driver enablement, performance optimization, or application compatibility on Arm-based Windows devices.
Hands-on experience with CUDA, TensorRT, or AI/ML frameworks on Windows and Linux—especially on Arm + NVIDIA GPU systems.
Prior experience working with OEM/ODM partners or silicon vendors on Windows and Linux platform certification for workstation- or server-class hardware.
Track record shipping workstation or server hardware products—from bring-up through general availability—with both Windows and Linux support.
Experience with BMC, Redfish, out-of-band management, or platform manageability software on high-end workstations or servers.Experience with GPU-accelerated applications: AI training and inference, content creation tools, or scientific computing on Windows and Linux.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you’re creative and autonomous, we want to hear from you!

We also welcome out-of-the-box problem solvers who can provide new ideas with a strong execution bias. Expect to be constantly challenged, improving, and evolving for the better. For two decades, we have pioneered visual computing, the art and science of computer graphics. Since the creation of the GPU, the engine of modern visual computing, the field has grown. It now involves video games, movie production, product composition, medical diagnosis, and scientific research. Today, we stand at the beginning of the next era, the AI computing era, ignited by a new computing model, GPU deep learning.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 2, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Liberty Mutual Insurance

Product Owner

31 Minutes Ago

Remote or Hybrid

United States

106K-225K Annually

Senior level

106K-225K Annually

Senior level

Artificial Intelligence • Fintech • Insurance • Marketing Tech • Software • Analytics

Lead the Personal Lines Policy product capability: define strategy and roadmaps, own and prioritize backlog across multiple squads, partner with Claims/Servicing, drive platform modernization and delivery, and ensure regulatory and stakeholder alignment.

Top Skills: APIsPlatform Data CapabilitiesPolicy Platform

Toast

Senior Software Engineer

46 Minutes Ago

Remote

159K-254K Annually

Senior level

159K-254K Annually

Senior level

Cloud • Fintech • Food • Information Technology • Software • Hospitality

Build and maintain core libraries and developer tooling used across the company. Lead complex projects, design and develop backend APIs and AI-assisted developer features, manage platform components (service templates, LaunchDarkly), troubleshoot production issues, conduct code reviews, influence architecture, and mentor engineers to improve developer productivity.

Top Skills: AIAWSBackstageCi/CdDockerEcsGitGoGradleGraphQLGrpcJavaKotlinKubernetesLaunchdarklyOpenapiPostgresReactTypescript

Toast

Consultant

46 Minutes Ago

Remote

65K-80K Annually

Mid level

65K-80K Annually

Mid level

Cloud • Fintech • Food • Information Technology • Software • Hospitality

Manage Spanish-language menu onboarding for restaurant customers: build and configure menus, create go-live plans and trainings, consult on operations, manage multiple onboarding engagements, and meet activation goals.

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus