Senior Director, System Software Engineering – DGX Cloud

US, CA, Santa Clara
Full Time, On-site

Job Description

NVIDIA is seeking a Senior Director, System Software Engineering, to lead strategy and execution for capacity management in DGX Cloud, building the capacity foundation for NVIDIA's internal AI research clusters. This leader will shape the roadmap for scalable system software that automates GPU management at scale, drive execution across teams and functions, and partner closely with architecture, security, product, and developer platform leaders to deliver reliable, high-performance software that powers the next generation of accelerated computing. The ideal candidate combines deep systems expertise with strong organizational leadership and technical judgment, and builds teams that deliver sophisticated platform software at scale.

What you'll be doing:

  • Define and drive the system software strategy for capacity management and automation for DGX Cloud's GPU cloud platforms, aligning long-range technical direction with business and product priorities.

  • Lead engineering leaders responsible for core platform capabilities such as runtime software, host and cluster management, provisioning, observability, reliability, security, and performance optimization.

  • Build a strong execution model across planning, architecture reviews, release readiness, quality, and operational excellence for software delivered across on-prem and cloud environments.

  • Partner closely with security, DevOps, research, and product organizations to translate platform requirements into scalable software roadmaps and high-quality releases.

  • Establish measurable goals for engineering efficiency, service reliability, software quality, and customer impact, using data to continuously improve delivery and operations.

  • Attract, develop, and retain world-class engineering leaders while fostering technical excellence, accountability, inclusion, and innovation.

What we need to see:

  • BS, MS, or PhD in Computer Science, Computer Engineering, or a related technical field, or equivalent experience.

  • 16+ years of relevant management experience in system software, platform software, or distributed systems engineering, including 7+ years of significant experience leading engineering organizations.

  • Deep technical expertise in operating systems, distributed systems, platform architecture, cloud infrastructure, or large-scale systems software.

  • Demonstrated experience leading delivery of complex software platforms spanning reliability, performance, scalability, security, and observability.

  • Strong record of leadership and influence across engineering, product, program management, and executive teams.

  • Demonstrated success building and leading high-performing teams, developing leaders, and scaling organizations through growth and change.

  • Excellent technical communication and decision-making, with the ability to connect architecture choices to business outcomes.

  • Demonstrated experience with industry-leading AI tools that help engineers and engineering leaders work more efficiently.

Ways to stand out from the crowd:

  • Experience with AI infrastructure, accelerated computing, GPU-optimized software stacks, or large-scale training and inference environments.

  • Experience leading platform software for cloud-native or hybrid-cloud deployments.

  • Track record of driving architectural simplification and operational excellence across large, complex engineering portfolios.

  • Experience partnering with open-source communities and ecosystem partners on platform adoption and enablement.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family at www.nvidiabenefits.com/

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 384,000 USD - 575,000 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 30, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About Nvidia

NVIDIA is one of the most influential technology companies in the world, powering the modern era of artificial intelligence, high-performance computing, graphics, and autonomous systems. Originally known for its leadership in gaming GPUs, NVIDIA has evolved into the backbone of AI infrastructure, designing the chips, software, and systems that train and deploy large-scale AI models used across industries from healthcare and robotics to autonomous vehicles and scientific computing.

For job seekers, NVIDIA offers opportunities at the forefront of deep tech, spanning software engineering, AI research, systems engineering, hardware design, networking, robotics, and developer tools. A major focus of its work is the CUDA software platform and AI ecosystem, which enables developers to program GPUs at massive scale and has become foundational to modern machine learning and data center computing. This makes NVIDIA especially attractive to engineers, researchers, and technologists who want to work directly on the infrastructure powering today’s AI revolution.

Unlike traditional hardware companies, NVIDIA operates as a full-stack computing platform company, integrating silicon, systems, and software into a unified ecosystem. Employees may work on everything from GPU architecture and data center systems to AI frameworks, simulation platforms like Omniverse, and autonomous vehicle technology through the DRIVE platform. This breadth allows teams to operate at the intersection of research and production-scale deployment, with direct impact on global computing infrastructure.

As demand for AI, accelerated computing, and autonomous systems continues to grow rapidly, NVIDIA remains one of the most important employers in technology and advanced engineering. For professionals seeking a high-impact career at the center of AI development—where breakthroughs quickly translate into real-world systems at global scale—NVIDIA stands out as one of the most dynamic and sought-after destinations in the industry.
