NVIDIA Jobs

Principal Architect, AI Networking

NVIDIA

Principal Architect, AI Networking

Reposted Yesterday

Be an Early Applicant

In-Office or Remote

4 Locations

272K-431K Annually

Expert/Leader

In-Office or Remote

4 Locations

272K-431K Annually

Expert/Leader

Lead research and architectural direction for AI systems communication at scale, conducting original research, optimizing hardware-software performance, and mentoring engineers.

The summary above was generated by AI

An applied research team within NVIDIA’s Networking Systems & Software Architecture group is solving some of AI’s hardest infrastructure problems. The team builds systems-level software that moves data between GPUs, nodes, and storage at the speed modern AI demands—spanning low-level transport optimization, hardware-software co-design, and communication frameworks that plug directly into production AI stacks. The team's charter expands into emerging domains including quantum computing interconnects.

This Principal Architect role leads the research agenda and architectural direction for how NVIDIA’s AI systems communicate at scale—across GPUs, DPUs, NICs, and heterogeneous storage. It requires someone who defines project scope from scratch, publishes original work, and translates research breakthroughs into production-grade software that ships industry-wide!

What you will be doing:

Setting the long-term technical vision for distributed AI communication systems—GPU-to-GPU, GPU-to-storage, and cross-node data movement.
Conducting original research and prototyping next-generation networking solutions over RDMA, NVLink, and GPUDirect.
Driving hardware-software co-optimization with GPU, DPU, NIC, and network switch. Investigating fundamental bottlenecks in communication runtimes for large-scale AI workloads (KV cache transfer, disaggregated prefill/decode, model parallelism).
Integrating networking capabilities into AI serving stacks such as vLLM, SGLang, and TensorRT-LLM.
Publishing findings, representing NVIDIA in industry forums and standards bodies, and mentoring senior engineers across the organization.

What we need to see:

15+ years in systems software and/or networking with deep expertise in high-performance networking (InfiniBand, RoCE, RDMA, NVLink), communication libraries (e.g. NIXL, NCCL, UCX, MPI, NVSHMEM), and GPU accelerated systems, with track record of defining and delivering complex, cross-team technical initiatives from research concept to production.
MS, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
Deep understanding of computer architecture, memory hierarchies, DMA engines, and OS-level networking.
Understanding of ML systems concepts—transformer architectures, KV cache mechanics, model parallelism, or distributed training and inference patterns.
Proficiency in programming languages such as C, C++, Rust and Python.

Ways to stand out from the crowd:

Knowledge of ML inference frameworks (vLLM, SGLang, TensorRT-LLM) and their communication requirements.
CUDA programming and NVIDIA GPU architecture expertise.
Proved experience influencing product strategy and technical roadmap at a senior level.
Major open-source contributions.

With competitive salaries and a comprehensive benefits package, NVIDIA is widely regarded as one of the most desirable technology employers in the world. Our teams are composed of some of the most forward‑thinking and driven engineers in the industry, and we continue to grow rapidly. If you are a senior data engineer passionate about building large‑scale, high‑impact data platforms, we’d love to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until April 27, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

Deepgram

Technical Recruiter

An Hour Ago

Remote

USA

120K-180K Annually

Senior level

120K-180K Annually

Senior level

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI

Own full-cycle global recruiting for senior, hard-to-fill technical roles across Research, Engineering, and Product. Build proactive sourcing strategies, define role scope and interview plans with hiring managers, evaluate technical depth for AI-native products, deliver excellent candidate experience across time zones, use funnel metrics to improve hiring velocity, and collaborate with People Ops, Legal, and Finance to scale compliant international hiring.

Top Skills: Ai/Ml SystemsAPIsLarge Language Models (Llms)Real-Time InferenceSpeech-To-Text (Stt)Text-To-Speech (Tts)

Inspiren

Platform Engineer

An Hour Ago

Easy Apply

In-Office or Remote

United States

Easy Apply

180K-200K Annually

Senior level

180K-200K Annually

Senior level

Artificial Intelligence • Hardware • Healthtech • Software

The Senior Data Platform Engineer will manage and develop the data infrastructure on Databricks and AWS, ensuring scalable and efficient data capabilities while collaborating across teams.

Top Skills: AWSDatabricksKafkaKinesis

Vannevar

InfoSec Engineer - Compliance (ATO)

An Hour Ago

Remote

USA

Senior level

Artificial Intelligence • Machine Learning • Software • Defense

Lead the ATO process for classified environments, ensuring compliance with RMF and security standards while interfacing with government stakeholders.

Top Skills: AtoAws GovcloudAzure GovernmentDisa StigsEmassGoogle GovernmentKubernetesNist 800-53OpenshiftRmfXacta

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

NVIDIA

Principal Architect, AI Networking

NVIDIA Santa Clara, California, USA Office

Similar Jobs

Technical Recruiter

Platform Engineer

InfoSec Engineer - Compliance (ATO)

What you need to know about the San Francisco Tech Scene

Key Facts About San Francisco Tech