NVIDIA Logo

NVIDIA

Senior Software Engineer - HPC

Reposted 2 Days Ago
Be an Early Applicant
In-Office
Santa Clara, CA, USA
152K-242K Annually
Senior level
In-Office
Santa Clara, CA, USA
152K-242K Annually
Senior level
As a Senior Software Engineer at NVIDIA, you will enhance HPC infrastructure, improve systems reliability, and optimize cloud operations, focusing on distributed systems and automation.
The summary above was generated by AI

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are looking for a Senior Software Engineer to join our mission to continue improving our HPC infrastructure. Our team builds and operates sophisticated infrastructure to enable business critical services and AI applications. You will be working with a team of passionate and skilled engineers that are continuously working to provide better tools to build and manage this infrastructure. The ideal candidate is strong in software development, crafting and building reliable distributed systems, and has the ability to implement well thought out long term maintenance strategy. 

What you’ll be doing:

  • Apply modern distributed systems patterns to push the limits of scale, latency, and reliability.

  • Continuously improve infrastructure provisioning and operations with automation, APIs, and self‑service platforms.

  • Operate in a globally distributed, hybrid multi‑cloud environment (AWS, GCP, on‑prem), building systems that are cloud‑native and location‑agnostic.

  • Build strong cross-functional relationships and align with collaborators across various business units.

  • Improve uptime and Quality of Service (QoS) through data-driven operations, strong SLOs, and robust incident practices.

  • Participate in the team’s on‑call rotation and lead high‑impact incident response when needed.

What we need to see:

  • Strong coding skills in at least two of: Go, Java, C/C++, Scala, Python, Elixir, with a focus on backend, systems, or infrastructure engineering.

  • Deep understanding of scalability, consistency, and performance trade‑offs in server‑side systems; ability to build horizontally scalable, resilient, and low‑latency services.

  • Experience owning services end‑to‑end: architecture, build reviews, implementation, testing, rollout, observability, and iterative improvement.

  • Hands‑on experience with at least one major cloud provider (GCP, AWS, or Azure) and cloud‑native primitives (managed storage, messaging, compute).

  • Proficiency with modern CI/CD, GitOps workflows, and Infrastructure as Code practices for safe, repeatable changes.

  • Bias for action, strong problem‑solving skills, and a track record of simplifying complex systems.

  • B.S. in Computer Science or related field (or equivalent experience), with 5+ years of relevant experience.

  • Careful communication and collaboration skills; comfortable guiding technical decisions across teams.

Ways to stand out from the crowd:

  • Prior experience building core infrastructure or control planes for HPC clusters, large-scale AI/ML platforms, or systems managed by job schedulers (e.g., Slurm or Kubernetes).

  • Maintainer or co‑maintainer responsibilities for an open source component used in production (plugins, operators, exporters, controllers, or SDKs) at large scale.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 13, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

AWS
Azure
C/C++
Ci/Cd
Elixir
GCP
Gitops
Go
Java
Python
Scala
HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

16 Minutes Ago
Easy Apply
Hybrid
Easy Apply
90K-113K Annually
Mid level
90K-113K Annually
Mid level
Cloud • Edtech • Healthtech • Mobile • Social Impact • Software • Data Privacy
As a Software Engineer II, you will design and implement full-stack features, contribute to technical design, and ensure high-quality code through collaboration with cross-functional teams.
Top Skills: MicroservicesOrmsPythonReactSQL
16 Minutes Ago
In-Office
Senior level
Senior level
Artificial Intelligence • Hardware • Information Technology • Machine Learning
As a Senior Memory Sustaining Design Engineer, you will design and optimize High Bandwidth Memory solutions, evaluate silicon issues, and collaborate with various engineering teams.
Top Skills: CmosFastspiceFinesimHspicePerlPythonTclVerilog
16 Minutes Ago
In-Office
Expert/Leader
Expert/Leader
Artificial Intelligence • Hardware • Information Technology • Machine Learning
Define micro-architecture for digital subsystems, implement RTL designs in SystemVerilog, collaborate with DV teams, and support post-silicon validation.
Top Skills: Chip Design MethodologiesDftDigital DesignLow-Power DesignSoc IntegrationSystemverilog

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account