NVIDIA Logo

NVIDIA

Infrastructure Systems Software Engineer

Reposted 4 Days Ago
Be an Early Applicant
In-Office
Santa Clara, CA, USA
152K-288K Annually
Senior level
In-Office
Santa Clara, CA, USA
152K-288K Annually
Senior level
Design and build distributed systems for NVIDIA's infrastructure, focusing on performance, reliability, and developer productivity to enhance workflows in chip design.
The summary above was generated by AI

As a Software Engineer in NVIDIA’s Internal Infrastructure Group, you’ll design and build distributed systems that power the workflows behind our next generation of GPUs and AI chips. The software you create will help thousands of engineers develop world-changing technology faster, more efficiently, and at scale. You’ll help scale the infrastructure that validates the world’s most advanced GPUs.

What you'll be doing:

  • Build and extend scalable, high-performance infrastructure services, platforms, and tools that improve reliability and developer productivity across NVIDIA’s chip-design ecosystem.

  • Design and optimize distributed workflows that orchestrate millions of regression and validation workloads across heterogeneous compute clusters.

  • Own systems end-to-end, from gathering requirements and proposing technical designs to implementation, performance analysis, testing, and deployment.

  • Collaborate with internal teams to understand workflows, identify bottlenecks, and deliver automation that accelerates engineering workflows.

  • Analyze and tune system performance across distributed services using profiling, tracing, and telemetry to help bring next-generation hardware and AI models to market faster.

What we need to see:

  • BS or MS in Computer Science or a related field (or equivalent experience).

  • 5+ years of professional software development experience.

  • Strong understanding of data structures, algorithms, concurrency, and system design.

  • Proficiency in modern programming languages (Python, C++, Go, or similar) on Linux systems, with experience building large-scale services, infrastructure tooling, or distributed systems.

  • Ability to reason about trade-offs between performance, reliability, and maintainability.

Ways to stand out from the crowd:

  • Experience developing or scaling distributed systems and internal developer tools.

  • Passion for improving engineering workflows and enabling others to move faster.

  • Hands-on familiarity with profiling, tracing, or performance-optimization techniques.

  • Understanding of chip-design, verification, or modern ML workflows.

NVIDIA is widely recognized as one of the most desirable employers in technology. We attract some of the world’s most forward-thinking and dedicated engineers. If you’re driven by curiosity, care deeply about performance and reliability, and love building systems that empower other developers, we want to meet you.
#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 15, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C++
Go
Linux
Python
HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

Yesterday
In-Office or Remote
224K-431K Annually
Senior level
224K-431K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
This role involves designing, building, and managing large-scale infrastructure automation and distributed systems, ensuring reliability and efficiency in cloud services.
Top Skills: ContainersDockerGoKubernetesLinuxNetworkingOpenstackPerlPythonRubySlurmStorage
2 Days Ago
Hybrid
140K-172K Annually
Mid level
140K-172K Annually
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
The Software Engineer: Resiliency develops and maintains systems for managing Cloudflare's infrastructure at scale, ensuring reliability and service level capacity through innovative solutions.
Top Skills: Cloudflare WorkersDurable ObjectsGoGrafanaKubernetesPrometheusPythonR2SentryTypescriptWorkers Kv
Senior level
Fintech • Financial Services
Lead the Risk Analytics team to develop valuation frameworks for credit card origination strategies, utilizing advanced analytics and financial modeling. Responsibilities include managing teams, influencing stakeholders, and optimizing marketing budgets for Direct Mail acquisitions.
Top Skills: PythonSAS

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account