NVIDIA

Senior Deep Learning Engineer

Reposted 16 Days Ago
In-Office
Santa Clara, CA, USA
152K-288K Annually
Senior level
The Senior Deep Learning Engineer will develop optimizations for AI inference workloads, collaborate across teams, and stay updated on AI advancements.

We are now looking for a Senior Deep Learning Engineer! At NVIDIA, we are at the forefront of advancing the capabilities of artificial intelligence. We are seeking an ambitious and forward-thinking senior deep learning engineer to contribute to the development of next-generation inference optimizations targeting frontier workloads including multi-agent AI systems, generative multimodal models, and inference-time compute scaling. In this role, you will characterize these emerging workloads and develop novel methods to optimize for them across inferencing engines, systems, and hardware architectures. Your work will span multiple tiers of the inference stack from the algorithmic to system level.

As NVIDIA makes significant strides in AI datacenters, our team holds a central role in maximizing the efficiency of our exponentially growing inference deployment needs and establishing a data-driven approach to algorithmic improvements, hardware design, and system software development. We collaborate extensively with diverse teams at NVIDIA, from deep learning research and framework development to silicon architecture. Thriving in such a high-impact, interdisciplinary environment requires not only technical proficiency but also a growth mindset and a pragmatic attitude, qualities that fuel our collective success in shaping the future of datacenter technology.

What You'll Be Doing:

  • Continuously keeping up to date on the latest advancements in generative AI research.

  • Analyzing and prototyping emerging workloads in multi-agent AI systems, generative multimodal models, and inference-time compute scaling.

  • Pioneering and developing optimizations for these workloads across the inference stack to push the boundaries of inferencing quality and speed on NVIDIA systems. 

  • Collaborating closely with production teams to incorporate the latest advancements into cutting-edge software frameworks.

What We Need to See:

  • Master's degree (or equivalent experience) in Computer Science, Artificial Intelligence, Applied Mathematics, or related fields.

  • A strong foundation in deep learning, with a particular emphasis on generative models and inferencing.

  • A track record of at least 5 years of relevant software development experience in modern deep learning frameworks such as PyTorch.

  • Growth mindset and pragmatic attitude.

Ways to Stand Out From the Crowd:

  • Published research or noteworthy contributions to the field of deep learning, particularly in areas such as inference-time compute, multimodal generation, and AI systems.

  • Experience with prototyping or deployment of agentic AI systems and/or multimodal generation models.

  • Experience collaborating across algorithms, software, and performance teams to deliver high-quality solutions.

  • Familiarity with computer architecture and how it relates to AI algorithm development.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 31, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States
