NVIDIA Logo

NVIDIA

Senior DL Algorithms Engineer - Inference Performance

Reposted 20 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
184K-357K Annually
Senior level
In-Office or Remote
2 Locations
184K-357K Annually
Senior level
Implement language and multimodal model inference in NVIDIA Inference Microservices, contribute production code to TRT-LLM, profile and optimize inference performance across the full stack, benchmark SOTA model inference, and collaborate with software/hardware co-design teams to push inference performance.
The summary above was generated by AI

We are now looking for a Senior DL Algorithms Engineer! NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid to work across all layers of the hardware/software stack from GPU architecture to Deep Learning Framework to achieve peak performance, we want to hear from you! This role offers an opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution.
 

What you will be doing:

  • Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).

  • Contribute new features, fix bugs and deliver production code to TRT-LLM, NVIDIA’s open-source inference serving library.

  • Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance.

  • Benchmark state-of-the-art offerings in various DL models inference and perform competitive analysis for NVIDIA SW/HW stack.

  • Collaborate heavily with other SW/HW co-design teams to enable the creation of the next generation of AI-powered services.

What we want to see:

  • PhD in CS, EE or CSEE or equivalent experience.

  • 5+ years of experience.

  • Strong background in deep learning and neural networks, in particular inference.

  • Experience with performance profiling, analysis and optimization, especially for GPU-based applications.

  • Proficient in C++, PyTorch or equivalent frameworks.

  • Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture.

Ways to stand out from the crowd:

  • Proven experience with processor and system-level performance optimization.

  • Deep understanding of modern LLM architectures.

  • Strong fundamentals in algorithms.

  • GPU programming experience (CUDA or OpenCL) is a plus

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until February 22, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

13 Days Ago
In-Office or Remote
2 Locations
152K-288K Annually
Senior level
152K-288K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Senior DL Algorithms Engineer will optimize deep learning models for performance, analyze bottlenecks, and contribute to open-source frameworks while collaborating across teams for AI model development.
Top Skills: Computer ArchitectureCudaDeep LearningFlashinferGpu ArchitectureOpenclPyTorchSglangTrt-LlmVllm
An Hour Ago
Easy Apply
Remote or Hybrid
San Jose, CA, USA
Easy Apply
193K-275K Annually
Senior level
193K-275K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
This role involves leading AI partnerships, developing go-to-market strategies, engaging with partners, and executing collaborative initiatives in a high-growth environment.
Top Skills: AICybersecurityMlZero-Trust Architecture
An Hour Ago
Easy Apply
Remote or Hybrid
US
Easy Apply
95K-140K Annually
Senior level
95K-140K Annually
Senior level
Enterprise Web • Hardware • Internet of Things • Software
Manage enterprise sales for medical devices, including building relationships, navigating complex sales cycles, and closing deals while collaborating with stakeholders like Sales Engineering and Customer Success.
Top Skills: Manufacturing Execution System (Mes)Medical Device Sales

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account