NVIDIA Logo

NVIDIA

Senior Research Scientist, Post-Training LLM and DLM

Reposted 7 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
160K-299K Annually
Senior level
In-Office or Remote
2 Locations
160K-299K Annually
Senior level
The role involves designing and implementing post-training algorithms for LLMs and DLMs, improving training pipelines, collaborating with researchers, and demonstrating engineering practices.
The summary above was generated by AI

We are now looking for a Senior Research Scientist passionate about Large Language Model (LLM) and Diffusion Language Model (DLM) post-training and system optimization. Are you excited to shape the future of large-scale generative AI? NVIDIA is at the forefront of foundation models and generative AI systems, enabling cutting-edge research and real-world deployment at unprecedented scale. Our team is dedicated to advancing post-training algorithms, building efficient large-scale systems, and developing evaluation frameworks to ensure reliability and scalability. Join us to work with world-class researchers and engineers on building the next generation of AI.

What you will be doing:

  • Designing and implementing post-training algorithms LLMs and DLMs.

  • Driving efficiency and scalability improvements across training pipelines and serving systems

  • Collaborating with researchers to translate cutting-edge ideas into production-ready implementations.

  • Exploring new paradigms for evaluation.

  • Demonstrating strong engineering practices, and contributing to open-source communities.

What we need to see:

  • PhD in Computer Science, Electrical Engineering, or related field, or equivalent research experience in LLMs, systems, or related areas.

  • 2+ years of experiences in machine learning, systems, distributed computing, or large-scale model training.

  • Proficiency in Python with hands-on experience in frameworks such as PyTorch.

  • Solid background in computer science fundamentals: algorithms, data structures, parallel/distributed computing, and systems programming.

  • Proven ability to collaborate across research and engineering teams in multifaceted environments.

Ways to stand out from the crowd:

  • Expertise in post-training LLMs with novel algorithmic/data pipelines

  • Experience developing andscaling large distributed systems for deep learning.

  • Contributions to open-source LLM systems or large-scale AI infrastructure.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and hard-working people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 264,500 USD for Level 3, and 192,000 USD - 304,750 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 16, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

An Hour Ago
Remote
United States
200K-250K Annually
Expert/Leader
200K-250K Annually
Expert/Leader
Artificial Intelligence • Blockchain • Professional Services • Security • Consulting • Cybersecurity • Defense
The Principal Security Engineer leads projects, drives technical vision, mentors engineers, engages in business development, and oversees security software development.
Top Skills: C++GoJavaPythonRust
3 Hours Ago
Remote or Hybrid
70K-84K Annually
Mid level
70K-84K Annually
Mid level
Fintech • Legal Tech • Software • Financial Services • Cybersecurity • Data Privacy
Create, edit, and manage a library of training materials and eLearning assets for tax and licensing software. Collaborate with SMEs to produce guides, courses, videos, SCORM packages, in-app tours, and customer communications, ensuring brand and quality standards across publishing platforms.
Top Skills: Adobe AcrobatAdobe FramemakerAdobe PremiereArticulate Rise 360Articulate StorylineBoxCamtasiaGitHTMLMadcap FlareMarkdownMicrosoft WordNetSuiteScormSharepointSnagitThought Industries LmsXML
3 Hours Ago
Remote
United States
180K-220K Annually
Mid level
180K-220K Annually
Mid level
Software • Defense
Design and productize AI decision-making systems for a game engine, utilizing HTNs and GOAP to create intelligent behaviors for entities.
Top Skills: Goal-Oriented Action Planning (Goap)Hierarchical Task Networks (Htns)State MachinesVisual Scripting Editor

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account