NVIDIA Logo

NVIDIA

Senior Software Engineer - Agentic Memory

Reposted 18 Days Ago
Be an Early Applicant
In-Office or Remote
4 Locations
224K-357K Annually
Senior level
In-Office or Remote
4 Locations
224K-357K Annually
Senior level
The Senior Software Engineer will design benchmarks, build data pipelines, run experiments, contribute to open-source projects, and collaborate with teams to enhance agentic memory systems at NVIDIA.
The summary above was generated by AI

NVIDIA’s Agentic Memory team is seeking a Senior Software Engineer with experience using, developing and researching agents in a variety of applications. You’ll join a team of researchers with deep experience in building information retrieval systems, who are now working on advancing the state of the art of agentic memory.

At NVIDIA, we’re exploring the frontier of what agents are capable of and constantly pushing to improve them. Our work builds upon the efforts of dozens of teams in the NVIDIA ecosystem and focuses on measuring and improving the performance of agentic memory. Come be a part of our world-class team building the future of agents.

What you’ll be doing:

  • Designing novel benchmark tasks and evaluation methodologies that measure the effectiveness of agentic memory systems including semantic, episodic, and procedural memory across multi-session and multi-turn agent trajectories.

  • Building and maintaining synthetic dataset generation pipelines that produce realistic, enterprise-relevant evaluation data at scale.

  • Designing and running experiments to understand where agent memory falls short, diagnose root causes, and inform improvements.

  • Developing and contributing to open-source evaluation harnesses that enable rigorous, reproducible comparison of memory system architectures.

  • Partnering with teams across NVIDIA who are deploying agents to understand the role of memory in a variety of applications and help integrate improvements.

  • Contributing to public-facing benchmarks and leaderboards that advance the state of the art in agentic memory evaluation.

  • Integrating our work to leverage and improve the full NVIDIA software ecosystem, working across team boundaries in the spirit of extreme codesign.

  • Keeping up to date with the latest developments in agentic memory across academia and industry.

What we need to see:

  • Master’s degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math with 12+ years of experience

  • Hands-on experience developing agentic systems and pipelines, with a preference for those that integrate and involve memory.

  • An understanding of the state of the art in retrieval research, with a focus on agentic retrieval.

  • Knowledge of best practices in batching, streaming, and scaling of large-scale data pipelines to support real-world applications.

  • Excellent Python programming skills and a strong understanding of the Python deep learning ecosystem.

  • An ability to share and communicate your ideas clearly through blog posts, papers, GitHub, etc.

  • Excellent communication and interpersonal skills are required, along with the ability to work in a dynamic, product-oriented, distributed team. A history of mentoring junior engineers and interns is a plus.

  • Candidates with a Master's, Ph.D. or equivalent experience in retrieval or multimodal research are preferred, along with a track record of publication in leading conferences like CVPR, ICLR, ICCV, ECCV, KDD, etc.

Location is flexible, and the team is remotely situated, focusing on NA/EU time zones. We are looking for candidates in any country where NVIDIA has an office, and remote work is accepted.

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with most major technology providers and support a broad range of Fortune 500 companies in their machine and deep learning needs. With deep learning, we can teach AI to do almost anything. New internet services, like Google Assistant, have learned speech from sound and provide a more natural way to access information. Self-driving cars use deep learning to recognize the space they inhabit, the lanes they drive in, and the objects to avoid. In healthcare, neural networks trained with millions of medical images can find clues in MRIs that until now could only be found through invasive biopsies.

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We work with some of the most forward-thinking, versatile people in the world, and our engineering teams are growing fast in some of the most impactful fields of our generation: Artificial Intelligence, Data Science, Deep Learning. If you're a creative engineer who enjoys autonomy and shares our passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until April 12, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

5 Hours Ago
Easy Apply
In-Office or Remote
Easy Apply
175K-250K Annually
Senior level
175K-250K Annually
Senior level
Cloud • Information Technology • Security • Software
The role involves managing multiple global engineering teams, setting high engineering standards, defining technical execution plans, and improving engineering workflows with AI-assisted tools.
Top Skills: AWSAzureC++GCPGoJavaPython
5 Hours Ago
Easy Apply
In-Office or Remote
Easy Apply
175K-250K Annually
Senior level
175K-250K Annually
Senior level
Cloud • Information Technology • Security • Software
The Staff Product Manager will drive the remote authentication strategy and roadmap, ensuring a frictionless experience for users through effective product capabilities and customer advocacy.
Top Skills: Oauth 2.0Openid ConnectSaaSSAML
5 Hours Ago
Remote
US
163K-267K Annually
Senior level
163K-267K Annually
Senior level
Consumer Web • eCommerce • Machine Learning • Software • Sports • Analytics
The Director of Product Marketing will lead strategy and execution for PSA's Sports Grading business, focusing on product marketing strategy, go-to-market planning, customer experience, and category leadership while managing a high-performing team.

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account