NVIDIA Logo

NVIDIA

Solutions Architect, Agentic AI

Reposted 4 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Santa Clara, CA, USA
152K-242K Annually
Senior level
In-Office or Remote
Hiring Remotely in Santa Clara, CA, USA
152K-242K Annually
Senior level
As a Solutions Architect at NVIDIA, you'll develop and deploy AI systems, optimize performance, and collaborate with partners to integrate AI solutions into enterprise environments.
The summary above was generated by AI

Join NVIDIA as a Solutions Architect to own the evolution of Agentic AI for the enterprise. You will collaborate with top-tier enterprise software companies to build and deploy sophisticated AI-native systems, focusing on multi-agent coordination, RAG-integrated workflows, and accelerated inference. By mastering NVIDIA’s core technologies—NIM, NeMo Framework, Dynamo, and Nemo Agent Toolkit—you will guide partners through the complexities of performance optimization and production-grade deployment. As a trusted advisor, you’ll transform raw LLM capabilities into high-performance, industry-focused enterprise agents.

What you'll be doing:

  • Build complex agentic systems featuring multi-agent coordination, long-horizon reasoning, and advanced planning frameworks.

  • Develop full-scale solutions, including domain-specific enterprise agents and high-performance retrieval pipelines (RAG) spanning various data sources.

  • Optimize inference performance by bringing to bear GPU-accelerated frameworks and the full NVIDIA AI infrastructure stack.

  • Build hands-on PoCs and reference architectures that serve as the blueprint for production-grade generative AI pipelines.

  • Collaborate alongside Enterprise ISVs to integrate NVIDIA software into native platforms, accelerating the deployment of production workloads.

  • Collaborate with diverse internal teams to improve NVIDIA software through feedback from real-world implementations.

  • Empower partner engineering teams through technical workshops, deep-dive architecture reviews, and developer enablement.

  • Scale global expertise by crafting reusable assets and documentation that help field teams deploy agentic AI at scale.

What we need to see:

  • BS/MS/PhD in Computer Science, Electrical Engineering, AI/ML, or equivalent experience.

  • More than 5 years of experience in deep learning, machine learning, or distributed AI systems.

  • Strong programming and debugging experience in Python, C/C++, and Linux environments.

  • Background in using deep learning libraries like PyTorch or TensorFlow.

  • Hands-on experience building LLM and generative AI applications.

  • Experience working with agentic or multi-agent AI systems employing frameworks such as:

1. LangGraph

2. LlamaIndex

3. CrewAI

4. LangChain

5. OpenAI Agents SDK or similar orchestration frameworks

  • Experience building tool-using AI agents that interact with APIs, databases, and enterprise systems.

  • Ability to rapidly prototype AI applications and build scalable GPU-accelerated architectures.

  • Excellent interpersonal skills and the ability to collaborate with engineering teams, partners, and executive collaborators.

Ways to Stand Out from the Crowd:

  • Experience working with NVIDIA GPUs and AI software, such as NVIDIA NIM, NeMo Framework, NeMo Retriever, and NeMo Agent Toolkit.

  • Experience with LLM evaluation frameworks, benchmarking systems, and safety guardrails for agentic workflows.

  • Experience optimizing reasoning-focused LLMs through timely engineering, quantization, or benchmarking.

  • Familiarity with Kubernetes/OpenShift, CI/CD automation, and cloud-native deployment patterns for AI systems.

  • Experience with parallel or distributed computing environments and AI workloads optimized for GPUs.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 17, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

25 Days Ago
Remote
2 Locations
134K-236K Annually
Senior level
134K-236K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Machine Learning • Software
The Principal Agentic AI Solutions Architect will design and implement AI solutions for customers, lead workshops, and ensure compliance with security and privacy regulations while driving AI adoption.
Top Skills: AWSAzureConversational AiGCPGenesys CloudJSONNlpRest Apis
3 Days Ago
In-Office or Remote
Santa Clara, CA, USA
148K-288K Annually
Senior level
148K-288K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Solutions Architect will develop AI applications at scale, focusing on machine learning, deep learning, and generative AI. Responsibilities include integrating enterprise data, providing feedback for software improvements, and collaborating with engineering teams.
Top Skills: C/C++KubernetesLinuxOpenshiftPythonPyTorchTensorFlow
An Hour Ago
Remote or Hybrid
USA
75K-125K Annually
Senior level
75K-125K Annually
Senior level
Machine Learning • Payments • Security • Software • Financial Services
Lead business analysis for Digital Identity projects: gather and document system requirements, define capabilities, create system flows, manage backlogs, roadmap and releases, mentor junior analysts, coordinate stakeholders, and drive process improvement within Agile frameworks.
Top Skills: ConfluenceDynatraceJIRAKanbanMS OfficePostmanSafeScrumServicenowSoapui

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account