NVIDIA Logo

NVIDIA

Senior AI Infrastructure Software Engineer

Reposted 2 Days Ago
In-Office
5 Locations
184K-357K Annually
Senior level
In-Office
5 Locations
184K-357K Annually
Senior level
Design and build scalable AI infrastructure, improve architecture and performance, collaborate with teams, and contribute to advancements in AI technologies.
The summary above was generated by AI

You will collaborate closely with researchers to design and scale agents - enabling them to reason, plan, call tools and code just like human engineers. You will work on building and maintaining the core infrastructure for deploying and running these agents in production, powering all our agentic tools and applications and ensuring their seamless and efficient performance. If you're passionate about the latest research and cutting-edge technologies shaping generative AI, this role and team offer an exciting opportunity to be at the forefront of innovation. 

What you'll be doing:  

  • Design, develop, and improve scalable infrastructure to support the next generation of AI applications, including copilots and agentic tools. 

  • Drive improvements in architecture, performance, and reliability, enabling teams to bring to bear LLMs and advanced agent frameworks at scale. 

  • Collaborate across hardware, software, and research teams, mentoring and supporting peers while encouraging best engineering practices and a culture of technical excellence. 

  • Stay informed of the latest advancements in AI infrastructure and contribute to continuous innovation across the organization. 

What we need to see: 

  • Master or PhD or equivalent experience in Computer Science or related field, with a minimum of 5 years in large-scale distributed systems or AI infrastructure. 

  • Advanced expertise in Python (required), strong experience with JavaScript, and deep knowledge of software engineering principles, OOP/functional programming, and writing high-performance, maintainable code. 

  • Demonstrated expertise in crafting scalable microservices, web apps, SQL, and NoSQL databases (especially MongoDB and Redis) in production with containers, Kubernetes, and CI/CD.  

  • Solid experience with distributed messaging systems (e.g., Kafka), and integrating event-driven or decoupled architectures into robust enterprise solutions. 

  • Practical experience integrating and fine-tuning LLMs or agent frameworks (e.g., LangChain, LangGraph, AutoGen, OpenAI Functions, RAG, vector databases, timely engineering). 

  • Demonstrated end-to-end ownership of engineering solutions, from architecture and development to deployment, integration, and ongoing operations/support. 

  • Excellent communication skills and a collaborative, proactive approach. 

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 17, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Autogen
Ci/Cd
JavaScript
Kafka
Kubernetes
Langchain
Langgraph
MongoDB
NoSQL
Openai Functions
Python
Rag
Redis
SQL
Vector Databases
HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

An Hour Ago
In-Office or Remote
Austin, TX, USA
60K-72K Annually
Entry level
60K-72K Annually
Entry level
Fintech • Real Estate • PropTech
As a Sales Development Representative, you'll focus on outbound prospecting in Spanish and qualify inbound leads in English, generating opportunities for sales teams.
Top Skills: Sales Tools
An Hour Ago
Hybrid
Austin, TX, USA
Senior level
Senior level
Fintech • Software • Financial Services
The Senior Product Manager will define strategy and roadmap for client configuration tools, collaborate with teams for product development, and drive market insights and recommendations.
Top Skills: Bi ToolsCloud-Based SolutionsCustomer Configuration ToolsExcelPythonRSaaSSQL
An Hour Ago
Hybrid
Austin, TX, USA
Senior level
Senior level
Fintech • Software • Financial Services
Apex Fintech Solutions seeks a Senior Software Engineer to develop and maintain scalable backend systems for their multi-asset trading platform, requiring strong backend development skills and a willingness to learn C++/C#.
Top Skills: C#C++Ci/CdGitGoJavaLinuxPythonSocket ProgrammingSQL

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account