NVIDIA Logo

NVIDIA

Senior Computer Vision, VLM Performance Engineer

Reposted 13 Hours Ago
Be an Early Applicant
In-Office or Remote
3 Locations
184K-357K Annually
Senior level
In-Office or Remote
3 Locations
184K-357K Annually
Senior level
Develop and optimize AI computer vision pipelines focusing on Vision Language Models. Collaborate on tools and improve model efficiency.
The summary above was generated by AI

NVIDIA is a world-leader in artificial intelligence and computer vision. Our team builds hardware-accelerated computer vision pipelines, cloud services and SDKs bringing the latest AI innovations to data centers, gaming rigs, cars, robots, buildings, medical devices, and more. We are looking for an engineering expert to help us productize and optimize the latest Vision Language Models (VLMs) and their pipelines. Together, we will democratize the use of these amazing models, unlocking all sorts of innovative applications the world is barely dreaming of.

What you'll be doing:

  • Develop, profile and optimize inference pipelines for VLMs and other AI CV models: improve throughput and latency, data loading, pre- and post-processing.

  • Improve the efficiency of VLM models themselves: kernel optimization in CUDA

  • Upstream improvements to SDKs and libraries across NVIDIA and beyond to deliver accelerated computer vision at scale.

  • Promote high-performance AI computer vision across NVIDIA teams and functions (Engineering, Product Management, Marketing, and more).

What we need to see:

  • Master's of Science in Computer Science or Electrical engineering or equivalent experience.

  • 8 years practical experience or equivalent

  • Expertise in AI computer vision (VLMs, Vision Transformers, Diffusion models). Proven track record using its software ecosystem (PyTorch, HuggingFace, vLLM) to develop and release production-grade software.

  • Excellent software engineering fundamentals (source control, CI/CD, testing/validation, packaging, containerization, release).

  • Proficiency with Python, C++ and CUDA (kernel optimization)

  • Experience developing cloud applications (REST APIs, gRPC).

  • Excellent written, visual, and verbal communication to present performance challenges, tradeoffs, and architectural alternatives.

  • Curiosity and drive to learn new technologies and partner across teams and functions.

Ways to Stand Out from the Crowd:

  • Expertise in classical, non-ML computer vision

  • Strong fundamentals with system-level performance: multi-threaded, multi-process and distributed software development.

  • Grounding in mathematical fundamentals such as linear algebra, numerical methods, statistics, and exploratory data analysis.

  • History of creativity and innovation around performance in multiple problem domains.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until October 28, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C++
Cuda
Huggingface
Python
PyTorch
Vllm
HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

13 Minutes Ago
Remote or Hybrid
2 Locations
205K-257K Annually
Senior level
205K-257K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
The role involves leading technology projects, optimizing distributed systems, collaborating on cloud-based solutions, and mentoring others while leveraging various technologies to enhance services.
Top Skills: AWSCassandraDockerGoKafkaNode.jsOpensearchPostgresPython
13 Minutes Ago
Remote or Hybrid
3 Locations
99K-136K Annually
Junior
99K-136K Annually
Junior
Fintech • Machine Learning • Payments • Software • Financial Services
As a Senior Business Analyst on the AI/ML team at Velocity Black, you will analyze data, support model governance, and collaborate with cross-functional teams to enhance product offerings through AI and ML tools.
Top Skills: AIMlSQL
14 Minutes Ago
Remote or Hybrid
2 Locations
144K-165K Annually
Senior level
144K-165K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
Collaborate on Agile teams to develop cloud-based solutions using various programming languages and technologies, mentoring others and staying current on tech trends.
Top Skills: AWSAzureDockerGCPGoJavaKubernetesNode.jsNoSQLOpen Source RdbmsPythonScalaSQL

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account