NVIDIA Logo

NVIDIA

Senior Deep Learning Compiler Engineer

Reposted 6 Days Ago
Be an Early Applicant
In-Office or Remote
3 Locations
152K-288K Annually
Senior level
In-Office or Remote
3 Locations
152K-288K Annually
Senior level
The role involves analyzing deep learning networks, developing optimization algorithms, and collaborating to enhance deep learning software performance. The engineer will define APIs and create compiler infrastructure for neural networks.
The summary above was generated by AI

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world

We are looking for a Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning Compiler (DLC) team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. large language models, generative AIs, recommendation systems, image classification, speech recognition, etc. Our DLC has been the backbone of NVIDIA inference engine, spanning across data centers, personal devices, automotive, and robotics. The compiler must deliver leading inference performance, fast build time, reduced memory footprints, and ease of use in the forms of both Ahead-of-Tine and Just-in-Time. Join the team building the DLC which will be used by the entire deep learning community.

What you'll be doing:

  • Analyzing deep learning networks and developing compiler optimization algorithms.

  • Collaborating with members of the deep learning software framework teams and the hardware architecture teams to accelerate the next generation of deep learning software.

  • Scope of these efforts includes defining public APIs, performance optimizations and analysis, crafting and implementing compiler infrastructure techniques for neural networks, and other general software engineering work.

What we need to see:

  • Bachelors, Masters or Ph.D. in Computer Science, Computer Engineering, related field or equivalent experience

  • 3+ years of relevant work or research experience in performance analysis and compiler optimizations.

  • Ability to work independently, define project goals and scope, and lead your own development efforts.

  • Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design.

  • Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team.

Ways to stand out from the crowd:

  • Proficient in CPU and/or GPU architecture. CUDA or OpenCL programming experience.

  • Experiences in systems with constrained resources, such as embedded platforms, small memory size, and cross compilation.

  • Experience with the following technologies: MLIR, XLA, TVM, LLVM, deep learning models and algorithms, and deep learning frameworks, such as PyTorch.

  • GPU kernel generation with high performance and fast build time.

  • A track record of success in mentoring junior engineers and interns is a bonus.

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most brilliant and hardworking people in the world working with us and our product lines are growing fast in some of the hottest state of the art fields such as Virtual Reality, Artificial Intelligence, Deep Learning and Autonomous Vehicles.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 218,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until January 18, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning

Top Skills

C/C++
Cuda
Llvm
Mlir
Opencl
Python
PyTorch
Tvm
Xla
HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

6 Hours Ago
In-Office or Remote
3 Locations
152K-242K Annually
Senior level
152K-242K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Develop and implement compiler optimization algorithms for deep learning (XLA/OpenXLA) to accelerate training and inference on NVIDIA GPUs. Design graph partitioning/tensor sharding for distributed workloads, perform performance tuning and code generation using MLIR/LLVM/Triton, collaborate with framework and hardware teams, and contribute to JAX and related libraries.
Top Skills: C,C++,Cuda,Opencl,Xla,Openxla,Tvm,Mlir,Llvm,Openai Triton,Jax,Pytorch,Tensorflow,Gpu,High-Performance Computing,Distributed Programming
37 Minutes Ago
Remote or Hybrid
United States
160K-210K Annually
Senior level
160K-210K Annually
Senior level
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills: Kubernetes,Aws,Python,Go,Datadog,Github Actions,Argocd,Gitops,Ci/Cd,Ai/Llm
40 Minutes Ago
Remote
2 Locations
81K-122K Annually
Mid level
81K-122K Annually
Mid level
Artificial Intelligence • Productivity • Software • Automation
Support procure-to-pay operations by managing vendor master data and onboarding, performing OFAC/TIN matching and 1099/1042 determinations, coding and reviewing AP invoices, handling virtual card transactions, organizing Zendesk tickets, aiding purchase requests, maintaining process documentation, and performing month-end reconciliations and automations.
Top Skills: Zip,Erp,Zendesk,Slack

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account