NVIDIA Logo

NVIDIA

Senior Deep Learning Compiler Engineer - XLA

Posted 7 Hours Ago
Be an Early Applicant
In-Office or Remote
3 Locations
152K-242K Annually
Senior level
In-Office or Remote
3 Locations
152K-242K Annually
Senior level
Develop and implement compiler optimization algorithms for deep learning (XLA/OpenXLA) to accelerate training and inference on NVIDIA GPUs. Design graph partitioning/tensor sharding for distributed workloads, perform performance tuning and code generation using MLIR/LLVM/Triton, collaborate with framework and hardware teams, and contribute to JAX and related libraries.
The summary above was generated by AI

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”.

We are looking for versatile software engineers for our XLA team. NVIDIA is at the center for the AI revolution that's transforming how people live, work, and interact with technology. Come join us to build high-performance, production-grade software that's at the core of next-generation AI systems.

What you will be doing:

In this role, develop compiler optimization algorithms for deep learning workloads. You will optimize inference and training performance for the JAX framework and the OpenXLA compiler on NVIDIA GPUs at scale. You’ll collaborate with our partners in deep learning framework teams and our hardware architecture teams to accelerate the next generation of deep learning software. The scope of these efforts include:

  • Crafting and implementing compiler optimization techniques for deep learning network graphs.

  • Designing novel graph partitioning and tensor sharding techniques for distributed training and inference.

  • Performance tuning and analysis.

  • Code-generation for NVIDIA GPU backends using open-source compilers such as MLIR, LLVM and OpenAI Triton.

  • Designing user facing features in JAX and related libraries and other general software engineering work.

  • Working closely with GPU hardware engineering teams to design AI compiler software features for next-generation GPUs.

What we need to see:

  • Bachelors, Masters or Ph.D. in Computer Science, Computer Engineering, related field (or equivalent experience).

  • 4+ years of relevant work or research experience in performance analysis and compiler optimizations.

  • Ability to work independently, define project goals and scope, and lead your own development effort adopting clean software engineering and testing practices.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong foundation in architecture of CPU, GPUs or other high performance hardware accelerators. Knowledge of high-performance computing and distributed programming.

  • CUDA or OpenCL programming experience is desired but not required.

  • Experience with the following technologies is a huge plus: XLA, TVM, MLIR, LLVM, OpenAI Triton, deep learning models and algorithms, and deep learning framework design.

  • Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team. A history of mentoring junior engineers and interns is a bonus.

Ways to stand out from the crowd:

  • Experience working deep learning frameworks such as JAX, PyTorch or TensorFlow.

  • Extensive experience with CUDA or with GPUs in general.

  • Experience with open-source compilers such as XLA, LLVM, MLIR or TVM.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 1, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning

Top Skills

C,C++,Cuda,Opencl,Xla,Openxla,Tvm,Mlir,Llvm,Openai Triton,Jax,Pytorch,Tensorflow,Gpu,High-Performance Computing,Distributed Programming
HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

An Hour Ago
In-Office or Remote
Pearland, TX, USA
92K-164K Annually
Senior level
92K-164K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead financial and utilization analyses to identify revenue and cost savings for healthcare service lines. Produce reconciliations, reports, dashboards, and models; validate multi-source data; automate reporting; present findings to executives; and coach junior analysts.
Top Skills: Sql,Visual Basic,Excel Macros,Microsoft Excel,Microsoft Powerpoint,Microsoft Word,Sas,R,Minitab,Spss,Crystal Reports,Microsoft Access,Ssrs,Epic,Clarity
An Hour Ago
Remote or Hybrid
2 Locations
109K-184K Annually
Senior level
109K-184K Annually
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Senior overlay Account Executive focused on selling SailPoint's Agentic Technology solutions (machine, data, and agent identity security). Drive adoption in enterprise accounts by engaging IT and cybersecurity stakeholders, supporting direct sellers, building enablement programs, managing full sales cycle, and exceeding quota while collaborating with partners and internal teams.
Top Skills: Agent Identity SecurityAgentic TechnologiesCloud Data PlatformsData Access SecurityIaasIdentity IntelligenceIdentity SecurityMachine Identity SecuritySailpointSalesforce
An Hour Ago
Remote or Hybrid
United States
223K-414K Annually
Expert/Leader
223K-414K Annually
Expert/Leader
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The VP of Engineering for the Enterprise Platform will lead engineering teams, define platform architecture, and drive innovation for identity security solutions at SailPoint.
Top Skills: AICloud-Native ArchitectureEvent-Driven SystemsGraph DatabasesGraphQLIdentity/Security PlatformsMicroservicesMl

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account