NIO Logo

NIO

LLM Algorithmic Optimization Engineer

Reposted 11 Days Ago
Be an Early Applicant
In-Office
San Jose, CA
143K-186K Annually
Mid level
In-Office
San Jose, CA
143K-186K Annually
Mid level
Optimize Large Language Models and multimodal models for efficient inference and deployment on heterogeneous hardware. Collaborate on integration into automotive applications.
The summary above was generated by AI

JOB DESCRIPTION

About NIO

NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO’s mission is to shape a joyful lifestyle. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.

NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.

NIO’s product portfolio consists of the ES8, a six-seater smart electric flagship SUV, the ES7 (or the EL7), a mid-large five-seater smart electric SUV, the ES6, a five-seater all-round smart electric SUV, the EC7, a five-seater smart electric flagship coupe SUV, the EC6, a five-seater smart electric coupe SUV, the ET7, a smart electric flagship sedan, and the ET5, a mid-size smart electric sedan.

About NIO

NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO’s mission is to shape a joyful lifestyle. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.

NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.

NIO’s product portfolio consists of the ES8, a six-seater smart electric flagship SUV, the ES7 (or the EL7), a mid-large five-seater smart electric SUV, the ES6, a five-seater all-round smart electric SUV, the EC7, a five-seater smart electric flagship coupe SUV, the EC6, a five-seater smart electric coupe SUV, the ET7, a smart electric flagship sedan, and the ET5, a mid-size smart electric sedan.

Roles and Responsibilities:

  • Conduct research and apply cutting-edge technologies to optimize Large Language Models (LLMs) and multimodal models, exploration and implementation of the core algorithmic optimization on heterogeneous architectures, for highly efficient LLM inference as well as deployment across distributed and heterogeneous hardware environments.
  • Focus on model optimization from a systems perspective, ensuring efficient deployment in the vehicle’s digital cockpit and advanced driving (AD) domain.
  • Collaborate with cross-functional teams to ensure the integration of optimized models into real-world automotive applications.
  • Contribute to the entire pipeline from research, development, and testing, through to deployment on hardware, including GPUs and other distributed systems.
Qualifications:
  • Currently pursuing or completed a PhD or Master’s degree in Computer Science, Computer Engineering, Applied Mathematics, Communications, Electronics, or a related field with relevant research projects and publications.
  • Strong understanding of GPU/NPU architecture and optimization techniques to identify and address bottlenecks.
  • Proficient in LLM and VLM architectures and algorithms, familiar with transformer based NLP / Audio / CV algorithms and technologies.
  • Proficiency in Python and experience with AI-related training and inference tools such as PyTorch.
  • Proficiency in C/C++ programming, familiar with at least one commonly used LLM inference engines.
  • Hands-on experience with model-serving frameworks such as Open Neural Network Exchange (ONNX).
  • Familiarity with debugging code in distributed computing environments.Experience in LLM inference optimization on resource constrained edge devices is a plus.
Preferred Qualification:
  • Ph.D. in computer science, artificial intelligence, or related fields; or Masters degree + 3 years of relevant industry experience
  • Experience in inference optimization techniques of deep learning models or libraries on hardware architectures;
  • Familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application framework
  • Those who have good publication records and have published high impact, innovative papers are preferred

Compensation:

The US base salary range for this full-time position is $143,200.00 - $186,000.00.
  • Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

  • Please note that the compensation details listed in US role postings reflect the base salary only. It does not include discretionary bonus, equity, or benefits.

Benefits:

Along with competitive pay, as a full-time NIO employee, you are eligible for the following benefits on the first day you join NIO:

  • CIGNA EPO, HSA, and Kaiser HMO medical plans with $0 for Employee Only Coverage.  

  • Dental (including orthodontic coverage) and vision plan.  Both provide options with a $0 paycheck contribution covering you and your eligible dependents.

  • Company Paid HSA (Health Savings Account) Contribution when enrolled in the High Deductible CIGNA medical plan

  • Healthcare and Dependent Care Flexible Spending Accounts (FSA)

  • 401(k) with Brokerage Link option

  • Company paid Basic Life, AD&D, short-term and long-term disability insurance

  • Employee Assistance Program

  • Sick and Vacation time

  • 13 Paid Holidays a year

  • Paid Parental Leave for first 8 weeks at full pay (eligible after 90 days of employment with NIO)

  • Paid Disability Leave for first 6 weeks at full pay (eligible after 90 days of employment with NIO)

  • Voluntary benefits including: Voluntary Life and AD&D options for you, your spouse/domestic partner and dependent child(ren), pet insurance

  • Commuter benefits

  • Mobile Cell Phone Credit

  • Healthjoy mobile benefit app supporting you and your dependents with benefit questions on the go & support with benefit billing questions

  • Free lunch and snacks

  • Onsite gym

  • Employee discounts and perks program

Top Skills

C/C++
Gpu
Npu
Onnx
Python
PyTorch

NIO San Jose, California, USA Office

3200 North 1st Street, San Jose, CA, United States, 95134

Similar Jobs

11 Days Ago
In-Office
San Jose, CA, USA
38-46 Hourly
Internship
38-46 Hourly
Internship
Automotive
Optimize Large Language Models for vehicle applications, focusing on algorithmic efficiency and deployment on distributed systems.
Top Skills: C/C++GpuLlmNpuOnnxPythonPyTorchVlm
27 Minutes Ago
In-Office
Costa Mesa, CA, USA
220K-292K Annually
Senior level
220K-292K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Research Engineer in Machine Learning at Anduril, you will optimize ML algorithms for edge devices, prototype LLM-based systems, and benchmark models while collaborating across business lines to identify new research problems.
Top Skills: Deep Learning ModelsMl AlgorithmsPythonPyTorchTransformer Architectures
28 Minutes Ago
In-Office
4 Locations
165K-242K Annually
Senior level
165K-242K Annually
Senior level
Cloud • Information Technology • Machine Learning
The IT SOX Director leads the company's IT SOX compliance program, focusing on IT General Controls and application controls, ensuring compliance and collaborating with various teams.
Top Skills: CoupaGitIt General Controls (Itgcs)NetSuiteSalesforceWorkday

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account