quadric.io Logo

quadric.io

AI Kernel Engineer

Reposted 5 Days Ago
Be an Early Applicant
In-Office
Burlingame, CA
Senior level
In-Office
Burlingame, CA
Senior level
The AI Kernel Engineer will develop and optimize AI kernels for Quadric's platform, enhancing performance across various hardware configurations and providing technical support.
The summary above was generated by AI

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.
Role

The AI Kernel Engineer in Quadric plays the key role to enable a large number of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer at Quadric will [1] develop a highly efficient Quadric kernel library for a variety of AI/LLM models; [2] analyze the performance and optimize the kernel for different hardware configurations; This senior technical role demands deep knowledge of hardware architecture, compiler toolchain and optimization techniques.

Our preference is for a candidate located in the California Bay Area who can regularly collaborate from our Burlingame office. This role follows a hybrid schedule with at least two in-office days per week expected, but actual schedule may adjust depending on team and business need. We believe strong technical collaboration, rapid iteration, and shared problem-solving are well supported by working together in person. The team and company also gather periodically for onsite meetings and offsite events to connect, collaborate, and align on priorities.

Responsibilities

  • Develop AI/LLM kernels/operators on Quadric platform for efficient inference
  • Optimize the kernel performance for different hardware configurations and workloads
  • Profile and analyze kernel performance in terms of compute, data and parallelism; identify micro-architecture and software bottlenecks and provide optimization solutions
  • Optimize kernel C/C++ codes, maximize hardware utilization
  • Collaborate across related areas of the AI inference stack to support team and business priorities
  • Make Improvement to Quadric toolchain, compiler and runtime
  • Provide technical support and documents to customers and developer community

Requirements
  • Bachelor’s or Master’s in Computer Science and/or Electric Engineering
  • 5+ years of experience in AI kernel development and optimization
  • experience with model and kernel inference performance profiling
  • experience with at least one of the following compute development: CUDA, DSP, NEON, Triton-lang
  • Proficiency in C/C++ and Python, experience with assembly language a plus
  • Demonstrate good capability in problem solving, debug and communication

Benefits

At Quadric, we value Integrity, Humility, and Happiness. What we expect from one another is simple and clear: Initiative, Collaboration, and Completion. We are a collaborative team focused on building something extraordinary in the edge computing space. 

  • Competitive salary and meaningful equity
  • Medical, dental, and vision plan options starting on day one
  • 401(k) retirement plan
  • Flexible paid time off (unlimited, non-accrual) to support work-life balance
  • When working in-office, enjoy company-provided lunches and a stocked kitchen
  • Convenient office location within walking distance of the Caltrain station
  • Support for commuting, including monthly parking or Caltrain passes
  • Downtown Burlingame office location, close to shops, cafes, and local amenities
  • A politics-free, highly collaborative environment where talented people can do their best work and make an immediate impact
  • The opportunity to build long-term career relationships in a company that values strong personal connections alongside professional excellence

Founded in 2016 and based in downtown Burlingame, California, Quadric is building the world’s first supercomputer designed for the real-time needs of edge devices. Quadric aims to empower developers in every industry with superpowers to create tomorrow’s technology, today. The company was co-founded by technologists from MIT and Carnegie Mellon, who were previously the technical co-founders of the Bitcoin computing company 21.

Quadric is proud to be an equal opportunity employer. We are committed to creating an inclusive environment where people from all backgrounds can do their best work. We consider all qualified applicants without regard to race, color, religion, sex, gender identity or expression, sexual orientation, national origin, age, disability, veteran status, or any other protected characteristic under applicable law.

If this role resonates with you, we encourage you to apply even if your experience does not perfectly match every qualification. We value potential, curiosity, and a willingness to learn just as much as direct experience. Skills and growth come in many forms, and we would love to hear your story.

Top Skills

C++
Cuda
Dsp
Neon
Python
Triton-Lang
HQ

quadric.io Burlingame, California, USA Office

Burlingame, CA, United States, 94010

Similar Jobs

7 Hours Ago
In-Office or Remote
Santa Clara, CA, USA
184K-288K Annually
Senior level
184K-288K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Develop AI systems for efficient inference, design and optimize kernels, and build domain-specific compilers and runtimes while collaborating with engineers.
Top Skills: Apache TvmCuda C/C++CutileFlashinferJaxMlirOnnxPyTorchSglangTensorFlowTritonVllm
7 Hours Ago
In-Office or Remote
Santa Clara, CA, USA
184K-288K Annually
Senior level
184K-288K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The role involves developing AI systems software, implementing and optimizing kernels for AI workloads, and collaborating on deep learning frameworks and libraries.
Top Skills: Apache TvmC++CudaFlashinferJaxMlirOnnxPythonPyTorchSglangTensorFlowVllm
3 Hours Ago
Easy Apply
Remote or Hybrid
USA
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Machine Learning • Software • Big Data Analytics • Automation
Lead transformational enablement programs across global sales and customer success, focusing on technical certification, AI innovation, and measurable business impact.
Top Skills: Ai ToolsChatgptGenerative Ai PlatformsHighspotLms TechnologiesSalesforceSeismicSynthesia

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account