Persimmons, Inc. Logo

Persimmons, Inc.

Compiler Engineer (Mid and/or Backend)

Posted 7 Days Ago
Be an Early Applicant
In-Office
San Jose, CA
Senior level
In-Office
San Jose, CA
Senior level
The role involves optimizing mid- and backend compiler techniques for custom accelerator hardware, transforming MLIR-based languages, and implementing efficient compilation workflows for AI systems.
The summary above was generated by AI

Who we are:

Persimmons is building the infrastructure that will power the next decade of AI. Founded in 2023 by veteran technologists from the worlds of semiconductors, AI systems, and software innovation, We’re on a mission to enable smarter devices, more sustainable data centers, and entirely new applications the world hasn’t imagined yet.

Why join us:

We’re growing fast and looking for bold thinkers, builders, and curious problem-solvers who want to push the limits of AI hardware and software. If you're ready to join a world-class team and play a critical role in making a global impact - we want to talk to you.

Summary of Role:

This role focuses on transforming higher-level MLIR-based large language models by applying sophisticated mid- and backend compiler techniques to target Persimmons.ai's custom accelerator hardware. You will help design and optimize the Persimmons Compiler mid- and backend, integrate it with custom operations and kernels, as well as implement compiler passes that convert higher-level intermediate representations into runtime-oriented code and libraries. This position offers the opportunity to directly shape Persimmons.ai’s innovative AI hardware and software stack through close collaboration with teams across hardware, systems, and software.

What you’ll do:

  • Develop and enhance MLIR-based compiler pipelines targeting Persimmons' custom spatial accelerator hardware.
  • Design and optimize the Persimmons Compiler mid- and backend techniques for efficient lowering, graph-to-resources mapping, and code generation.
  • Implement transformations to convert Python, PyTorch, and similar kernel representations to LLVM IR and runtime-ready libraries.
  • Architect and implement efficient support for SPMD-based, distributed collective operations and lower them through specialized MLIR compiler dialects (e.g., MESH, SHARDY).
  • Drive advanced loop optimizations leveraging polyhedral analysis: loop tiling, fusion, interchange, skewing, and related techniques.
  • Apply and optimize techniques such as bufferization, padding, inlining, and integration of custom operations and kernels within the compilation workflow.
  • Work on register allocation and instruction scheduling for Persimmons’ spatial hardware, ensuring high resource utilization, throughput, and low latency.
  • Contribute to graph and tensor partitioning logic for optimal hardware-targeted execution.
  • Collaborate across teams to deliver performant compilation flows from high-level ML representations to low-level executable artifacts.

Requirements

What You Bring To The Table:

  • We do not expect candidates to meet all of the requirements listed below; strong candidates will demonstrate expertise in several key areas.
  • Solid understanding and experience with underlying principles and methods of the MLIR framework (SSA representation, interfaces, rewriting, dialect hierarchy, etc.).
  • Hands-on experience with developing MLIR-based compiler infrastructure, algorithms, and techniques for non-GPU/custom spatial hardware architectures.
  • Working experience with lowering SIMD operations from PyTorch, Triton, xDSL, pyDSL, or similar Python-based frontends toward LLVM IR and, further, to SIMD kernel library.
  • Extensive experience and understanding of loop optimization based on polyhedral principles.
  • Experience and understanding of SPMD-based, distributed collective operations, specialized MLIR compiler dialects (e.g., MESH, SHARDY), and collective operation lowering in compilers for spatial hardware.
  • Experience with techniques such as padding, bufferization, inlining, and other lowering techniques.
  • Knowledge of register allocation and instruction scheduling in spatial architectures.
  • Experience in lowering and integration of custom operations and kernels at the compiler mid- and backend.
  • Familiarity with graph and tensor partitioning and mapping optimization algorithms and their integration in the compiler workflow.
  • High level of understanding and 5+ years of experience with C++ and appreciation for writing clean and maintainable code. Good knowledge of Python is a big plus.

Benefits
  • Competitive salary and benefits package.
  • Flexible PTO
  • 401k

Please note: Our organization does not accept unsolicited candidate submissions from external recruiters or agencies. Any such submissions, regardless of form (including but not limited to email, direct messaging, or social media), shall be deemed voluntary and shall not create any express or implied obligation on the part of the organization to pay any fees, commissions, or other compensation. Direct contact of employees, officers, or board members regarding employment opportunities is strictly prohibited and will not receive a response.

Top Skills

C++
Llvm
Mlir
Python
PyTorch
Triton
Xdsl
HQ

Persimmons, Inc. San Jose, California, USA Office

San Jose, California, United States, 95054

Similar Jobs

38 Minutes Ago
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
225K-275K Annually
Senior level
225K-275K Annually
Senior level
Fintech • HR Tech
Lead and innovate in defining Gusto's frontend experience, focusing on design systems, component development, and accessibility while leveraging AI for improvements.
Top Skills: AICSSHTMLJavaScriptReactSassTypescript
43 Minutes Ago
In-Office
Santa Ana, CA, USA
72K-118K Annually
Junior
72K-118K Annually
Junior
Consumer Web • eCommerce • Machine Learning • Software • Sports • Analytics
The Data Analyst will analyze data to optimize internal processes, create reports, and support strategic insights for the PSA Grading and Brand Protection teams.
Top Skills: LightdashLookerModeSQLTableau
2 Hours Ago
In-Office
San Francisco, CA, USA
130K-145K Annually
Mid level
130K-145K Annually
Mid level
Software
The Growth Marketing Manager will oversee paid media strategies, optimize website conversions, manage HubSpot workflows, and develop audience targeting strategies for effective inbound marketing.
Top Skills: Google AdsHubspotLinkedin Campaign ManagerWebflow

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account