The Role
We're looking for someone who loves optimizing model inference to join us in building the core of ComfyUI - the most complex and bleeding-edge part of our engine. You'll be working on making AI models run faster and more efficiently than anyone thought possible.
You are a good fit if this describes you:
You geek out about model inference, torch optimizations, and memory management
You've written production PyTorch code that pushes performance boundaries
You love diving deep into how models actually work under the hood
You get excited about making insanely optimized code that just works
You think the current state of ML deployment could be way better
What you'll do:
Build and optimize the core inference engine that powers ComfyUI
Make massive models run faster and use less memory than anyone else
Work directly with our core team on architecting new features
Tackle the hardest technical problems in the visual AI space
Help shape where we take this technology next
Bonus: If you've worked with diffusion/LLM models before or built custom nodes for ComfyUI, that's awesome
ComfyUI San Francisco, California, USA Office
San Francisco, CA, United States
Similar Jobs
What you need to know about the San Francisco Tech Scene
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine


