HyperFi Logo

HyperFi

Senior AI Engineer

Posted 7 Days Ago
Be an Early Applicant
In-Office
San Francisco, CA, USA
Senior level
In-Office
San Francisco, CA, USA
Senior level
The Senior AI Engineer will lead the AI program, focusing on model selection, fine-tuning strategies, and GPU architecture. Responsibilities include building LLM pipelines, designing prompt strategies, and collaborating cross-functionally.
The summary above was generated by AI
About HyperFi


We're building the kind of platform we always wanted to use: fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connects systems, abstracts messy workflows, and leaves room for smart automation. The surface is clean and simple. The interactions are seamless and intuitive. The machinery underneath is anything but. That’s where you come in.


We’re a well-networked founding team with strong execution roots and a clear roadmap. We’re backed, focused, and delivering fast.


We're looking for a Senior AI Engineer to own our AI program end-to-end. Not a prompt engineer. Not a data engineer. The person who owns how our models get selected, trained, tuned, routed, and evaluated — and who walks in with the confidence to define the architecture from the hardware up. You'll work directly with the CTO, lead our fine-tuning strategy (LoRA is going to be core for us), and decide how we get the most out of our GPU spend. This is a senior IC role in a flat org — no management required, but you'll be the technical anchor other engineers learn from.

💥 What You’ll Do
  • Own our fine-tuning strategy end-to-end — LoRA first, full fine-tunes where they earn it. What we tune, on what hardware, against what evals
  • Define the GPU architecture. We have working infrastructure (H100 for production, A100 for training); your call to confirm, reshape, or rebuild
  • Drive model selection and routing across Gemini, Anthropic, and OpenAI — the right model for the right job, with cost and latency in the equation
  • Build agentic LLM pipelines using LangChain, LangGraph, and LangSmith
  • Design and iterate on prompt strategies, with a focus on consistency and context
  • Construct retrieval-augmented generation (RAG) systems from scratch
  • Instrument evaluation metrics, telemetry, and feedback loops to guide model and prompt evolution
  • Work alongside product, frontend, and backend engineers to tightly integrate AI into user-facing flows
🧰 Tech Stack
  • Python (primary language for all ML, LLM, and orchestration work)
  • LangChain + LangGraph + LangSmith
  • Gemini / Anthropic / OpenAI model routing logic
  • H100 targets for production inference, A100 for training (open to your redesign)
  • Postgres, and custom orchestration via MCP
  • GitHub Actions, GCP

There’s enough here to move fast, but still plenty of room for your fingerprints.

💻 How We Build
  • Engineers come first: your time, focus, and judgment are respected
  • Deep work > chaos: fixed cycles & cooldowns protect focus and keep context switching low
  • Autonomy is the default: trusted builders who own outcomes, no babysitters
  • Ship daily, safely: merge early, integrate vertically, ship often, use feature flags, and keep momentum
  • Outcomes over optics: solve real problems, not ticket soup
  • Voice matters: from week one, contribute, improve something, and shape how we build
  • Senior peers, no ego: collaborate in a high-trust, async-friendly environment
  • Bold problems, cool tech: work on complex challenges that actually move the needle
  • Fun is part of it: we move fast, but we also celebrate wins and laugh together
✅ What We’re Looking For
  • 8+ years building production-grade ML, data, or AI systems
  • Hands-on experience training and fine-tuning models — LoRA, QLoRA, adapter methods, or full fine-tunes. Actual model work, not just prompt iteration
  • Confidence to define GPU architecture given a goal and a budget — hardware choices, training strategy, cost/performance tradeoffs
  • Strong grasp of prompt engineering, context construction, and retrieval design
  • Comfortable working in LangChain and building agents, not just chains
  • Strong Python: testable, maintainable, clearly structured
  • Understanding of model evaluation, observability, and feedback loops
  • Excited to push from prototype → production → iteration
  • Senior IC judgment: you scope your own work, push back when it's right, and make calls others can build on
  • Confident English skills to collaborate clearly and effectively with teammates
🔥 Bonus If You:
  • Have shipped a fine-tuned model into production and can walk us through the tradeoffs you made
  • Have built agent-like workflows with LangGraph or similar
  • Have worked on semantic chunking, vector search, or hybrid retrieval strategies
  • Can walk us through a real-world model or prompt failure — and how you fixed it
  • Have experience with PySpark, Databricks, or lakehouse architecture
  • Have contributed to OSS tools or internal AI platforms
  • Think of yourself as both an engineer and a systems designer
  • Have mentored other senior engineers and enjoyed it
📍 Location & Compensation
  • Must be based in San Francisco, Las Vegas, or Tel Aviv
  • Full-time role with competitive comp
  • Flexible hours, async-friendly culture, engineering-led environment

Top Skills

Anthropic
GCP
Gemini
Github Actions
Langchain
Langgraph
Langsmith
Openai
Postgres
Python
HQ

HyperFi San Francisco, California, USA Office

Market St, San Francisco, CA, United States, 94102

Similar Jobs

Yesterday
Easy Apply
Hybrid
Easy Apply
Senior level
Senior level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Lead the transition to a self-hosted LLM ecosystem, optimizing model inference, developing agentic workflows, and executing fine-tuning strategies for chatbot enhancement.
Top Skills: BashClaude CodeFastapiHugging FacePythonPyTorchVllm
Yesterday
Remote or Hybrid
Senior level
Senior level
Security
The Senior AI Engineer will develop full stack software components, lead design and code reviews, and improve system efficiency and stability.
Top Skills: Ci/CdGoNode.jsPython
2 Days Ago
Remote or Hybrid
Senior level
Senior level
Big Data • Information Technology
Design and develop core systems for a real-time bidding engine, optimize performance, and collaborate with data scientists in a microservice environment.
Top Skills: BigQueryGoMySQLPythonRedis

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account