HyperFi

Senior AI Engineer

Posted 7 Days Ago

Be an Early Applicant

In-Office

San Francisco, CA, USA

Senior level

In-Office

San Francisco, CA, USA

Senior level

The Senior AI Engineer will lead the AI program, focusing on model selection, fine-tuning strategies, and GPU architecture. Responsibilities include building LLM pipelines, designing prompt strategies, and collaborating cross-functionally.

The summary above was generated by AI

About HyperFi

We're building the kind of platform we always wanted to use: fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connects systems, abstracts messy workflows, and leaves room for smart automation. The surface is clean and simple. The interactions are seamless and intuitive. The machinery underneath is anything but. That’s where you come in.

We’re a well-networked founding team with strong execution roots and a clear roadmap. We’re backed, focused, and delivering fast.

We're looking for a Senior AI Engineer to own our AI program end-to-end. Not a prompt engineer. Not a data engineer. The person who owns how our models get selected, trained, tuned, routed, and evaluated — and who walks in with the confidence to define the architecture from the hardware up. You'll work directly with the CTO, lead our fine-tuning strategy (LoRA is going to be core for us), and decide how we get the most out of our GPU spend. This is a senior IC role in a flat org — no management required, but you'll be the technical anchor other engineers learn from.

💥 What You’ll Do

Own our fine-tuning strategy end-to-end — LoRA first, full fine-tunes where they earn it. What we tune, on what hardware, against what evals
Define the GPU architecture. We have working infrastructure (H100 for production, A100 for training); your call to confirm, reshape, or rebuild
Drive model selection and routing across Gemini, Anthropic, and OpenAI — the right model for the right job, with cost and latency in the equation
Build agentic LLM pipelines using LangChain, LangGraph, and LangSmith
Design and iterate on prompt strategies, with a focus on consistency and context
Construct retrieval-augmented generation (RAG) systems from scratch
Instrument evaluation metrics, telemetry, and feedback loops to guide model and prompt evolution
Work alongside product, frontend, and backend engineers to tightly integrate AI into user-facing flows

🧰 Tech Stack

Python (primary language for all ML, LLM, and orchestration work)
LangChain + LangGraph + LangSmith
Gemini / Anthropic / OpenAI model routing logic
H100 targets for production inference, A100 for training (open to your redesign)
Postgres, and custom orchestration via MCP
GitHub Actions, GCP

There’s enough here to move fast, but still plenty of room for your fingerprints.

💻 How We Build

Engineers come first: your time, focus, and judgment are respected
Deep work > chaos: fixed cycles & cooldowns protect focus and keep context switching low
Autonomy is the default: trusted builders who own outcomes, no babysitters
Ship daily, safely: merge early, integrate vertically, ship often, use feature flags, and keep momentum
Outcomes over optics: solve real problems, not ticket soup
Voice matters: from week one, contribute, improve something, and shape how we build
Senior peers, no ego: collaborate in a high-trust, async-friendly environment
Bold problems, cool tech: work on complex challenges that actually move the needle
Fun is part of it: we move fast, but we also celebrate wins and laugh together

✅ What We’re Looking For

8+ years building production-grade ML, data, or AI systems
Hands-on experience training and fine-tuning models — LoRA, QLoRA, adapter methods, or full fine-tunes. Actual model work, not just prompt iteration
Confidence to define GPU architecture given a goal and a budget — hardware choices, training strategy, cost/performance tradeoffs
Strong grasp of prompt engineering, context construction, and retrieval design
Comfortable working in LangChain and building agents, not just chains
Strong Python: testable, maintainable, clearly structured
Understanding of model evaluation, observability, and feedback loops
Excited to push from prototype → production → iteration
Senior IC judgment: you scope your own work, push back when it's right, and make calls others can build on
Confident English skills to collaborate clearly and effectively with teammates

🔥 Bonus If You:

Have shipped a fine-tuned model into production and can walk us through the tradeoffs you made
Have built agent-like workflows with LangGraph or similar
Have worked on semantic chunking, vector search, or hybrid retrieval strategies
Can walk us through a real-world model or prompt failure — and how you fixed it
Have experience with PySpark, Databricks, or lakehouse architecture
Have contributed to OSS tools or internal AI platforms
Think of yourself as both an engineer and a systems designer
Have mentored other senior engineers and enjoyed it

📍 Location & Compensation

Must be based in San Francisco, Las Vegas, or Tel Aviv
Full-time role with competitive comp
Flexible hours, async-friendly culture, engineering-led environment

Top Skills

Anthropic

GCP

Gemini

Github Actions

Langchain

Langgraph

Langsmith

Openai

Postgres

Python

Market St, San Francisco, CA, United States, 94102

Similar Jobs

Navan

Senior Machine Learning Engineer

Yesterday

Easy Apply

Hybrid

Easy Apply

Senior level

Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation

Lead the transition to a self-hosted LLM ecosystem, optimizing model inference, developing agentic workflows, and executing fine-tuning strategies for chatbot enhancement.

Top Skills: BashClaude CodeFastapiHugging FacePythonPyTorchVllm

Seal Security

Artificial Intelligence Engineer

Yesterday

Remote or Hybrid

Senior level

Security

The Senior AI Engineer will develop full stack software components, lead design and code reviews, and improve system efficiency and stability.

Top Skills: Ci/CdGoNode.jsPython

Arpeely

Back-end Engineer

2 Days Ago

Remote or Hybrid

Senior level

Big Data • Information Technology

Design and develop core systems for a real-time bidding engine, optimize performance, and collaborate with data scientists in a microservice environment.

Top Skills: BigQueryGoMySQLPythonRedis

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

HyperFi

Senior AI Engineer

Top Skills

HyperFi San Francisco, California, USA Office

Similar Jobs

Senior Machine Learning Engineer

Artificial Intelligence Engineer

Back-end Engineer

What you need to know about the San Francisco Tech Scene

Key Facts About San Francisco Tech