HyperFi Logo

HyperFi

AI Systems & Data Engineer

Posted 2 Days Ago
In-Office
2 Locations
200K-250K Annually
Senior level
In-Office
2 Locations
200K-250K Annually
Senior level
Design and operate Databricks pipelines in Python for ingesting and normalizing unstructured data, build AI systems, enforce data quality, and optimize PySpark jobs.
The summary above was generated by AI
About HyperFi


We're building the kind of platform we always wanted to use: fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connects systems, abstracts messy workflows, and leaves room for smart automation. The surface is clean and simple. The interactions are seamless and intuitive. The machinery underneath is anything but. That’s where you come in.


We’re a well-networked founding team with strong execution roots and a clear roadmap. We’re backed, focused, and delivering fast.


We are seeking an AI Systems & Data Engineer to join our team. We are building a fast, flexible, and complex platform with a robust, event-driven architecture. This role requires expertise in building data pipelines within the Databricks environment, specifically for ingesting unstructured data, and leveraging that data to build AI agents.

💥 What You’ll Do
  • Design and operate Databricks pipelines in Python to ingest and normalize large-scale unstructured data
  • Build streaming and batch ingestion using Auto Loader, Delta Live Tables, and Workflows
  • Model and maintain AI-ready lakehouse tables with Delta Lake and Unity Catalog
  • Prepare retrieval and context datasets for RAG and agent systems
  • Orchestrate Temporal-based workflows to coordinate data prep, validation, and AI handoff
  • Enforce data quality, lineage, and access controls across pipelines
  • Optimize PySpark jobs for performance, reliability, and cost
  • Integrate pipeline outputs into production AI systems and APIs
  • Monitor freshness, schema drift, and pipeline health
🧰 Tech Stack (So Far)
  • Python (primary language for all LLM + orchestration work)
  • LangChain + LangGraph + LangSmith
  • Databricks + PySpark for processing, labeling, and training context
  • Gemini + model routing logic
  • Postgres, and custom orchestration via MCP
  • GitHub Actions, GCP

You’ll be a crucial member of rolling out products that will have immediate impact.

💻 How We Build
  • Engineers come first: your time, focus, and judgment are respected
  • Deep work > chaos: fixed cycles & cooldowns protect focus and keep context switching low
  • Autonomy is the default: trusted builders who own outcomes, no babysitters
  • Ship daily, safely: merge early, integrate vertically, ship often, use feature flags, and keep momentum
  • Outcomes over optics: solve real problems, not ticket soup
  • Voice matters: from week one, contribute, improve something, and shape how we build
  • Senior peers, no ego: collaborate in a high-trust, async-friendly environment
  • Bold problems, cool tech: work on complex challenges that actually move the needle
  • Fun is part of it: we move fast, but we also celebrate wins and laugh together
✅ What We’re Looking For
  • 5-7 years of experience building production-grade ML, data, or AI systems.
  • Strong grasp of prompt engineering, context construction, and retrieval design.
  • Comfortable working in LangChain and building agents.
  • Experience with PySpark and Databricks to handle real-world data scale.
  • Ability to write testable, maintainable Python with clear structure.
  • Understanding of model evaluation, observability, and feedback loops.
  • Excited to push from prototype → production → iteration.
  • Familiarity with Databricks Data Intelligence Platform which unifies data warehousing and AI use cases on a single platform.
  • Knowledge of Unity Catalog for open and unified governance of data, analytics, and AI on the lakehouse.
  • Understanding of data security concerns related to AI and how to mitigate them using the Databricks AI Security Framework (DASF).
  • Confident English skills to collaborate clearly and effectively with teammates
🔥 Bonus If You:
  • Have built scalable agent-like workflows on the Databricks platform.
  • Have worked on semantic chunking, vector search, or hybrid retrieval strategies.
  • Can walk us through a real-world prompt failure and how you fixed it.
  • Have contributed to OSS tools or internal AI platforms.
  • Think of yourself as both an engineer and a systems designer.
  • Are familiar with the concept of a data lakehouse architecture.
📍 Location & Compensation
  • Must be based in San Francisco, Las Vegas, or Tel Aviv
  • Full-time role with competitive comp
  • Flexible hours, async-friendly culture, engineering-led environment

Top Skills

Databricks
GCP
Github Actions
Langchain
Langgraph
Langsmith
Postgres
Pyspark
Python
HQ

HyperFi San Francisco, California, USA Office

Market St, San Francisco, CA, United States, 94102

Similar Jobs

An Hour Ago
Hybrid
Tel Aviv, ISR
Junior
Junior
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Develop high-performance iOS apps, collaborating with design and tech teams, while ensuring stability and implementing optimal software solutions. Role involves code reviews and mentoring.
Top Skills: Core AnimationCore DataFoundationMetalObjective-COpenglSwiftUikit
An Hour Ago
Hybrid
Tel Aviv, ISR
6-6 Annually
Senior level
6-6 Annually
Senior level
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Develop and maintain high performance iOS applications, evaluate technical tradeoffs, and collaborate with cross-functional teams to enhance user experience.
Top Skills: C/C++Core AnimationCore DataFoundationMetalObjective-COpenglReact NativeSwiftUikit
An Hour Ago
Hybrid
Tel Aviv, ISR
Mid level
Mid level
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Join Snap Inc. as an Android Engineer to develop user-facing applications utilizing Kotlin and Java, optimizing performance and collaborating with teams.
Top Skills: C/C++DaggerJavaKotlinOpenglReact NativeRxjava

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account