Founding AI/ML Engineer

Babs AI

Posted One Month Ago

In-Office

San Francisco, CA, USA

150K-225K Annually

Mid level

Design and build the intelligence layer: retrieval and memory systems, RAG pipelines, feedback and evaluation loops, RL and fine-tuning experiments, LLM integration, and observability for AI performance. Collaborate with platform and product engineers to productize AI features.

The summary above was generated by AI

About Babs

Babs is building the smart operating system for modern families. Most AI tools are designed for work life, but managing home life is its own full-time job filled with coordination, communication, and constant context switching. Babs acts as a second brain that brings order to everyday life by connecting calendars, tasks, and messages into one intelligent system.

Our edge is our ability to make complex systems feel simple. Our strength is our team: driven, thoughtful builders united by a shared mission. That mission is to bring order to everyday life and give people the power to live organized lives with organized minds. Our vision is to build joyful, connected communities. That begins with creating systems and workflows that help people feel more present and effective in their daily lives. When households run smoothly, they have more capacity to connect, contribute, and strengthen their communities.

The Role

We’re looking for a Founding AI Engineer to design, build, and scale the intelligence layer that powers Babs. You’ll work on the infrastructure that connects large language models, vector databases, and real-world data into a seamless, adaptive system.

This role sits at the intersection of machine learning infrastructure, data systems, and applied product engineering. You’ll be responsible for how Babs learns, remembers, and improves — turning context into intelligence and feedback into reinforcement.

You’ll help us move from AI-assisted features to a truly AI-native product that feels personal, contextual, and trustworthy.

What You’ll Do

Design and implement retrieval and memory systems using vector databases and semantic search
Build and maintain RAG pipelines that combine structured and unstructured data
Develop feedback loops and evaluation systems to measure and improve model output quality
Explore reinforcement learning (RL) and fine-tuning approaches that adapt model behavior to user context
Integrate and experiment with multiple LLMs and APIs, choosing the right tool for each task
Collaborate with platform and product engineers to bring intelligence into real features
Create observability and evaluation systems for AI latency, accuracy, and reliability
Contribute to architectural decisions that shape Babs’ long-term AI infrastructure

Our Ideal Candidate Has

4 or more years of experience working with LLMs, machine learning infrastructure, or applied AI systems
Strong experience with Python and frameworks like LangChain, LlamaIndex, or equivalent orchestration tools
Deep understanding of retrieval-augmented generation, embeddings, and vector databases (such as Pinecone, Weaviate, or FAISS)
Familiarity with evaluation frameworks, feedback loops, and reinforcement learning techniques
Experience building and scaling data or ML pipelines in production environments
Curiosity about user behavior and how models can be tuned to better serve real human needs
A thoughtful, practical approach to experimentation and iteration — you ship and learn

Bonus Points For

Experience designing model evaluation frameworks or AI Evals
Contributions to open-source AI infrastructure projects
Familiarity with prompt optimization, tool calling, and agentic workflows
Experience with multi-modal models (text, image, or speech)
Knowledge of GCP, Firebase, or event-driven architectures
A background in consumer or productivity products

Compensation & Benefits

Base salary range: $150,000 – $225,000, depending on experience and expertise
Competitive compensation package including equity
Comprehensive medical, dental, and vision coverage
Flexible PTO and a work culture built on trust and autonomy
A team that values craftsmanship, collaboration, and purpose over fluff

If you’re excited by the idea of building the intelligence layer behind a product that helps people live with more clarity and calm, we’d love to meet you — even if you don’t check every box. We care about curiosity, integrity, and a shared belief in what we’re building.
