Foxglove Logo

Foxglove

ML Platform Engineer

Reposted 6 Days Ago
In-Office
San Francisco, CA, USA
183K-310K Annually
Mid level
In-Office
San Francisco, CA, USA
183K-310K Annually
Mid level
Design, deploy, and scale ML systems for production at Foxglove, focusing on data infrastructure for robotics, optimizing inference, and building evaluation workflows.
The summary above was generated by AI

Build the data infrastructure that powers robots in the real world.

Robotics is moving from research labs into production fleets across factories, warehouses, vehicles, defense systems, agriculture, logistics, and field deployments. As robots scale across the physical world, every failure, regression, edge case, and unexpected behavior becomes a data problem: what happened, when, on which robot, and why?

Every robot, in every industry, requires the same core capabilities: to sense, understand, and act on multimodal data from the physical world. At Foxglove, we built the agentic data platform robotics and Physical AI teams use to answer those questions. We help robotics teams make vast quantities of robot data actionable, creating the data flywheel they need to develop, test, train, deploy, and operate robots with confidence.

About the Role

We're looking for a ML Platform Engineer with deep infrastructure instincts to help design, deploy, and scale the systems that power Foxglove's data platform. This is a platform-first role: you'll own the infrastructure layer that makes ML possible in production, not just the models that run on top of it.

You'll be responsible for the reliability, scalability, and performance of the ML platform itself, from inference serving and pipeline orchestration to training infrastructure and evaluation frameworks. The problems are real and urgent: petabyte-scale multimodal robotics data, high-throughput retrieval and embedding pipelines, and the internal ML flywheel that lets our team ship fast. This is a hands-on infrastructure role, not research.

Key Responsibilities

  • Design, deploy, and operate production inference infrastructure — including model serving, autoscaling, load balancing, and cost optimization across cloud environments

  • Own the platform architecture for embedding and retrieval pipelines that power semantic search over multimodal robotics data (image, video, point cloud, and timeseries)

  • Build and maintain the training and evaluation infrastructure that enables rapid iteration on model performance — including job orchestration, experiment tracking, and dataset versioning

  • Drive cloud infrastructure decisions (AWS/GCP) that directly impact latency, throughput, reliability, and cost at scale

  • Define platform abstractions and internal tooling that let product engineers ship ML-powered features without needing to manage infrastructure themselves

  • Evaluate, integrate, and operationalize third-party ML infrastructure components; establish clear build vs. buy frameworks for the team

What We're Looking For

  • Deep, hands-on experience owning production ML infrastructure: inference serving, model optimization (e.g., vLLM, Triton, TorchServe), orchestration, and cloud cost management

  • Strong foundation in distributed systems and cloud infrastructure (AWS/GCP) — you think in terms of system reliability, failure modes, and operational burden, not just model accuracy

  • Experience architecting and operating retrieval systems at scale, including vector databases (e.g., Pinecone, Lance, turbopuffer, pgvector) and embedding pipelines over large, heterogeneous datasets

  • A platform engineer's mindset: you build systems that other engineers depend on, and you take that responsibility seriously

  • Proven ability to operate with high ownership — you can make hard infrastructure tradeoffs independently and move fast without breaking things

  • Strong communication skills; you can explain infrastructure tradeoffs clearly to both ML and non-ML engineers

Bonus Points

  • Familiarity with fine-tuning and domain adaptation techniques for LLMs or embedding models (i.e. SFT, PEFT)

  • Familiarity with data mining or hybrid search workflows, especially as applied in robotics autonomous vehicles, or physical AI workflows

  • Prior experience building ML platforms, evaluation frameworks, or data management tooling from the ground up

What We Offer

  • $300 monthly budget towards commuter benefits or building your personal workspace (remote only)

  • Competitive equity grant in a Series B company

  • Medical, Dental, Vision, and Term Life insurance coverage at 100% for employees and 75% for dependents

  • 401(k) matching up to 4%

  • 4 weeks vacation, plus holidays and winter break

  • All expenses paid company off-sites 2× per year

Why Join Us
  • Impact: Own growth at a fast-growing, high-leverage moment for the company.

  • Mission: Accelerate the development of the next generation of robotics and embodied AI.

  • Team: Work with world-class engineers, designers, and researchers passionate about open-source and developer tools.

  • Ownership: Drive initiatives end-to-end, with high autonomy and visibility.

HQ

Foxglove San Francisco, California, USA Office

San Francisco, CA, United States

Similar Jobs

3 Days Ago
Hybrid
Mountain View, CA, USA
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Build and scale a generative conversational AI platform: design APIs, optimize dialog engine for low-latency and low-memory multilingual enterprise usage, implement logging/tracing/metrics, create tooling and interfaces for customization, and collaborate with ML, product, and support teams to deliver robust, scalable solutions.
Top Skills: CloudGenerative AiLlmsLogging FrameworksMetrics SystemsMicrosoft TeamsReal-Time Multilingual TranslationSlackTracing FrameworksWeb Apis
21 Days Ago
Hybrid
San Francisco, CA, USA
217K-289K Annually
Senior level
217K-289K Annually
Senior level
Healthtech • Social Impact • Software
The Staff ML Platform Engineer will design and develop real-time ML systems, focus on infrastructure components, and set technical direction, partnering with other teams to meet business goals.
Top Skills: Cloud-Native Ml Serving PlatformsFeature StoresMl SystemsTerraform
16 Days Ago
In-Office or Remote
San Francisco, CA, USA
156K-320K Annually
Senior level
156K-320K Annually
Senior level
AdTech • Marketing Tech
Design and operate ML training and serving infrastructure, build a Kubernetes+Ray backend, improve developer experience and observability, mentor engineers, and ensure reliable, secure deployments for low-latency, high-performance systems.
Top Skills: Ci/CdEksKubernetesLinuxMlopsNixosPythonRayScalaTerraformZig

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account