Matter Intelligence Logo

Matter Intelligence

Data/ML Infrastructure Engineer

Posted 8 Days Ago
In-Office
San Francisco, CA, USA
Mid level
In-Office
San Francisco, CA, USA
Mid level
The Data/ML Infrastructure Engineer builds and operates data infrastructures for processing drone and orbital sensing data, ensuring reliability, observability, and performance while collaborating with research and product teams.
The summary above was generated by AI
About the Role

We are seeking a Data Infrastructure Engineer to build and operate the infrastructure that turns drone, aerial, and orbital sensing data into production datasets, models, and customer-facing insights. This role spans ingestion, processing, storage, compute, and serving, with a strong emphasis on reliability, observability, performance, and cost.

You will work closely with research and product engineering to shorten iteration cycles, improve reproducibility, and raise the quality bar for production systems. You will define clear interfaces and operational standards that keep the platform trustworthy as data volume, model complexity, and product usage scale.

What You’ll Do
  • Design, build, and operate scalable data and ML infrastructure on AWS, including workloads running on Kubernetes

  • Build and maintain systems for ingestion, processing, storage, and serving, with strong guarantees around data quality, correctness, and operational safety

  • Partner closely with research to support perception model training and evaluation workflows, enabling faster experimentation and more reproducible iteration

  • Build platform primitives for observability, data versioning, lineage, evaluation, reproducibility, and operational excellence

  • Partner with product engineering to ensure data- and model-derived insights are accessible through reliable, low-latency serving and retrieval interfaces

  • Design systems that enable efficient access patterns for customer-facing products, including search, indexing, and large-scale querying

  • Identify and address bottlenecks in throughput, cost, and operational complexity as the platform scales

What We’re Looking For

You have strong software engineering fundamentals and have built production systems where reliability, cost, and performance matter. You can reason clearly about distributed systems tradeoffs, and you have experience designing data-intensive infrastructure that other engineers depend on.

You are comfortable working across data platform and ML platform concerns, and you understand how tightly coupled they become in production. You care about reproducibility, debuggability, and developer experience because you have seen how quickly they become bottlenecks.

You work effectively across research and product teams. You can translate ambiguous needs into clear interfaces and systems, and you can drive work from design through production while maintaining a high quality bar.

A few things we expect in this role:

  • Meaningful experience building production data infrastructure, ML infrastructure, or distributed systems

  • Strong programming skills in Python and SQL, with the judgment to choose the right abstractions and interfaces for production systems

  • Experience building and operating systems on AWS

  • Familiarity with modern infrastructure and platform tooling, including Kubernetes, Docker, and Terraform

  • Experience working with production storage and serving systems such as Postgres and Redis

  • Familiarity with data and ML workflow tooling such as Metaflow

  • Strong instincts for observability, testing, and operational excellence

Nice to Have
  • Experience supporting ML training, evaluation, batch inference, or model deployment in production

  • Familiarity with modern large-scale data patterns and tooling, including streaming, backfills, partitioning strategy, and schema evolution

  • Experience building internal platform primitives such as data versioning and lineage, dataset curation, experiment tracking, or tooling for reproducible workflows

  • Exposure to perception, multimodal, or geospatial systems, especially where data originates from real sensors and is used in real products

Location

This is a full-time role based in San Francisco, CA.

ITAR Requirements

To comply with U.S. export regulations, applicants must be one of the following:

  • A U.S. citizen or national

  • A lawful permanent resident (green card holder)

  • Eligible to obtain required authorizations from the U.S. Department of State

Employee Offerings & Benefits

At Matter, we believe in rewarding high performance and providing the support you need to thrive. Our compensation and benefits package includes:

  • Competitive compensation based on experience

  • Early-stage equity package

  • 100% employer-paid health, dental, and vision coverage

  • Opportunity to work on novel sensing, data, and AI systems with real-world deployment paths across drone, aerial, and orbital platforms

Who You Are

You are a strong engineer who likes building reliable systems that other teams can trust. You care about infrastructure quality, operational rigor, and clear interfaces. You are energized by working close to the data, close to the models, and close to the product.

Similar Jobs

8 Days Ago
In-Office
Mountain View, CA, USA
194K-352K Annually
Senior level
194K-352K Annually
Senior level
Artificial Intelligence • Automotive • Information Technology • Robotics
Design and develop data pipelines for autonomous driving systems, create storage for data metrics, build dashboards, and maintain data integrity, focusing on ML components.
Top Skills: BigQueryC++GCPGcsPostgresPython
8 Days Ago
In-Office
Mountain View, CA, USA
160K-241K Annually
Junior
160K-241K Annually
Junior
Artificial Intelligence • Automotive • Information Technology • Robotics
Responsible for designing and developing scalable data pipelines, storage systems, and monitoring tools for ML data infrastructure, ensuring system integrity and performance evaluation.
Top Skills: BigQueryC++GCPGcsPostgresPython
21 Days Ago
Remote or Hybrid
6 Locations
Mid level
Mid level
Artificial Intelligence • Information Technology • Software
The role involves designing scalable data pipelines for 3D, video, and sensor data, optimizing infrastructure, and productionizing ML models with researchers.
Top Skills: SparkAWSAzureDaskDvcFlyteGCPKubernetesMlflowPythonPyTorchRay

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account