Mach9 Logo

Mach9

Software Engineer, Sensor Data Integration

Posted 6 Days Ago
In-Office
San Francisco, CA, USA
Mid level
In-Office
San Francisco, CA, USA
Mid level
Build and maintain scalable pipelines to ingest, standardize, store, and serve large geospatial datasets (point clouds, imagery) for ML and product use. Implement CI/CD, automated checks, performance optimizations, agentic harnesses for dataset triage/patching, and collaborate with ML, product, and customers to integrate and troubleshoot data.
The summary above was generated by AI
The Role

At Mach9, Sensor Data Integration Engineers build the algorithms and pipelines that transform large-scale geospatial datasets into structured, accessible formats to power our survey product, Digital Surveyor. You’ll work with high-volume data sources — LiDAR-collected point clouds, on-road imagery, overhead aerial ortho photos — and own the systems that ingest, standardize and store them for our training and product use. Every single piece of data that our customers upload will pass through your systems first.

This role is ideal for an engineer who loves puzzle-hunting — reverse-engineering sparsely-documented formats, wrangling coordinate systems and transforms, hunting down strange camera projection issues.

You’ll sit at the divide between our customers and our product, making messy real-world sensor data trustworthy at scale. This role sits at the front of everything we do: our models are only as good as the data feeding them, and you'll be the one making that data trustworthy at scale.

Responsibilities
  • Develop and maintain scalable, reproducible workflows for ingesting and processing large volumes of point cloud, imagery, and geospatial data.

  • Convert datasets from various sensor providers into Mach9's standardized internal formats.

  • Build CI/CD pipelines and automated checks that guarantee the correctness and consistency of data pipelines, including regression detection on dataset processing.

  • Optimize processing performance, query speed, and storage efficiency across large geospatial datasets.

  • Work closely with the customer success team to efficiently resolve issues and unblock customer projects.

    • Build and maintain agentic harness for automated dataset triage and code patching. Automatically propose or apply fixes, and escalate when human judgment is needed.

  • Work closely with ML and product teams to make data readily usable for training, inference and visualization.

  • Work closely with customers and data-provider partners to facilitate data integration (with occasional travels).

  • Puzzle-hunting: work with data formats with sparse or missing documentation.

Requirements
  • Strong software development, problem-solving, and debugging skills, with hands-on experience building production systems in Python.

  • Solid foundation in distributed systems and parallel computing.

  • Comfort operating with ambiguity — able to dig into undocumented or messy data formats, reverse-engineer how they work, and make steady progress without a clear spec.

  • Experience building agentic systems and setting up agent harnesses — orchestrating LLM-driven workflows for triage, debugging, or automated code patching.

  • Strong communication and collaboration skills, with the ability to work across ML, product, and customer-facing teams.

  • Bachelor's degree in Computer Science, Engineering, or equivalent experience.

Bonus qualifications
  • Experience building agentic systems and setting up agent harnesses — orchestrating LLM-driven workflows for triage, debugging, or automated code patching.

  • Understanding of geospatial data formats (e.g., LAS/LAZ, COPC, E57, GeoTIFF, Shapefiles) and tooling (e.g., GDAL, PDAL, untwine, laz-perf).

  • Expertise designing and managing data schemas and storage systems for geospatial data (e.g., Postgres/PostGIS, AWS S3).

  • Experience with large-scale data processing frameworks and cloud platforms (e.g., Spark, AWS Batch).

  • Familiarity with coordinate reference systems and transforms (CRS, WKT, pyproj, affine transforms).

  • Experience building data versioning, lineage, or artifact-tracking systems.

  • Experience operating data pipelines that feed ML training and inference.

  • Familiar with C++.

HQ

Mach9 San Francisco, California, USA Office

San Francisco, CA, United States

Similar Jobs

An Hour Ago
In-Office
180K-231K Annually
Senior level
180K-231K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Information Technology • Software • Defense • Manufacturing
Lead design and implementation of an AI-first enterprise platform (Hyperdrive) to automate aerospace operations. Build full-stack, data-dense React interfaces, scalable distributed systems, data pipelines, APIs, and AI integrations (LLMs, agents, RAG). Mentor engineers, set architecture, and collaborate with hardware, supply chain, and finance to turn operational bottlenecks into automated workflows.
Top Skills: Agentic WorkflowsAPIsAWSAzureCloud-NativeData LakeETLGoJavaScriptKubernetesLlmsMicroservicesPostgresPythonRagReactReal-Time ProcessingSnowflakeTypescript
An Hour Ago
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
201K-279K Annually
Expert/Leader
201K-279K Annually
Expert/Leader
Fintech • Machine Learning • Mobile • Security • Software
Lead creative strategy and execution for Growth and Product Marketing, managing a multidisciplinary team to produce performance-driven paid social, video, web, and lifecycle creative. Build scalable toolkits, AI-enabled workflows, and production systems that increase speed, personalization, and measurement while maintaining brand quality and creative excellence.
Top Skills: Agent-Powered SystemsAi Creative ToolsDisplay AdvertisingDrtvPaid SocialPmm ToolkitsSemStreamingVideo Production
An Hour Ago
Hybrid
50K-70K Annually
Mid level
50K-70K Annually
Mid level
eCommerce • Fashion • Retail • Sales • Wearables • Design
Lead and coach store staff, manage sales floor and stockroom operations, ensure excellent customer service, develop direct reports and build effective teams, and perform physical tasks (lifting, bending, climbing) as needed to meet store performance goals.

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account