Specter Logo

Specter

Software Engineer - ML Infrastructure

Posted 3 Days Ago
In-Office
San Francisco, CA, USA
Mid level
In-Office
San Francisco, CA, USA
Mid level
The role involves designing ML pipelines for computer vision, optimizing models for edge deployment, and developing data management systems for sensor datasets.
The summary above was generated by AI

Company Background
Specter is creating a software-defined "control plane" for the physical world. We are starting with protecting American businesses by granting them ubiquitous perception over their physical assets.

To do so, we are creating a connected hardware-software ecosystem on top of multi-modal wireless mesh sensing technology. This allows us to drive down the cost and time of deploying sensors by 10x. Our platform will ultimately become the perception engine for a company's physical footprint, enabling real-time perimeter visibility and autonomous operations management.

Our co-founders Xerxes and Philip are passionate about empowering our partners in the fast-approaching world of physical AI and robotics. We are a small, fast-growing team who hail from Anduril, Tesla, Uber, and the U.S. Special Forces.

Role + Responsibilities
Specter is hiring an ML infrastructure engineer to build and scale the machine learning systems that power real-time perception and inference across our edge-cloud platform. This role owns the training, deployment, and optimization of computer vision and sensor fusion models that enable autonomous monitoring and decision-making for our customers' physical assets.

Key responsibilities include:

  • Designing and implementing scalable ML training pipelines for computer vision models (object detection, tracking, classification, segmentation).

  • Building efficient model serving infrastructure for real-time inference on edge devices with constrained compute and power budgets.

  • Optimizing models for deployment on embedded hardware (quantization, pruning, TensorRT, ONNX, CoreML).

  • Developing continuous training and evaluation systems to improve model performance from production data feedback loops.

  • Creating data pipelines for ingesting, labeling, versioning, and managing massive multi-modal sensor datasets (video, radar, lidar, thermal).

  • Implementing model monitoring, A/B testing frameworks, and performance analytics for deployed perception systems.

  • Collaborating with perception researchers to transition models from research to production at scale across thousands of edge nodes.

  • Building tools and infrastructure for distributed training, hyperparameter optimization, and experiment tracking.

Preferred Qualifications

  • Strong experience with ML frameworks (PyTorch, TensorFlow) and model optimization tools (TensorRT, ONNX Runtime, OpenVINO).

  • Deep understanding of computer vision architectures and their deployment tradeoffs (YOLO, transformers, CNNs, real-time detection/tracking).

  • Hands-on experience deploying models on edge devices (NVIDIA Jetson, ARM processors, or similar embedded platforms).

  • Expertise building MLOps infrastructure — experiment tracking (Weights & Biases, MLflow), feature stores, model registries, CI/CD for ML.

  • Experience with distributed training frameworks (PyTorch DDP, DeepSpeed, Ray) and GPU cluster management.

  • Strong software engineering skills in Python and systems languages (C++, Rust) for performance-critical inference code.

  • Familiarity with video processing, sensor fusion, or multi-modal perception systems is a plus.

  • Prior experience in robotics, autonomous systems, or real-time ML applications is highly valued.

Similar Jobs

7 Days Ago
In-Office
Sunnyvale, CA, USA
180K-250K Annually
Senior level
180K-250K Annually
Senior level
Artificial Intelligence • Machine Learning • Software • Cybersecurity
The role involves deploying and optimizing LLMs, designing the ML serving stack, and ensuring high-performance GPU services for production readiness.
Top Skills: CudaKubernetesLinuxLlmMlNcclPyTorchTensorFlowTensorrtTriton Inference Server
20 Days Ago
Hybrid
San Francisco, CA, USA
200K-250K Annually
Senior level
200K-250K Annually
Senior level
Artificial Intelligence • Security • Software
The role involves managing ML infrastructure, building scalable data pipelines, operating training frameworks, leading projects, and ensuring best DevOps practices for machine learning.
Top Skills: ArgocdDaskGcsGithub ActionsGrafanaKubernetesPrometheusPythonRayS3SparkTensorrtTerraform
17 Hours Ago
Remote or Hybrid
6 Locations
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
Design and develop scalable, high-performance data and API infrastructure for real-time processing. Mentor engineers and collaborate with teams to enhance AI model evaluations.
Top Skills: APIsDistributed SystemsLow-Latency PipelinesPyTorchScalable Backend ArchitectureStream Processing

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account