Scout AI (scoutco.ai)

AI Cloud Infrastructure Engineer

Reposted 7 Days Ago
In-Office
Sunnyvale, CA
160K-240K Annually
Mid level
Design and scale the infrastructure for Fury's AI model training and deployment, ensuring efficient data pipelines and collaboration with AI teams.
The summary above was generated by AI

The future of defense will be decided by those who field intelligent machines at scale. At Scout AI, we’re developing Fury, the first robotic foundation model for defense, to give U.S. forces overwhelming, adaptable, and autonomous power across every domain. Fury enables human operators to command fleets of robots through natural language, and empowers those machines to sense, decide, and act together as one. This mission will ask everything of us: urgency, precision, and relentless work.

The Role
We’re looking for an AI Infrastructure Engineer to build and scale the backbone of Fury’s model training and deployment ecosystem. You’ll design the data, compute, and orchestration infrastructure that enables our vision-language-action models to learn from massive real-world datasets and operate across edge and cloud environments. This role bridges systems engineering, distributed computing, and machine learning infrastructure. Your work will ensure our teams can iterate rapidly, train large models efficiently, and deploy them reliably on robotic platforms in the field.

We’re a startup. You’ll be moving fast, context-switching daily, and helping define the culture and process as we go. This is a rare opportunity to come in early and architect the future of defense.

Responsibilities

  • Design and implement data pipelines for ingesting, transforming, and storing petabytes of multimodal data from Fury’s robotic and operator systems
  • Develop internal tooling for dataset exploration, curation, versioning, and quality monitoring over time
  • Build and maintain distributed training infrastructure (cloud and on-prem) for large-scale multimodal and foundation model training
  • Implement job orchestration workflows for launching, tracking, and debugging large-scale model runs
  • Identify and remediate bottlenecks in compute, memory, storage, and network performance to optimize throughput and cost efficiency
  • Collaborate with AI, autonomy, and systems teams to ensure data and training infrastructure supports real-time and mission-critical use cases
  • Maintain observability and reliability tooling for training and inference pipelines
  • Stay current on best practices in MLOps, distributed training frameworks, and AI infrastructure at scale

Qualifications

  • 3+ years of experience in ML infrastructure, MLOps, or large-scale data systems
  • Proven experience with distributed training (PyTorch DDP, DeepSpeed, Ray, or similar) and workflow orchestration (Kubernetes, Airflow, or equivalent)
  • Strong proficiency in Python and cloud-native infrastructure (AWS, GCP, or Azure)
  • Deep understanding of data engineering (ETL pipelines, object storage, data versioning, metadata management)
  • Familiarity with containerization and deployment (Docker, Kubernetes) and monitoring systems (Prometheus, Grafana)
  • Experience optimizing GPU cluster utilization, scaling training jobs, and profiling model performance
  • Bachelor’s degree or higher in Computer Science, Electrical Engineering, or related technical field
  • Bonus: Experience with edge-deployed ML systems, federated training, or robotic data collection pipelines
  • Must be a U.S. Person due to required access to U.S. export-controlled information or facilities

Why Join Scout

  • Work on the world’s most important frontier, ensuring U.S. and allied dominance in the age of intelligent machines
  • Be a core part of a team building the first defense-specific robotic foundation model
  • Collaborate with some of the top engineers in autonomy, AI, and national security
  • See your work deployed on real systems
  • Help define the future of intelligent defense systems
  • Backed by Draper Associates, Booz Allen Ventures, and other top investors

Benefits

  • Competitive compensation package including base salary and bonus
  • Generous equity participation in company growth
  • Premium medical, dental, and vision plans with $0 paycheck contribution
  • Competitive PTO and company holiday calendar
  • Catered lunch daily and fully stocked kitchen
  • EV charging
  • Relocation assistance (depending on role eligibility)

The stated salary range below represents an estimated base pay only and reflects consideration of multiple compensation factors. Final salary offers may differ depending on factors including, but not limited to, relevant experience or training background, specialized skills, and business needs. Most full-time positions also include highly competitive equity awards, which form part of Scout AI's overall compensation package. In addition, Scout AI provides comprehensive, top-tier benefits to full-time employees.

US Salary Range
$160,000-$240,000 USD

Top Skills

Airflow
AWS
Azure
DeepSpeed
Docker
GCP
Grafana
Kubernetes
Prometheus
Python
PyTorch DDP
Ray

HQ

Scout AI (scoutco.ai) Sunnyvale, California, USA Office

Sunnyvale, California, United States, 94089


What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
