Figure.ai Logo

Figure.ai

Helix AI Engineer, Video Pretraining

Reposted Yesterday
Be an Early Applicant
In-Office
San Jose, CA, USA
Senior level
In-Office
San Jose, CA, USA
Senior level
Lead the development of large-scale video foundation models for humanoid autonomy, focusing on training strategies and model evaluation for real-world applications.
The summary above was generated by AI

Figure is an AI robotics company developing autonomous general-purpose humanoid robots. Our goal is to build embodied AI systems that can perceive, reason, and act in the real world. Figure is headquartered in San Jose, CA, and this role requires 5 days/week in-office collaboration.

Our Helix team is responsible for developing the core AI systems that power humanoid autonomy. We are looking for a Helix AI Engineer, Video Pretraining to lead the development of large-scale video foundation models trained on diverse real-world and robot-collected data.

This role focuses on pretraining models that learn from raw video—capturing motion, interaction, and temporal structure—to enable downstream capabilities in perception, prediction, and embodied reasoning.

Responsibilities
  • Design and train large-scale video foundation models on diverse datasets spanning internet-scale video and robot-collected data
  • Develop pretraining strategies that capture temporal dynamics, motion, and object interaction from raw video sequences
  • Build models that learn transferable representations for downstream tasks such as perception, tracking, prediction, and control
  • Explore architectures for video understanding and generation, including transformer-based and diffusion-based approaches
  • Implement efficient data pipelines and training strategies for high-throughput video ingestion and large-scale distributed training
  • Optimize model performance across compute, memory, and training efficiency constraints
  • Collaborate closely with generative modeling, agent, and robot learning teams to integrate pretrained models into the autonomy stack
  • Design evaluation frameworks and benchmarks to measure temporal understanding, prediction quality, and generalization
Requirements
  • Experience training large-scale models on video data or other high-dimensional sequential modalities
  • Strong understanding of modern deep learning architectures for video, vision, or multimodal systems
  • Experience with large-scale pretraining, including dataset curation, training dynamics, and scaling laws
  • Proficiency in Python and deep learning frameworks such as PyTorch
  • Experience working with distributed training systems and large GPU clusters
  • Strong experimental rigor and ability to iterate quickly on model design and training strategies
  • Solid software engineering skills and ability to build scalable, reliable systems
  • Ability to operate independently and drive ambiguous, high-impact research directions
Bonus Qualifications
  • Experience working on frontier video models or multimodal foundation models
  • Background in video diffusion, autoregressive video modeling, or world models
  • Experience at leading AI labs such as OpenAI, Google DeepMind, Google, ByteDance, Midjourney, or Adobe
  • Experience with large-scale dataset construction and filtering for video pretraining
  • Familiarity with robotics, embodied AI, or learning from egocentric / first-person video
  • Publication record in machine learning, computer vision, or multimodal AI

The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended. 

HQ

Figure.ai San Jose, California, USA Office

San Jose, CA, United States

Similar Jobs

8 Minutes Ago
Easy Apply
Hybrid
2 Locations
Easy Apply
216K-480K Annually
Expert/Leader
216K-480K Annually
Expert/Leader
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Lead and scale a unified global Audit, Risk, and Compliance function. Own SOX/ICFR readiness, risk-based internal audit, regulatory compliance (including OFAC, KYC/KYB), and tech-forward automation of controls and monitoring. Advise the C-suite and Audit Committee, recruit and develop a multidisciplinary team, and partner cross-functionally to embed durable compliance and remediation into business workflows.
Top Skills: AutomationCloud InfrastructureCosoData AnalyticsIsoItgcsKybKycOfacSox/Icfr
An Hour Ago
Hybrid
70K-90K Annually
Junior
70K-90K Annually
Junior
Hardware • Healthtech • Software • Analytics
The Sales Development Representative will help build the Sales Development function, drive outbound prospecting, respond to inbound leads, and maintain CRM excellence while refining outreach effectiveness.
Top Skills: Crm SystemsHubspotSales Engagement Tools
An Hour Ago
Hybrid
San Francisco, CA, USA
160K-250K Annually
Senior level
160K-250K Annually
Senior level
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Own People & Talent data domain: design and govern HR/recruiting data models and pipelines, build dashboards and self-serve analytics, partner with cross-functional teams, enable AI-powered insights, and improve automation, data quality, and access for global people decisions.
Top Skills: AshbyAtsBamboohrDatabricksDbtDelta TablesHexHrisLlm-Powered AnalyticsNotebook-Based Analytics PlatformsPythonRSQL

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account