Figure.ai

Helix AI Engineer, Training Infrastructure

Reposted Yesterday
In-Office
San Jose, CA, USA
150K-350K Annually
Mid level
The role involves managing training infrastructure, implementing distributed training algorithms, and collaborating with AI researchers on model training.
The summary above was generated by AI
Figure is an AI robotics company developing autonomous general-purpose humanoid robots. The company's goal is to ship humanoid robots with human-level intelligence. Its robots are engineered to perform a variety of tasks in the home and commercial markets. Figure is headquartered in San Jose, CA.
Figure's vision is to deploy autonomous humanoids at a global scale. Our Helix team is looking for an experienced Training Infrastructure Engineer to take our infrastructure to the next level. This role focuses on managing the training cluster and on implementing distributed training algorithms, data loaders, and developer tools for AI researchers.
Responsibilities
  • Design, deploy, and maintain Figure's training clusters
  • Architect, optimize, and maintain scalable deep learning frameworks for training on massive robot datasets
  • Work together with AI researchers to implement training of new model architectures at a large scale
  • Implement distributed training, advanced parallelization strategies, and high-performance data loaders to reduce model development cycles
  • Profile, identify, and eliminate training bottlenecks at the hardware and software levels to maximize Model FLOPs Utilization (MFU)
  • Implement tooling for data processing, model experimentation, and continuous integration
Requirements
  • Strong software engineering fundamentals
  • Bachelor's or Master's degree in Computer Science, Robotics, Engineering, or a related field
  • Extensive professional experience with Python and PyTorch
  • Proven track record of personally scaling and running large-scale training experiments on 800+ GPUs
  • Experience managing HPC clusters for deep neural network training
  • Minimum of 4 years of professional, full-time experience building reliable backend systems and infrastructure
Bonus Qualifications
  • Experience contributing to or maintaining open-source distributed training frameworks (Megatron-LM, DeepSpeed, TorchTitan)
  • Experience managing cloud infrastructure (AWS, Azure, GCP)
  • Experience with job scheduling / orchestration tools (SLURM, Kubernetes, LSF, etc.)
  • Experience with configuration management tools (Ansible, Terraform, Puppet, Chef, etc.)
  • Deep understanding of CUDA and hands-on experience writing custom GPU kernels to optimize training

The US base salary range for this full-time position is $150,000 to $350,000 annually.

The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.



