Metamorphic Additive Manufacturing Ltd Logo

Metamorphic Additive Manufacturing Ltd

ML Research Engineer (Distributed Training)

Posted 9 Hours Ago
Be an Early Applicant
In-Office
Palo Alto, CA, USA
200K-280K Annually
Senior level
In-Office
Palo Alto, CA, USA
200K-280K Annually
Senior level
Design, build, and optimize distributed training systems for foundation models across thousands of GPUs. Implement advanced parallelism, fault-tolerance, profiling, and tooling to enable large-scale, reproducible ML experiments in cloud/HPC environments.
The summary above was generated by AI
About Metamorphic

Metamorphic is developing new approaches to intelligence by combining machine learning with large-scale experimental neuroscience, informed by the principles that make the brain efficient, flexible, and robust. We are building foundation models trained on rich, continuous neural data — a high-resolution model of the brain at a scale never before possible.

Our founding team spans machine learning, neuroscience, and neurotechnology, with prior work including the MICrONS project, Neuropixels, and the Enigma project, as well as foundational scientific contributions in learning, neural computation, and generative modeling. Our work sits at the frontier of AI research, and we believe the highest-impact discoveries will come from researchers and engineers working as a single, tightly collaborative team.

The name Metamorphic reflects our belief that the next advances in intelligence will come from a change in form, beyond scale — from artificial to natural intelligence.

About the Role

We are hiring Research Engineers to join our growing AI research team. You will work on building and scaling the distributed systems that enable training Metamorphic’s state-of-the-art foundation models across thousands of GPU’s. This is a high-impact, technically deep role working at the frontier of ML research and engineering. You will design and optimize our distributed training framework, implement advanced parallelism strategies, build fault-tolerant infrastructure, and provide the tooling researchers need to run large-scale experiments quickly and reproducibly. You'll have substantial autonomy to shape foundational technical decisions on a small, high-impact team.

You'll thrive in this role if you:

  • Have significant software engineering experience and can move quickly without sacrificing rigor

  • Are able to balance research goals with practical engineering constraints

  • Are happy to take on tasks outside your job description to support the team

  • Enjoy pair programming and deeply collaborative work

  • Are eager to learn more about machine learning research in a novel scientific domain

  • Are enthusiastic to work at an organization that functions as a single, cohesive team pursuing large-scale AI research

  • Have ambitious goals for AI progress and are excited to create the best outcomes over the long term

We offer:

  • The chance to work on one of the most scientifically consequential AI projects being pursued today

  • A small, world-class team where your contributions directly shape the science and the company

  • Competitive compensation and benefits, along with visa sponsorship

  • Strong mentorship and career development

Salary Range

$200,000 - $280,000 USD

Based on experience. We additionally offer a competitive equity package and comprehensive benefits, as well as visa sponsorship for international candidates.

Minimum Qualifications
  • Bachelor's degree or equivalent experience in Computer Science, Machine Learning, or a related field

  • Strong software engineering skills with a proven track record of building complex systems

  • Hands-on experience building and debugging distributed training infrastructure (PyTorch FSDP, DeepSpeed ZeRO, Megatron, TorchTitan, or similar) and optimizing advanced parallelism strategies

  • Strong understanding of GPU architecture and performance: memory hierarchy, tensor core utilization, bandwidth vs compute limitations

  • Strong understanding of the NVIDIA ecosystem: CUDA, NCCL, NVLink/NVSwitch topologies, mixed-precision training (MXFP8/NVFP4), and profiling tools

  • Deep familiarity with PyTorch internals, including torch.distributed, autograd, memory management, and torch.compile

  • Experience with cloud/HPC environments and job orchestration across hundreds of GPUs

Nice to Have
  • Experience building fault-tolerant training pipelines, including checkpointing, automatic recovery, and infrastructure for reproducible experimentation

  • Experience with the latest in mixture-of-experts architectures, diffusion model training, or multimodal models

  • Experience with inference serving frameworks (vLLM, TensorRT-LLM) or building custom inference solutions

We encourage you to apply even if you do not believe you meet every single qualification. If you don't see a role that fits, we encourage you to submit a general application and tell us how you'd like to contribute to our mission.

Similar Jobs

52 Minutes Ago
Hybrid
65K-105K Annually
Junior
65K-105K Annually
Junior
Consumer Web • eCommerce • Information Technology • Retail • Software • Analytics • App development
Manage and grow a portfolio of residential customers within a territory by building relationships with owners, managers, and service providers; generate new business; plan and lead customer meetings; use CRM and company selling tools to track activities, analyze trends, and execute strategic sales plans; participate in trainings and trade events; apply consultative selling to achieve sales goals. Individual contributor role.
An Hour Ago
Hybrid
15-24 Hourly
Junior
15-24 Hourly
Junior
eCommerce • Fashion • Retail • Sales • Wearables • Design
Front-line brand ambassador delivering personalized luxury retail service. Drive sales via omni-channel selling (mobile POS, clienteling, social selling), meet individual and store KPIs, handle transactions, inventory, visual merchandising, and daily store operations. Build client relationships, source customers, support teammates, participate in training and brand initiatives, and maintain store standards and asset protection.
Top Skills: Clienteling ToolsIpadLaptopMobile PosShort-Form VideoSocial Selling PlatformsWalkie-Talkie
An Hour Ago
Remote or Hybrid
2 Locations
105K-163K Annually
Senior level
105K-163K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Manage and grow strategic partnerships with Presidio and Trace3 by developing and executing joint GTM plans, coordinating cross-functional enablement and marketing, leveraging investments to maximize ROI, aligning with sales leadership, and using data-driven insights to drive partner-sourced revenue and brand elevation.

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account