This role focuses on large-scale world models for temporal reasoning and generation, including video models, multimodal generative models, LLM/VLM/VLA models, and predictive models of traffic participants and scenes. Your work will directly power Waabi World’s ability to model future evolution, synthesize realistic safety-critical scenarios, and provide rich generative priors for downstream planning, testing, and training.
You will…
- Conduct fundamental and applied research in generative and predictive world modeling, including:
• Video generation and prediction.
• Latent diffusion / autoregressive / flow-matching models.
• Multimodal foundation models for driving scenes.
• LLM / VLM / VLA methods for scene understanding, reasoning, and control.
• Generative scenario modeling and controllable simulation.
• Model distillation.
- Collaborate with engineers to integrate models into large-scale, distributed training and rendering pipelines.
- Publish high-impact research at top conferences (CVPR, ECCV, ICCV, NeurIPS, ICLR, ICRA, SIGGRAPH).
- Mentor junior scientists and interns; foster a culture of scientific rigor and rapid experimentation.
- Stay on top of emerging advances in generative AI, differentiable rendering, knowledge distillation/compression, and robotics.
Qualifications:
- Demonstrated technical innovation: You have a Ph.D. in Computer Vision, Machine Learning, Robotics, or a related field, or equivalent research experience pushing the boundaries of a technical field.
- Strong prototyping and implementation: You have expert-level Python & PyTorch (or JAX) skills; strong software-engineering fundamentals and experience with distributed training.
- Expert domain knowledge: You have built generative or predictive models of the physical world for real-world applications, with scale and efficiency in mind.
- Team player: You have worked in a close-knit team of researchers and engineers and have the strong communication skills needed to deliver successful projects.
Bonus:
- Proven ability to translate research into production-quality code and measurable product impact.
- First-author publications in top-tier venues on topics such as world models, generative simulation, video prediction, diffusion, flow-matching, or foundation models for autonomy.
Waabi San Francisco, California, USA Office
San Francisco, California, United States


