Pika Logo

Pika

Multimodal LLM Researcher (MLLM)

Reposted 8 Days Ago
Be an Early Applicant
In-Office
Palo Alto, CA, USA
185K-400K Annually
Senior level
In-Office
Palo Alto, CA, USA
185K-400K Annually
Senior level
Lead research on multimodal generative models focusing on real-time synthesis from text, image, video, and audio, and collaborate with teams to develop scalable technologies.
The summary above was generated by AI
Multimodal LLM Researcher (MLLM)
About the Role

At Pika, we are pioneering next-generation creative infrastructure built around real-time, multimodal generation and intelligent, agentic platforms. We are seeking accomplished Multimodal LLM Researchers (LLM, VLM, and Audio LM) to drive forward our mission to make agentic real-time generative technology accessible, dynamic, and transformative for millions of creators.

 

As a core member of our research team, you will be integral to designing and building foundational technologies, developing novel approaches for large multimodal language models (LLMs/VLMs/Audio LMs), and orchestrating intelligent agentic systems that power scalable, interactive multimedia experiences. You will collaborate closely with engineering and product teams, shaping the future of real-time creative platforms.

 
What You’ll Do
  • Lead and contribute to research efforts focused on real-time, multimodal generation—including text, image, video, and audio synthesis—as well as orchestration of agentic platform infrastructure

  • Design and prototype novel algorithms and architectures for high-fidelity, real-time multimodal synthesis and interactive experiences

  • Focus on real-time aspects of model inference and synthesis across modalities

  • Work on diffusion model distillation and/or develop diffusion-based world models for multimodal applications

  • Train and finetune autoregressive and diffusion models in LLM, VLM, or Audio LM contexts with a focus on real-time performance

  • Curate specific datasets, especially for video, audio, cross-modal, and sensory-rich data

  • Collaborate with cross-functional teams to bring research advancements into production-ready technologies

  • Publish work in top-tier conferences and journals; communicate research results internally and externally

  • Stay at the cutting edge of real-time multimodal generative AI and agentic orchestration

 
What We’re Looking For
  • 5+ years of relevant experience, including research during graduate studies, in large language models, vision-language models, audio language models, deep learning, or related fields

  • Demonstrated impact as first author on major publications in top conferences or journals (e.g., NeurIPS, ICML, ICLR, frontier research background)

  • Deep expertise in at least one area: language modeling (LLM), vision-language modeling (VLM), or audio language modeling (Audio LM)

  • Strong experience with generative models, including autoregressive and diffusion models, and their real-time deployment

  • Hands-on experience curating, constructing, or augmenting large, high-quality multimodal datasets

  • Experience developing and deploying real-time systems and/or agentic orchestration infrastructure

  • Strong programming and prototyping skills (Python, PyTorch, TensorFlow, etc.)

  • Passion for building creative tools and platforms that empower users

  • Excellent communication and collaboration skills

 
What We Offer
  • Competitive salary and substantial equity in a high-growth startup

  • Full health benefits + 401k matching and more

  • Collaborative, mission-driven team environment with major growth opportunities

  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

 
 
About Pika

Pika empowers creators by building state-of-the-art agentic and multimedia platforms. Our vision is to break down technical barriers to creativity, making real-time generative and intelligent orchestration accessible to all. Join us and shape the next evolution of creative technology!

 

If you are a leading researcher excited by real-time multimodal AI and agentic platforms, we want to hear from you.

Similar Jobs

9 Minutes Ago
Remote or Hybrid
United States
50K-50K Annually
Junior
50K-50K Annually
Junior
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Sell and manage group benefits (life, disability, dental, vision, voluntary) in the Utah market. Build broker and client relationships, develop strategic sales plans, grow renewals and new business for 2,000–4,999 life groups, coordinate cross-functionally for implementation, and track pipeline and sales activity to meet territory goals.
9 Minutes Ago
Remote or Hybrid
United States
42K-42K Annually
Junior
42K-42K Annually
Junior
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Provide phone and digital customer support for insurance policy, coverage, billing, and service inquiries. Use AI-guided tools and CRM systems to resolve complex issues, validate call summaries, document interactions, escalate as needed, and contribute to process improvements. Participate in paid training and ongoing development.
Top Skills: Ai-Powered ToolsAutomated SummarizationCopilotCRMCustomer Communication SystemsGuided Decision WorkflowsKnowledge Bases
10 Minutes Ago
Remote or Hybrid
United States
110K-150K Annually
Expert/Leader
110K-150K Annually
Expert/Leader
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Lead a regional financial team overseeing billing, analysis, forecasting, and reporting. Partner with Sales and Client Services, own revenue and earnings projections, drive process improvements and compliance, manage year-end reporting and key financial approvals, and build cross-functional relationships to meet customer and financial goals.

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account