LlamaIndex Logo

LlamaIndex

Multimodal AI Engineer, Document Understanding

Reposted 19 Hours Ago
In-Office or Remote
Hiring Remotely in San Francisco, CA
180K-250K Annually
Mid level
In-Office or Remote
Hiring Remotely in San Francisco, CA
180K-250K Annually
Mid level
Develop and optimize machine learning models for document understanding, handle production ML systems, and integrate innovations into APIs.
The summary above was generated by AI

Join us and help shape the future of AI by redefining document workflows with AI agents.

About the Role:

We are seeking exceptional AI engineers to join our core document understanding team. You will work at the intersection of computer vision, natural language processing, and production ML systems to push the boundaries of what's possible in document parsing and understanding.

Our document understanding team builds the intelligence behind LlamaParse, LlamaExtract, and our other processing products. These systems are processing millions of complex documents including PDFs, PowerPoints, Word documents, and spreadsheets. Your work will directly impact thousands of developers building RAG applications and document agents, while also contributing to our open-source frameworks that shape how the industry approaches document processing.

Depending on your background and interests, you might focus more on data curation and evaluation, model fine-tuning and experimentation, or ML infrastructure and production systems. We're hiring multiple people and will work with you to find the best fit.

Responsibilities:
  • Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processing

  • Build robust data pipelines, evaluation frameworks, and experimentation infrastructure

  • Design and implement production ML systems that handle complex, real-world documents at scale

  • Stay current with latest advances in vision-language models, document AI, and multimodal learning

  • Collaborate with engineering teams to integrate ML innovations into production APIs

  • Contribute to both our open-source frameworks and enterprise offerings

  • Drive technical decisions while balancing research exploration with product delivery

Required Qualifications:
  • 3-7 years of experience in machine learning engineering or applied research

  • Strong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)

  • Hands-on experience training, fine-tuning, or deploying ML models in production

  • Deep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning

  • Experience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructure

  • Ability to read and implement from research papers and technical specifications

  • Track record of executing with high intensity in fast-paced environments

  • Strong technical communication skills and comfort with open-source collaboration

Preferred Qualifications:
  • Experience with vision-language models, transformer architectures, or model fine-tuning (LoRA, QLoRA)

  • Experience building evaluation frameworks, benchmarks, or data quality pipelines

  • Experience with model serving frameworks (vLLM, TensorRT, ONNX) or MLOps tools

  • Experience specifically with document understanding, OCR, or layout analysis

  • Contributions to open-source ML projects or frameworks

  • Experience with LLM applications and RAG systems

  • Strong understanding of model optimization techniques (quantization, distillation, pruning)

  • Experience with Docker/Kubernetes and distributed systems

  • Active participation in ML research community

Location:

We offer a hybrid-friendly culture based out of our downtown San Francisco office. Remote candidates will be considered for exceptional fits.

Why Join Us?
  • Impactful Mission: Work on innovative AI products that redefine how knowledge is accessed and utilized. Your models will process millions of documents and directly impact thousands of developers.

  • Cutting-Edge Technology: Work with the latest vision-language models, contribute to open-source frameworks used industry-wide, and shape the future of document AI.

  • Collaborative Team: Join a focused team of passionate engineers and researchers committed to pushing the boundaries of what's possible in document understanding.

  • Technical Autonomy: Significant creative freedom to explore new approaches while maintaining focus on delivering high-quality, production-ready solutions.

  • Growth Opportunities: Be at the forefront of the AI revolution, with ample opportunities to grow alongside our scaling organization. Shape your role based on your interests and strengths.

Additional Benefits:
  • Competitive base salary and equity compensation

  • Comprehensive medical/dental/vision coverage for you and your family

  • Unlimited paid time off policy

  • Daily catered lunch and snacks in the San Francisco office

  • Budget for conferences, research materials, and professional development

  • Access to cutting-edge compute resources and research tools

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

LlamaIndex does not accept unsolicited agency resumes. Please do not forward resumes to our jobs alias, employees, or any other organization location. LlamaIndex is not responsible for any fees related to unsolicited resumes.

Top Skills

Docker
Kubernetes
Mypy
Onnx
Pydantic
Python
Ruff
Tensorrt
Uv
HQ

LlamaIndex San Francisco, California, USA Office

San Francisco, California, United States

Similar Jobs

5 Hours Ago
Remote or Hybrid
8 Locations
108K-203K Annually
Mid level
108K-203K Annually
Mid level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Account Services Manager will enhance and retain relationships with Sports and Entertainment sellers, identify growth opportunities, and collaborate with various teams to optimize client experiences.
Top Skills: Ai ToolsGoogle SuiteLookerRevenue.IoSalesforceSnowflake
17 Hours Ago
In-Office or Remote
9 Locations
79K-117K Annually
Senior level
79K-117K Annually
Senior level
Gaming
Seeking a Senior Talent Sourcer to develop sourcing strategies, engage candidates, analyze metrics, and collaborate with recruiting teams to build talent pipelines in the gaming industry.
Top Skills: ArtstationGitGreenhouseLinkedInTalent NeuronWorkday
19 Hours Ago
Remote or Hybrid
8 Locations
153K-270K Annually
Senior level
153K-270K Annually
Senior level
Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Manage strategic planning and goal-setting processes, develop tools, support automation, partner with stakeholders, and analyze business opportunities.
Top Skills: AIProcess Automation

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account