Exa (exa.ai) Logo

Exa (exa.ai)

Research, ML

Reposted 14 Days Ago
In-Office
San Francisco, CA, USA
180K-350K Annually
Mid level
In-Office
San Francisco, CA, USA
180K-350K Annually
Mid level
The ML Research Engineer will develop and train embedding models for a search engine, focusing on novel transformer architectures and dataset creation.
The summary above was generated by AI

Exa is an applied AI lab building a search engine unlike the world has ever seen. We build massive-scale infra to crawl the entire web, train state-of-the-art embedding models to process it, and design super high performant vector databases to retrieve over it. We now power search for Cursor, Cognition, HubSpot, and over 400,000 developers and have raised $350m from Lightspeed, Benchmark, and a16z.

 

Our ultimate goal is to build perfect search over all the world's information, far beyond Google. If you want to build massive-scale ML systems that will define the way the new AI world consumes information, this is the place for you.

 
Research at Exa
 

The ML organization sits at the heart of our mission. We train foundational models for search. Our goal is to build systems that can instantly filter the world's knowledge to exactly what you want, no matter how complex your query. Basically, put the web into an extremely powerful database.

 

We're looking for an ML Research Engineer to train embedding models for perfect search over the web. The role involves dreaming up novel transformer-based search architectures, creating datasets, creating evals, beating our internal SoTA, and repeat.

Desired Experience
  • You have graduate-level ML experience (or are an exceptionally strong undergrad)

  • You can code up a transformer from scratch in PyTorch

  • You like creating large-scale datasets and diving deeply into the data

  • You care about the problem of finding high quality knowledge and recognize how important this is for the world

Example Projects
  • Pre-training: Train a hundred billion parameter model

  • Fine-tuning: Build an RLAIF pipeline for search

  • Dream up a novel architecture for search in the shower, then code it up and beat our best model's top score

  • Build an eval system that answers: How do we know we're advancing our search quality? (this is an incredibly difficult question to answer)

Logistics
  • Location: This is an in-person opportunity in San Francisco.

  • Visas: We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H1B, O1, E3). While we cannot guarantee your visa, we have historically been successful in sponsoring candidates from all over the world. If you receive an offer, our team will work hard to get you a visa.

  • Benefits: We offer premium healthcare benefits (medical, dental, vision), fertility benefits, 16 weeks of fully paid parental leave for all new parents, and a monthly wellness stipend to all of our employees.

Exa (exa.ai) San Francisco, California, USA Office

533 Page St, , San Francisco, California , United States, 94117

Similar Jobs

2 Days Ago
In-Office
Santa Clara, CA, USA
164K-194K Annually
Junior
164K-194K Annually
Junior
Fintech • Information Technology • Logistics
Conduct research and implement deep learning and computer vision models (detection, ID, tracking, few-shot learning). Build proofs-of-concept with customers, commercialize solutions, publish papers and patents, and collaborate with production teams to deliver robust Industrial AI systems.
Top Skills: CC++Computer VisionDeep LearningFew-Shot LearningGenerative AiJavaMachine LearningNlpPythonPyTorchRScikit-LearnTensorFlow
3 Days Ago
Remote or Hybrid
San Francisco, CA, USA
165K-259K Annually
Mid level
165K-259K Annually
Mid level
Biotech • Pharmaceutical
Develop and evaluate representation-learning architectures for atomistic systems, including equivariant GNNs and geometric transformers; design pre/mid/post-training curricula and RL strategies; prototype, benchmark, and scale research into reusable components; collaborate with physicists, chemists, and engineers to ground models in physical principles and deploy on distributed infrastructure.
Top Skills: 3D Geometric Deep LearningEquivariant Graph Neural NetworksGeometric TransformersJaxMaceMulti-Cloud Distributed ComputeNequipPainnPythonPyTorchReinforcement LearningSchnet
3 Days Ago
Remote or Hybrid
San Francisco, CA, USA
165K-259K Hourly
Mid level
165K-259K Hourly
Mid level
Biotech • Pharmaceutical
Conduct research to develop generative and sampling models for molecular systems, integrating probabilistic inference, deep learning, and reinforcement learning. Design, train, prototype, and benchmark diffusion, autoregressive, flow-based, and latent-variable architectures. Map between simulation and reality, collaborate with chemists and physicists, and work with engineers to deploy scalable research on distributed compute. Contribute end-to-end from ideation to production in a hybrid San Francisco/New York environment.
Top Skills: JaxPythonPyTorch

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account