Prime Intellect Jobs

Applied Research - Forward-Deployed

Prime Intellect

Applied Research - Forward-Deployed

Reposted 18 Days Ago

Be an Early Applicant

In-Office

San Francisco, CA, USA

Junior

In-Office

San Francisco, CA, USA

Junior

Embed with customers to design and run RL/post-training workflows: build custom environments, evaluation harnesses, and training runs on the Lab stack; translate field insights into platform improvements and reference implementations.

The summary above was generated by AI

Be Your Own Lab
Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL and SFT to tool use, agent workflows, and deployment. We validate everything by using it ourselves, training open state-of-the-art models on the same stack we put in your hands. We're looking for people who want to build at the intersection of frontier research and real infrastructure.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

About the Role

We're looking for a Forward-Deployed Research Engineer (FDRE) to serve as the primary technical interface between Prime Intellect and our most important customers: AI companies, research labs, and enterprises running post-training and agentic RL on our platform.

This is not a traditional research role. You'll spend most of your time embedded with customers, understanding their models, workflows, and goals. Then, you'll translate those objectives into concrete training runs, environment designs, evaluation harnesses, and deployment recipes using the Lab stack. You are the person who makes the platform work in practice for real workloads.

You'll work closely with our research, product, and infrastructure teams to feed field insights back into the platform, shaping what we build next based on what customers actually need.

What You'll DoCustomer Engagement & Technical Delivery

Embed directly with strategic customers to understand their agent architectures, failure modes, and product goals
Design and build custom RL environments, evaluation harnesses, and verifiers that capture what "good" looks like for each customer's domain
Architect agent scaffolding — tool use, multi-step reasoning, memory, sandbox execution — tailored to customer workflows
Configure and launch training runs on Lab, iterating on reward functions, rollout strategies, and evaluation criteria
Serve as the technical lead for engagements end-to-end: from discovery through deployed, improved models

Platform Feedback & Ecosystem

Identify repeatable patterns from customer engagements and codify them into reference implementations, templates, and documentation
Serve as the voice of the customer internally, shaping the roadmap for Lab, verifiers, the Environments Hub, and training infrastructure
Build high-quality examples and "recipes" that make it easy for new customers and open-source contributors to extend the stack
Contribute to technical content (blog posts, tutorials, case studies) that demonstrates real-world platform usage

Applied Research & Experimentation

Develop novel evaluation methodologies for agentic behavior — multi-step reasoning, tool use correctness, recovery from failure, long-horizon task completion
Prototype and iterate on agent harnesses for real-world tasks: code generation, workflow automation, document processing, and more
Experiment with reward design, rubric construction, and environment shaping to improve training signal quality
Stay current on the frontier of agentic AI, evals, and post-training methods, and bring that knowledge directly into customer work

What We're Looking For

Deep hands-on experience building, evaluating, or deploying LLM-based agents in the past 1–2 years — you've seen what breaks in production and know what good evals look like
Strong intuition for evaluation design: you can look at a customer's agent and quickly identify what to measure, how to construct a rubric, and where the reward signal is weak
Working understanding of RL and post-training concepts (GRPO, RLHF, reward modeling, SFT) — you don't need to have written a trainer from scratch, but you should understand what the knobs do and why they matter
Strong Python skills and comfort with the modern AI stack (Hugging Face, inference engines, agent frameworks)
Experience in a customer-facing or consulting-adjacent technical role, or as a technical founder — you're comfortable in a room with a customer's engineering team figuring out what to build
Excellent written and verbal communication — you can write a clear environment spec, a compelling case study, and a useful Slack message to a frustrated customer
High agency and comfort with ambiguity. You don't wait for specs; you scope the problem, ship a solution, and iterate

Nice-to-Haves

Experience with agent frameworks and tooling (DSPy, LangGraph, MCP, Stagehand, browser automation)
Experience building or running LLM evaluation pipelines at scale (benchmarks, synthetic data generation, model grading)
Research experience — publications, open-source contributions, or benchmarks in ML/RL/agents
Familiarity with sandbox/code execution environments for agent evaluation
Web programming experience (React, TypeScript, Next.js) for building demos and customer-facing tooling

What We Offer

Cash Compensation Range of $150-300k + equity incentives
Flexible Work (San Francisco or hybrid-remote)
Visa Sponsorship & relocation support
Professional Development budget
Team Off-sites & conference attendance

Growth Opportunity

You’ll join a mission-driven team working at the frontier of open, superintelligence infra. In this role, you’ll have the opportunity to:

Shape the evolution of agent-driven solutions—from research breakthroughs to production systems used by real customers.
Collaborate with leading researchers, engineers, and partners pushing the boundaries of RL and post-training.
Grow with a fast-moving organization where your contributions directly influence both the technical direction and the broader AI ecosystem.

If you’re excited to move fast, build boldly, and help define how agentic AI is developed and deployed, we’d love to hear from you.

Ready to build the open superintelligence infrastructure of tomorrow?
Apply now to help us make powerful, open AGI accessible to everyone.

San Francisco, CA, United States

Similar Jobs

Acquia

Sales Advisor

6 Minutes Ago

Easy Apply

Remote or Hybrid

United States

Easy Apply

85K-95K Annually

Mid level

85K-95K Annually

Mid level

AdTech • Cloud • Marketing Tech • Productivity • Software • Analytics • Automation

The Senior Sales Advisor will identify new business opportunities, lead sales for Acquia's DAM and PIM solutions, and engage with enterprise executives to close deals. They'll develop strategies based on market insights and collaborate across teams to drive growth.

Top Skills: DamDxpMartechPimPxmSaaS

STR

Software Engineer

6 Minutes Ago

Easy Apply

In-Office

Easy Apply

174K-225K Annually

Mid level

174K-225K Annually

Mid level

Machine Learning • Security • Software • Analytics • Defense

The Autonomy Software Engineer will develop and integrate systems, lead teams, conduct tests, and engage with customers to ensure robust mission systems solutions.

Top Skills: C++Ci/CdiOSLinuxLive-Virtual-Constructive (Lvc) InfrastructureMatlabWindows

STR

Product Manager

6 Minutes Ago

Easy Apply

In-Office

Easy Apply

265K-325K Annually

Expert/Leader

265K-325K Annually

Expert/Leader

Machine Learning • Security • Software • Analytics • Defense

The Chief Product Manager will lead product development and marketing strategies for collaborative autonomy, ensuring alignment with customer needs and overseeing multi-agent system technologies.

Top Skills: Advanced AlgorithmsCollaborative Autonomy StacksIntelligent System ArchitecturesMachine LearningSoftware Development Best Practices

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Prime Intellect

Applied Research - Forward-Deployed

Prime Intellect San Francisco, California, USA Office

Similar Jobs

Sales Advisor

Software Engineer

Product Manager

What you need to know about the San Francisco Tech Scene

Key Facts About San Francisco Tech