Snorkel AI

Research Scientist, RL Training

Posted 3 Days Ago

In-Office or Remote

Hiring Remotely in San Francisco, CA, USA

200K-275K Annually

Expert/Leader

The Research Scientist will focus on reinforcement learning for training large language models, designing data pipelines, and translating research into products, contributing to Snorkel's capabilities and research agenda.

The summary above was generated by AI

About Snorkel

At Snorkel, we believe meaningful AI doesn’t start with the model, it starts with the data.

We’re on a mission to help enterprises transform expert knowledge into specialized AI at scale. The AI landscape has gone through incredible changes from 2015, when Snorkel started as a research project in the Stanford AI Lab, to the generative AI breakthroughs of today. But one thing has remained constant: the data you use to build AI is the key to achieving differentiation, high performance, and production-ready systems. We work with some of the world’s largest organizations to empower scientists, engineers, financial experts, product creators, journalists, and more to build custom AI with their data faster than ever before. Excited to help us redefine how AI is built? Apply to be the newest Snorkeler!

ABOUT THE ROLE

We're looking for a Research Scientist to work on reinforcement learning for training and aligning large language models. This is a foundational research role focused on one of the most consequential open data problems in AI: how to generate the data, reward signals, and training procedures that steer LLM behavior in reliable and generalizable directions. This work is also a core capability that directly differentiates Snorkel's data-as-a-service offering.

You'll work closely with Snorkel's research, engineering, and delivery teams to advance our RL data capabilities — translating research ideas into the preference datasets, reward models, and RL-ready corpora we produce for frontier AI labs, and contributing to a research agenda that is central to Snorkel's long-term differentiation as a provider of bespoke training data.

MAIN RESPONSIBILITIES

Research and implement reinforcement learning techniques — including GRPO, RLHF, RLAIF, DPO, and reward modeling — and translate them into data products (preference datasets, reward signals, verifiable rewards) that customers can use to train and fine-tune large language models.
Design and build data pipelines that generate high-quality training signal for RL workflows, including AI-assisted data annotation and curation pipelines that improve model generalization to unseen benchmarks.
Prototype and iterate on end-to-end RL training recipes that inform what data Snorkel ships as part of its data-as-a-service deliveries.
Work closely with research scientists, ML engineers, and delivery teams to translate RL research into customer-ready data products.
Stay current with the latest developments in large-scale multi-node LLM training, alignment research, and scalable RL methods for complex environments such as Terminal-Bench, bringing relevant advances into Snorkel's data-as-a-service approach.
Contribute to Snorkel's research publications and internal knowledge base in RL and model training.

PREFERRED QUALIFICATIONS

Deep expertise in reinforcement learning from human or AI feedback, reward modeling, and credit attribution, ideally with a clear perspective on what data makes these techniques work.
Experience training or fine-tuning large language models of 30B+ parameters, including familiarity with distributed training infrastructure.
Strong proficiency in Python and ML frameworks, especially PyTorch and HuggingFace, plus hands-on experience with RL frameworks such as Verl and SkyRL.
Solid software engineering fundamentals — you can build research prototypes that others can run, extend, and integrate into data production workflows.
Familiarity with ML infrastructure, cloud platforms, and tools (AWS, GCP, Kubernetes, Slurm, etc.); experience with large-scale RL training pipelines is a strong plus.
Comfort operating in a high-iteration environment with open-ended research questions and shifting, customer-driven technical constraints.
Ph.D. in machine learning, reinforcement learning, or a related field strongly preferred; exceptional industry experience considered.

Salary Range

$200,000–$275,000 USD

Be Your Best at Snorkel

Joining Snorkel AI means becoming part of a company that has market-proven solutions and robust funding and is scaling rapidly, offering a unique combination of stability and the excitement of high growth. As a member of our team, you’ll have meaningful opportunities to shape priorities and initiatives, influence key strategic decisions, and directly impact our ongoing success. Whether you’re looking to deepen your technical expertise, explore leadership opportunities, or learn new skills across multiple functions, you’re fully supported in building your career in an environment designed for growth, learning, and shared success.

Snorkel AI is proud to be an Equal Employment Opportunity employer and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. Snorkel AI embraces diversity and provides equal employment opportunities to all employees and applicants for employment. Snorkel AI prohibits discrimination and harassment of any type on the basis of race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local law. All employment is decided on the basis of qualifications, performance, merit, and business need.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Top Skills

AWS

GCP

HuggingFace

Kubernetes

Python

PyTorch

SkyRL

Slurm

Verl

55 Perry Street, Redwood City, CA, United States, 94063
