Handshake Logo

Handshake

Staff AI Research Scientist - Data Quality, Handshake AI

Reposted 8 Days Ago
In-Office or Remote
3 Locations
350K-420K Annually
Senior level
In-Office or Remote
3 Locations
350K-420K Annually
Senior level
As a Staff AI Research Scientist, you'll lead high-impact research on data quality frameworks for LLMs, design systems to improve data integrity, and mentor junior team members while collaborating across functions.
The summary above was generated by AI
About Handshake AI

Handshake is building the career network for the AI economy. Our three-sided marketplace connects 18 million students and alumni, 1,500+ academic institutions across the U.S. and Europe, and 1 million employers to power how the next generation explores careers, builds skills, and gets hired.

Handshake AI is a human data labeling business that leverages the scale of the largest early career network. We work directly with the world’s leading AI research labs to build a new generation of human data products. From PhDs in physics to undergrads fluent in LLMs, Handshake AI is the trusted partner for domain-specific data and evaluation at scale.

This is a unique opportunity to join a fast-growing team shaping the future of AI through better data, better tools, and better systems—for experts, by experts.

Now’s a great time to join Handshake. Here’s why:

  • Leading the AI Career Revolution: Be part of the team redefining work in the AI economy for millions worldwide.

  • Proven Market Demand: Deep employer partnerships across Fortune 500s and the world’s leading AI research labs.

  • World-Class Team: Leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, just to name a few.

  • Capitalized & Scaling: $3.5B valuation from top investors including Kleiner Perkins, True Ventures, Notable Capital, and more.

About the Role

As a Staff Research Scientist, you will play a pivotal role in shaping the future of large language model (LLM) alignment by leading research and development at the intersection of data quality and post-training techniques such as RLHF, preference optimization, and reward modeling.

You will operate at the forefront of model alignment, with a focus on ensuring the integrity, reliability, and strategic use of supervision data that drives post-training performance. You’ll set research direction, influence cross-functional data standards, and lead the development of scalable systems that diagnose and improve the data foundations of frontier AI.

You will:

  • Lead high-impact research on data quality frameworks for post-training LLMs — including techniques for preference consistency, label reliability, annotator calibration, and dataset auditing.

  • Design and implement systems for identifying noisy, low-value, or adversarial data points in human feedback and synthetic comparison datasets.

  • Drive strategy for aligning data collection, curation, and filtering with post-training objectives such as helpfulness, harmlessness, and faithfulness.

  • Collaborate cross-functionally with engineers, alignment researchers, and product leaders to translate research into production-ready pipelines for RLHF and DPO.

  • Mentor and influence junior researchers and engineers working on data-centric evaluation, reward modeling, and benchmark creation.

  • Author foundational tools and metrics that connect supervision data characteristics to downstream LLM behavior and evaluation performance.

  • Publish and present research that advances the field of data quality in LLM post-training, contributing to academic and industry best practices.

Desired Capabilities
  • PhD or equivalent experience in machine learning, NLP, or data-centric AI, with a track record of leadership in LLM post-training or data quality research.

  • 5 years of academic or industry experience post-doc

  • Deep expertise in RLHF, preference data pipelines, reward modeling, or evaluation systems.

  • Demonstrated experience designing and scaling data quality infrastructure — from labeling frameworks and validation metrics to automated filtering and dataset optimization.

  • Strong engineering proficiency in Python, PyTorch, and ecosystem tools for large-scale training and evaluation.

  • A proven ability to define, lead, and execute complex research initiatives with clear business and technical impact.

  • Strong communication and collaboration skills, with experience driving strategy across research, engineering, and product teams.

Extra Credit
  • Experience with data valuation (e.g. influence functions, Shapley values), active learning, or human-in-the-loop systems.

  • Contributions to open-source tools for dataset analysis, benchmarking, or reward model training.

  • Familiarity with evaluation challenges such as annotation disagreement, subjective labeling, or multilingual feedback alignment.

  • Interest in the long-term implications of data quality for AI safety, governance, and deployment ethics.

Perks

Handshake delivers benefits that help you feel supported—and thrive at work and in life.

The below benefits are for full-time US employees.

🎯 Ownership: Equity in a fast-growing company

💰 Financial Wellness: 401(k) match, competitive compensation, financial coaching

🍼 Family Support: Paid parental leave, fertility benefits, parental coaching

💝 Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend

📚 Growth: $2,000 learning stipend, ongoing development

💻 Remote & Office: Internet, commuting, and free lunch/gym in our SF office

🏝 Time Off: Flexible PTO, 15 holidays + 2 flex days

🤝 Connection: Team outings & referral bonuses

Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers.

Top Skills

Python
PyTorch
HQ

Handshake San Francisco, California, USA Office

We're located right in the center of everything in the financial district of downtown San Francisco. We're just 1 block from Montgomery St Bart!

Similar Jobs

14 Minutes Ago
Easy Apply
Remote
USA
Easy Apply
175K-250K Annually
Senior level
175K-250K Annually
Senior level
Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)
Lead the engineering team for product delivery at Runpod, focusing on customer-facing features while managing roadmaps, team growth, quality, and cross-functional collaboration.
Top Skills: Cloud Systems EngineeringGoKubernetesLinuxPythonTypescript
14 Minutes Ago
Remote or Hybrid
CO, USA
80K-120K Annually
Senior level
80K-120K Annually
Senior level
Information Technology • Insurance • Software
The role involves consulting for insurance clients, implementing AIM software, analyzing business operations, and managing multiple engagements. Strong communication and expertise in insurance accounting are essential.
Top Skills: Aim AccountingClaims ModulesUnderwriting
14 Minutes Ago
Easy Apply
Remote
USA
Easy Apply
200K-275K Annually
Senior level
200K-275K Annually
Senior level
Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)
The Director of Software Engineering will lead and scale product-delivery engineering teams, ensuring high-quality launches, effective strategy execution, and collaboration across departments while fostering a culture of ownership and excellence in a remote-first environment.
Top Skills: GoPythonTypescript

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account