Harvey Logo

Harvey

Research Engineer, Post-Training

Posted 2 Days Ago
Be an Early Applicant
Hybrid
San Francisco, CA, USA
231K-340K Annually
Mid level
Hybrid
San Francisco, CA, USA
231K-340K Annually
Mid level
Drive post-training experiments to improve agent performance for legal tasks: optimize harnesses, design grading/reward systems, study agent behavior, and collaborate with internal and external researchers to convert findings into training data, evals, and model improvements.
The summary above was generated by AI
Why Harvey

At Harvey, we’re transforming how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.

This is a rare chance to help build a generational company at a true inflection point. With 1500+ customers in 60+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched.

Our team moves fast, takes ownership, and is deeply committed to the mission — operating with intensity, staying close to our customers, and pushing each other for excellence. We live by three values: Decisiveness, Simplicity, and Job's Not Finished. We act quickly on clear judgment over perfect information, we believe simplicity is what scales, and we're never satisfied with where we are. If you want to do the best work of your career alongside people who share that drive, we'd love to build with you.

At Harvey, the future of professional services is being written today — and we’re just getting started.

Role Overview

Post-training is how Harvey turns expert feedback and agent traces into models that are meaningfully better at legal work. We are looking for a research engineer who can help scale that loop: defining and running model training experiments, interpreting results, and working with internal and external research partners to build better data, environments, graders, and training recipes.

This role is for someone who can self-manage model training and applied research projects. You will work closely with internal and external research collaborators on post-training efforts that matter to our product roadmap. The ideal candidate has extensive hands-on experience training open weight models, either in a research or production setting, and enough engineering depth to run and debug experiments efficiently.

What You'll Do
  • Drive post-training experiments, pushing agent performance while navigating the Pareto frontier of cost, latency, security, and governance.

  • Optimize agent harnesses, including domain-specific skills, tools, subagents, retrieval strategies, and validation loops that improve quality on long-horizon legal work.

  • Design and develop grading and reward systems that are reliable enough for evaluation, efficient enough for iteration, and strict enough for high-stakes legal work.

  • Study agent behavior, identifying patterns that correlate with successful work product, and converting those findings into training data, evals, or harness changes.

  • Work with Harvey researchers and external research partners to define experiments, evaluate methodology, review results, and keep projects moving toward concrete model improvements.

What You Have
  • Hands-on experience with post-training or model-training work, such as SFT, preference optimization, RLHF/RLAIF, reward modeling, distillation, or adapting open-weight models to specialized domains.

  • Strong judgment about model behavior: you can read traces, inspect outputs, identify failure modes, and reason about whether a metric is measuring the thing that matters.

  • Strong Python and research-engineering ability. You can write clean code, debug experiments, and build the simple but reliable systems needed to make research move faster.

  • Ability to self-manage ambiguous applied research projects and communicate clearly with researchers, engineers, product teams, domain experts, and external partners.

Nice to Have

  • Experience building data or evaluation infrastructure for ML workflows, such as dataset curation pipelines, model-output processing, experiment tracking, evaluation dashboards, or regression analysis tooling.

  • Experience with distributed training, inference systems, GPU workloads, or large-scale ML experimentation.

  • Research publications, open-source contributions, or shipped industry work in LLMs, agents, evaluation, or ML systems.

Compensation

$231,000 - $340,000

Depending on your location, an Applicant Privacy Notice may apply to you. You can find all of our Applicant Privacy Notices [here].

#LI-AK1

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing [email protected]

HQ

Harvey San Francisco, California, USA Office

San Francisco, California, United States

Similar Jobs

9 Days Ago
In-Office
Redwood City, CA, USA
275K-400K Annually
Expert/Leader
275K-400K Annually
Expert/Leader
Artificial Intelligence • Software • Conversational AI • Generative AI
Lead technical vision and execution for post-training systems that adapt OSS LLMs into production conversational products. Drive research in alignment, RL and fine-tuning, architect scalable training/inference infrastructure, build data pipelines and evaluation frameworks, and mentor teams to improve model behavior, safety, and user engagement at scale.
Top Skills: A/B Testing FrameworksCloud-Native Ml InfrastructureData PipelinesDistributed TrainingDockerGpu-Based SystemsKubernetesLarge Language Models (Llms)MistralModel ObservabilityModel ServingOrchestration PlatformsPreference OptimizationQwenReinforcement LearningSupervised Fine-TuningTransformers
21 Days Ago
Hybrid
San Francisco, CA, USA
200K-275K Annually
Mid level
200K-275K Annually
Mid level
Software
Build in-house tools for post-training models, focusing on improving model efficiency using various ML techniques across the system stack.
Top Skills: CgroupsDaskGpudirectInfinibandJaxKubernetesPyTorchRayRoceSlurmTensorFlow
21 Days Ago
In-Office
San Mateo, CA, USA
100K-300K Annually
Mid level
100K-300K Annually
Mid level
Artificial Intelligence • Robotics • Business Intelligence
As a Robotics Engineer, you will design and implement software and hardware adaptations for customer deployments, manage testing practices, and collaborate with robotics teams to enhance product quality.
Top Skills: CC++GoPythonRosRos2Rust

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account