AfterQuery Logo

AfterQuery

Software Engineer - RL Environments

Posted 2 Days Ago
In-Office
San Francisco, CA, USA
200K-200K Annually
Junior
In-Office
San Francisco, CA, USA
200K-200K Annually
Junior
Design datasets, evaluation rubrics, and reward signals for RLHF/RLVR; build real and synthetic data pipelines; run experiments modeling annotator behavior; develop quantitative metrics for dataset quality, diversity, and downstream impact; partner with research teams to translate training objectives into data and evaluation specs.
The summary above was generated by AI
About AfterQuery

AfterQuery is an applied research lab curating data solutions for foundation model development.

We serve every frontier AI lab with the mission of delivering the best data to power the best models. In doing so, we can make expertise that once took a lifetime to build available to anyone who needs it. Our customers are the ones building the foundation models themselves and our work sits directly in the loop of how those systems improve.

This is a rare opportunity to join a company at a defining moment in AI. Since raising our $30M Series A at a $300M valuation, AfterQuery has grown well over a $100M revenue run rate.

We're based in San Francisco and backed by leading investors including Altos Ventures, BoxGroup, and Y Combinator and angels from Google DeepMind, OpenAI, Anthropic, Meta Superintelligence Labs, and Microsoft AI.

The Role

As a SWE (Environments), you will design the datasets and evaluation rubrics that directly influence how frontier models learn. You'll work hands-on with research teams at top AI labs, experimenting with data collection strategies, diagnosing model failure modes, and developing the metrics that determine whether a model is actually improving. You'll go from hypothesis to live experiment quickly, and your output will feed directly into model training runs at scale.

Day to day, you will design data slices that expose meaningful failure modes across domains like finance, code, and enterprise workflows. You will build and refine reward signals for RLHF and RLVR pipelines. You will develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on alignment and capability. You will partner with lab research teams to translate their training objectives into concrete data and evaluation specifications.

What You'll Do

  • Design data slides and explore data shapes that expose meaningful model failure modes across domains like finance, code, and enterprise workflows

  • Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines

  • Model annotator behavior and run experiments to improve different model capabilities

  • Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability

  • Create and manage both real world & synthetic data pipelines

  • Partner with lab research teams to translate their training objectives into concrete data and evaluation specifications

What We're Looking For

  • 1-4 YOE

  • Major plus if they've worked for/interned for any RL environment companies in the past or any AI safety or benchmarking orgs like METR, Artificial Analysis, etc..

  • Genuine obsession with how data structure, selection, and quality drive model behavior

  • Ability to design lightweight experiments, move fast, and extract actionable insights from messy results

  • Former founders and early engineers at early stage startups are a plus. We don't filter on pedigree. We want people who can demonstrate they work hard, learn fast, and care deeply about getting the details right.

Compensation Structure:

$200k base + profit share (around 150% of base) + competitive equity

Similar Jobs

43 Minutes Ago
In-Office
2 Locations
157K-210K Annually
Senior level
157K-210K Annually
Senior level
Cloud • Information Technology • Machine Learning
Lead maturation and operations of security risk and M&A security programs: scale AI-assisted exception management and KRI dashboards from pilot to production, run risk review cadences and annual enterprise risk assessment, maintain M&A security integration playbook, drive decisions on stalled work, and partner cross-functionally to deliver program reporting and executive visibility.
Top Skills: Ai ToolsGrc ToolingKri Dashboards
44 Minutes Ago
In-Office
Sunnyvale, CA, USA
207K-275K Annually
Senior level
207K-275K Annually
Senior level
Cloud • Information Technology • Machine Learning
Lead technical direction and build a scalable network observability platform. Design collectors, storage, alerting, and visualization; standardize telemetry, mentor engineers, drive cross-team initiatives, and serve as senior on-call escalation for observability incidents and architecture.
Top Skills: AlertmanagerAnsibleBashClickhouseGnmiGoGrafanaHpe JunosIp NetworkingJaegerJinja2KubernetesLinuxLokiNokia Sr OsNvidia Cumulus LinuxOpentelemetryPrometheusPythonSnmpSonicSr LinuxZipkin
44 Minutes Ago
In-Office
2 Locations
150K-170K Annually
Expert/Leader
150K-170K Annually
Expert/Leader
Cloud • Information Technology • Machine Learning
Lead and coach a field team of Account Managers to grow, retain, and expand existing enterprise customers across territories. Drive revenue, pipeline, and adoption through cross-functional coordination, operating cadences, territory management, hiring, and performance development.
Top Skills: AIAi-Native PlatformCloudSalesforce

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account