Notion

Model Behavior Engineer

Reposted 6 Days Ago
Hybrid
2 Locations
98K-140K Annually
Mid level
Owner of Notion AI quality: design and iterate prompts and context strategies, analyze production data, build evals and metrics, evaluate/launch models with labs, and drive cross-functional quality improvements.
The summary above was generated by AI
About Us:

Notion helps you build beautiful tools for your life's work. In today's world of endless apps and tabs, Notion provides one place for teams to get everything done, seamlessly connecting docs, notes, projects, calendar, and email—with AI built in to find answers and automate work. Millions of users, from individuals to large organizations like Toyota, Figma, and OpenAI, love Notion for its flexibility and choose it because it helps them save time and money.

In-person collaboration is essential to Notion's culture. We require all team members to work from our offices on Mondays, Tuesdays, and Thursdays, our designated Anchor Days. Certain teams or positions may require additional in-office workdays.

About the Role:

You'll own the quality bar for Notion AI products. You’ll work with product and engineering teams to build systems to define what “good” looks like, measure our progress, and drive changes to deliver reliable and high-quality AI experiences. Your work directly shapes how Notion's AI products behave for millions of users.

This isn't a traditional software engineering role. It’s an art & science role. You won't spend your days writing code. Instead, you'll focus on understanding and shaping how our AI products behave through context engineering, designing evaluation systems, and analyzing data. This role sits within our AI engineering team and works directly with engineering, product, design, and data.

This role is a unique blend of ops, strategy, and product thinking. Day to day, you'll live in production data, ship prompt fixes, and run evals, which in effect shapes our quality strategy. As part of that, you'll also help set Notion's model strategy and work directly with frontier AI labs (OpenAI, Anthropic, Google) to evaluate and launch new models.

We're looking for problem-seeking generalists interested in 0 → 1: curious people with high agency who thrive in ambiguous, fast-moving product areas. We're building a product, but also building a new function. You'll have real ownership from day one and help write the playbook as we scale.

What You'll Achieve:
  • Context engineering — Design, test, and iterate on system prompts, tool prompts, and context strategies that shape how Notion's AI products behave. Understand the nuances of how models respond to different context structures and use that knowledge to drive quality improvements directly.

  • Understand & debug — Live in production data: transcripts, logs, user feedback. Reproduce issues, identify root causes, and translate symptoms into actionable problem statements. Find signal in noisy data.

  • Build evals & measurement — Design eval strategies, build datasets, and run evaluations. Track quality over time. Identify issues before users do. Own the loop: define quality goals, create evals, test, and improve.

  • Evaluate and launch new models with leading research labs — Evaluate and launch models from OpenAI, Anthropic, Google, and others. Benchmark across dimensions: quality, latency, cost, edge cases. Help shape Notion's model strategy based on real data.

  • Drive quality priorities — Work embedded with eng and product teams to surface the most important issues. Own the quality narrative: severity, frequency, what to fix and why. Be the voice of quality in the room.

  • Build tooling & systems — Help manage AI observability and eval platforms (e.g., Braintrust). Build the playbooks and tools that enable all teams at Notion to build AI products.

Skills You’ll Need to Bring:
  • Driver mentality — You treat problems as yours. If something's broken, it's your job to fix it, even if you didn't cause it. You have a bias to action.

  • Curiosity — You’re excited about exploring the “jagged frontier” of LLM capabilities and how AI products meet reality.

  • Analytical instinct — Your first move is to look at data. You can find signal in noise.

  • Comfortable working with data — You can self-serve insights from large datasets, whether through SQL, coding agents, or other tools.

  • Clear communication — You can explain complex issues simply.

  • Experience with LLMs, prompting, or AI products

Nice to Haves:
  • Background in engineering, product, data science, research, or consulting

  • You've built something on your own to solve a problem — side project, startup, tool, whatever

We hire talented and passionate people from a variety of backgrounds because we want our global employee base to represent the wide diversity of our customers. If you’re excited about a role but your past experience doesn’t align perfectly with every bullet point listed in the job description, we still encourage you to apply. If you’re a builder at heart, share our company values, and are enthusiastic about making software toolmaking ubiquitous, we want to hear from you.

Notion is proud to be an equal opportunity employer. We do not discriminate in hiring or any employment decision based on race, color, religion, national origin, age, sex (including pregnancy, childbirth, or related medical conditions), marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, sexual orientation, or other applicable legally protected characteristic. Notion considers qualified applicants with criminal histories, consistent with applicable federal, state and local law. Notion is also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please let your recruiter know.

Notion is committed to providing highly competitive cash compensation, equity, and benefits. The compensation offered for this role will be based on multiple factors such as location, the role’s scope and complexity, and the candidate’s experience and expertise, and may vary from the range provided below. For roles based in San Francisco or New York City, the estimated base salary range for this role is $98,000 - $140,000 per year.

By clicking "Submit Application", I understand and agree that Notion and its affiliates and subsidiaries will collect and process my information in accordance with Notion's Global Recruiting Privacy Policy and NYLL 144.

Top Skills

Anthropic Models
Braintrust
Coding Agents
Google Models
LLMs
OpenAI API
Prompt Engineering
SQL
HQ

Notion San Francisco, California, USA Office

San Francisco, CA, United States, 94110
