Principal Research Engineer, Gemini Evals

Sorry, this job was removed at 08:06 a.m. (PST) on Monday, Mar 30, 2026

In-Office

Mountain View, CA, USA

Snapshot

This role is for a Principal-level Research Engineer to lead the strategic development and execution of robust data pipelines, evaluation frameworks, and metric systems for the Gemini family of models and their associated product applications. As a key technical leader and individual contributor, you will apply deep expertise in large-scale machine learning, statistical rigor, and scalable engineering to ensure the safety, performance, and ethical alignment of our frontier AI systems before and after deployment.

About us

Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

This role is part of the Gemini Evaluation research teams. The Gemini Evals team defines success for Gemini, establishes metrics to track progress, and provides clear, actionable insights to guide development. As a Research Engineer on this team, you will be at the forefront of building the data and evaluation systems that ensure the safety and quality of the Gemini family of models.

The Role

As a Principal Research Engineer, you will operate as a technical expert and leader within the Gemini Data and Evaluation team. Your primary focus will be to architect and execute the rigorous evaluation and data systems that underpin all major model release and product launch decisions for Gemini.

This is a highly cross-functional role requiring a blend of deep ML research, world-class software engineering, and strategic influence. You will define the data strategy for critical evaluation campaigns, design novel metrics to measure safety and performance at scale, and mentor a team of engineers and researchers to build high-quality, reproducible systems. You will be accountable for communicating complex evaluation results directly to leadership stakeholders to guide the responsible deployment of our most advanced AI technology.

Key responsibilities

Technical Leadership & Strategy

Work on post-training evaluation and fine-tuning of large-scale models to improve performance and safety.
Define and champion the technical roadmap for large-scale data and evaluation supporting the Gemini model family and its real-world applications.
Drive the research of novel, high-signal evaluation methods (automated, human-in-the-loop, and adversarial) to measure model capabilities, alignment, safety, and trustworthiness.
Actively contribute to the broader scientific community by presenting findings on cutting-edge AI evaluation and safety methods.

About You

To set you up for success as a Principal Research Engineer at Google DeepMind, we look for the following skills and experience:

10+ years of experience in research engineering, with at least 5 years in a technical leadership role.
Experience with large-scale machine learning systems, data processing pipelines and evaluation methodologies.
Experience with large language models (LLMs) and their evaluation.
Experience in post-training evaluation research.

Amphitheatre Pkwy, Mountain View, CA, United States, 94043
