Technical Program Manager, Agent Quality and Evaluation, DeepMind

DeepMind; Mountain View, CA, USA

Minimum qualifications:

Bachelor's degree or equivalent practical experience.
10 years of experience in program or project management.
10 years of experience managing cross-functional or cross-team projects.

Preferred qualifications:

Experience building and scaling evaluation infrastructure for AI/ML systems, including benchmark design, metrics definition, and quality tracking.
Experience partnering with research and engineering teams in fast-paced environments to guide program delivery from concept to completion.
Understanding of the unique challenges in evaluating agentic behavior, and a passion for AI agents and self-sustaining systems.
Ability to prioritize, adapt to change, and provide flexible thought partnership in an evolving landscape.
Excellent communication skills with the ability to develop meaningful relationships with key partners and influence action and outcomes.

About the job

A problem isn’t truly solved until it’s solved for all. That’s why Googlers build products that help create opportunities for everyone, whether down the street or across the globe. As a Program Manager at Google, you’ll lead complex, multi-disciplinary projects from start to finish — working with stakeholders to plan requirements, manage project schedules, identify risks, and communicate clearly with cross-functional partners across the company. Your projects will often span offices, time zones, and hemispheres. It's your job to coordinate the players and keep them up to date on progress and deadlines.

As the Technical Program Manager for AI Agent Quality and Evaluation, you will be the strategic owner of evaluation infrastructure that ensures our AI agents deliver reliable, high-quality outcomes at scale. You will scale evaluation efforts across agent quality (e.g., capability-based evaluations, user feedback pipelines, quality dashboards) and product evaluations (e.g., workflow validation, real-world task completion metrics). This role is critical to establishing the quality bar for self-sustaining agent execution across software development, operations, and enterprise workflows.

In this role, you will own the evaluation strategy for our AI agent programs. You will work at the intersection of research, engineering, and product to ensure our AI agents meet the highest quality standards before deployment.

Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.

We are pushing the boundaries across multiple domains. Our global teams offer diverse learning opportunities and varied career pathways for those driven to achieve exceptional results through collective effort.

The US base salary range for this full-time position is $240,000-$334,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.

Responsibilities

Build and scale capability-based evaluation frameworks for AI agents.
Establish quality dashboards and leaderboards for tracking agent performance and latency.
Guide user feedback pipelines to collect and curate high-quality evaluation examples.
Coordinate benchmark evaluations comparing agent capabilities against baselines.
Partner with evaluation teams to validate agent capabilities across various use cases.

Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit, is subject to Google's Applicant and Candidate Privacy Policy.

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.

To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.
