Senior Staff Research Scientist, Gemini Safety Post-Training, DeepMind

DeepMindMountain View, CA, USA

Minimum qualifications:

PhD in Computer Science, a related field, or equivalent practical experience.
6 years of experience in Machine Learning Algorithms and Language Modeling.
One or more scientific publications in the ML/AI conferences or journals (e.g., NeurIPS, ICML, ICLR, CVPR).

Preferred qualifications:

6 years of experience in ML research, with 3 years of experience shipping Reinforcement Learning-based (or equivalent) post-training pipelines.
5 years of experience leading the cross-functional teams in complex, matrixed environments and ability to influence stakeholders, resolve incentives, and provide strategic technical judgment.
Ability to deploy the performance improvements in production foundation models.

About the job

As models become more agentic, executing long-horizon tasks, using tools, writing and running code, operating across multi-step workflows, the challenge of making them safe fundamentally changes. Surface-level safety methods (output filtering, refusal tuning, policy guardrails) were designed for single-turn interactions. They are not enough for agents that plan, act, and adapt over extended horizons.

We are looking for a Senior Staff Research Scientist to rethink safety post-training for this new reality. You will bring frontier post-training expertise, to develop training methods that make Gemini models deeply safe and aligned, especially in agentic settings. This role sits in Gemini Safety and partners closely with the Artificial General Intelligence (AGI) Safety team and the Gemini post-training organization.

Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.

We are pushing the boundaries across multiple domains. Our global teams offer learning opportunities and varied career pathways
for those driven to achieve exceptional results through collective effort.

Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $262000 - $365000 (USD) + 25% bonus target + bonus + equity + benefits

Learn more about benefits at Google.

Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $262000 - $365000 (USD) + 25% bonus target + bonus + equity + benefits

Learn more about benefits at Google.

Responsibilities

Rethink how safety is trained into models, especially for agentic, long-horizon behavior.
Design and ship post-training recipes (Reinforcement Learning (RL), Supervised Fine-Tuning (SFT), and beyond) that install safety and alignment properties into Gemini models. You own the path from research to production.
Build the metrics and evaluations that tell us whether training is actually making models safer in deployment, not just on benchmarks.
Work directly with the post-training pipeline and infrastructure. Partner with the AGI Safety team to bring alignment research into practical training. Translate between research and production.
Shape the road map for where safety post-training goes next. Build and grow the team to execute on it.

Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google's Applicant and Candidate Privacy Policy.

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.

To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.

job details

Jobs search results

Demand Generation Lead, Google Cloud Marketing (Fixed-Term Contract) (English, German)

Senior Software Engineer, Cloud Spanner, Site Reliability Engineering

IP DFT Engineer

Data Center Facilities Technician, Electrical

Agentic Safety and Ecosystem Architect, Trust and Safety

Silicon CAD Engineer, University Graduate, PhD

Silicon Physical Design CAD Engineer

Staff Software Engineer, Machine Learning, GeminiApp Personalization, DeepMind

System Development Engineer, Silicon, Google Cloud

Facility Manager, Data Center Operations

Technical Program Manager II, Network Demand Planning, Cloud Infrastructure

Cloud Customer Engineer, Platform