Job Application for Research Scientist, Model Collaborativity at DeepMind

Snapshot

Artificial Intelligence could be one of humanity’s most useful inventions. At DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

About Us

We aim to enhance Gemini's capabilities to become a truly collaborative partner. User intent is typically unveiled through an ongoing, interactive, and multi-step conversation, rather than a single, isolated prompt.

We seek for the model to be able to proactively engage by asking clarifying questions, making timely suggestions, and automatically incorporating relevant context from past interactions and personal data.

This challenge is conceptualized as an imperfect information game. The core objective is for the model to deduce a user's hidden goals and preferences through cues provided during a conversation. We plan to utilize advanced Reinforcement Learning methods to optimize for long-term user satisfaction, which necessitates solving intricate credit assignment issues within interactive, stateful environments; model robustness issues; and exploration/exploitation strategies.

The techniques you develop will have a direct impact on a wide range of Gemini applications.

Key responsibilities:

Design and implement novel multiturn RL algorithms to train collaborative LLMs. This includes exploring advanced methods for credit assignment, model robustness, and exploration/exploitation strategies.
Develop and scale our training infrastructure, building on our existing framework for training against stateful user simulators.
Formalize the problem of collaborativity by creating new metrics, environments, and evaluation methodologies that capture long-term user satisfaction and preference alignment.
Do cutting-edge research that pushes the boundaries of how agents learn from interactions with users, user simulators and other agents with diverse model behaviors.
Collaborate with research and product teams to integrate these capabilities into core Gemini products, improving tasks that require sustained interaction and user understanding.

To make this effort successful, we need a strong RS who can help us deliver state-of-the-art collaborative models. We are looking for a candidate with deep expertise in reinforcement learning and large-scale ML systems. You should be passionate about solving complex, long-horizon problems and excited by the challenge of building truly adaptive and intelligent agents.

About You

In order to set you up for success as a Research Scientist at DeepMind, we look for the following skills and experience:

PhD in Machine Learning, Reinforcement Learning, Natural Language Processing, or a related field.
Strong data analysis and synthetic data generation skills.
Strong development skills in Python and experience with deep learning frameworks like JAX, PyTorch, or TensorFlow.
Experience building and working with large-scale ML training systems.

In addition, the following would be an advantage:

Deep theoretical and practical experience in Reinforcement Learning (e.g., policy gradient methods, value-based methods, model-based RL, credit assignment, robustness).
Experience developing and training large generative models (LLMs).
Strong track record of academic publications in top-tier conferences (e.g., NeurIPS, ICML, ICLR, AAAI).
Familiarity with research on game theory, multi-agent systems, or learning from human feedback (RLHF/RLAIF).
Experience building or using user simulators for RL training.

The US base salary range for this full-time position is between $166,000 - $291,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application Deadline: December 15, 2025

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Create a Job Alert

Interested in building your career at DeepMind? Get future opportunities sent straight to your email.

First Name

Last Name

Country

Phone

Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf

LinkedIn Profile

Link to external profile e.g. LinkedIn, GitHub etc.

Where did you hear about this role?

Select...

This Research Scientist role demands both cutting-edge algorithmic innovation in multi-turn Reinforcement Learning and robust, scalable engineering to integrate these methods into the Gemini framework. Considering this dual requirement, how do your specific experiences—from deep theoretical understanding of RL (e.g., credit assignment and robustness) to hands-on development of large-scale ML systems—position you as the optimal candidate to successfully design, scale, and deploy models that achieve long-term user satisfaction and collaborative intelligence?

U.S. Standard Demographic Questions

Google DeepMind is subject to certain governmental recordkeeping and reporting requirements for the administration of civil rights laws and regulations. In order to comply with these laws and achieve our goal of a diverse and inclusive workforce, Google DeepMind invites employees to voluntarily self-identify their race or ethnicity. Submission of this information is voluntary and refusal to provide it will not subject you to any adverse treatment. The information obtained will be kept confidential and may only be used in accordance with the provisions of applicable laws, executive orders, and regulations, including those that require the information to be summarized and reported to the federal government for civil rights enforcement. When reported, data will not identify any specific individual. If you'd like more information about your EEO rights as an applicant under the law, please click here https://www.eeoc.gov/employers/eeo-law-poster.

How would you describe your gender identity? (mark all that apply)

Select...

How would you describe your racial/ethnic background? (mark all that apply)

Select...

How would you describe your sexual orientation? (mark all that apply)

Select...

Do you identify as transgender?

Select...

Do you have a disability or chronic condition (physical, visual, auditory, cognitive, mental, emotional, or other) that substantially limits one or more of your major life activities, including mobility, communication (seeing, hearing, speaking), and learning?

Select...

Are you a veteran or active member of the United States Armed Forces?

Select...

Research Scientist, Model Collaborativity

Snapshot

About Us

About You

Apply for this job

U.S. Standard Demographic Questions