Software Engineer, Large Scale Pre-Training Multimodal
Snapshot
At Google DeepMind, we've built a unique culture and work environment where long-term ambitious research can flourish. We are seeking a highly motivated Software Engineer to join our team and contribute to groundbreaking fundamental research and deployment in large scale pre-training.
About Us
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
The Role
We're looking for a Software Engineer with strong empirical understanding of deep learning, computer vision, and large language models as well as strong engineering skills. Bonus points for experience with distributed systems or reinforcement learning.
Key responsibilities:
- Day to day work is centered around reinforcement learning and diving into the details of data and evaluations
- Conduct modelling research: Use empirical and theoretical insights to derive novel research ideas that improve Gemini models.
- Dive deep into specific aspects of post-training (reinforcement learning, data infrastructure) to understand and improve model dynamics.
- Be deeply integrated with the Multimodal Vision team to focus specifically on scaling of Multimodal models.
About you:
In order to set you up for success as a Software Engineer at Google DeepMind, we are looking for the following skills and experience:
- 2-5 years of relevant professional experience as a Software Engineering or Technical role.
- A proven track record of large scale deep learning with hands-on experience with Python and neural network training (publications, open-source projects, relevant work experience, …).
- An in-depth knowledge of Transformer models and LLM training dynamics.
- Ability to communicate technical ideas effectively, e.g. through discussions, whiteboard sessions, written documentation.
- Strong knowledge and experience of Python and/or C++.
In addition, the following would be an advantage:
- Experience with Large Language Models and notably Vision Language Models.
- Working knowledge of JAX/Pytorch or similar frameworks.
- Experience with data analysis at scale, data visualization and data ingestion pipelines for training models.
The US base salary range for this full-time position is between $141,000 - $202,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
Application deadline: December 8, 2025
Note: In the event your application is successful and an offer of employment is made to you, any offer of employment will be conditional on the results of a background check, performed by a third party acting on our behalf. For more information on how we handle your data, please see our Applicant and Candidate Privacy Policy
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.
Create a Job Alert
Interested in building your career at DeepMind? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field