tags.new

Research Scientist, Audio

New York City, New York, US

Snapshot

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

About Us

Members of the team are a group of researchers with core contributions into the Gemini Audio pillar. Specifically, the team works on audio and audio-visual understanding and generation tasks using large language models. Research includes, but is not limited to, better acoustic representations and tokenizers, better generation modeling, and audio and audio-visual open-ended tasks such as dialog, TTS, question-answering and dubbing.

The Role

Research Scientists at Google DeepMind lead our efforts in developing novel algorithmic architecture towards the end goal of solving and building Artificial General Intelligence.

In this role, responsibilities will include making key contributions into the latest research developed in the Gemini audio pillar, such as:

Key responsibilities:

  • Data: Unlocking new audio to X capabilities within the model, both in pre-training and post-training.
  • Models: Improving quality of models for understanding and generation. This includes research to improve our tokenizers, better techniques for generation quality, and looking at joint audio and visual representations. 
  • Evals: Better evaluation methods (human, auto raters, automated metrics) to measure quality of open-ended tasks.

About You

In order to set you up for success as a Research Scientist at Google DeepMind,  we look for the following skills and experience:

  • PhD in Computer Science, Computer Vision, Speech Processing, or Machine Learning related field.
  • Experience working with LLMs.
  • Audio or video understanding and/or generation experience.

In addition, the following would be an advantage: 

  • A proven track record of research and publications in some of the following areas: audio generation, video generation, LLMs
  • A real passion for AI!

 

The US base salary range for this full-time position is between $147,000 - $211,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

 

Create a Job Alert

Interested in building your career at DeepMind? Get future opportunities sent straight to your email.

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Select...

U.S. Standard Demographic Questions

Google DeepMind is subject to certain governmental recordkeeping and reporting requirements for the administration of civil rights laws and regulations. In order to comply with these laws and achieve our goal of a diverse and inclusive workforce, Google DeepMind invites employees to voluntarily self-identify their race or ethnicity. Submission of this information is voluntary and refusal to provide it will not subject you to any adverse treatment. The information obtained will be kept confidential and may only be used in accordance with the provisions of applicable laws, executive orders, and regulations, including those that require the information to be summarized and reported to the federal government for civil rights enforcement. When reported, data will not identify any specific individual. If you'd like more information about your EEO rights as an applicant under the law, please click here https://www.eeoc.gov/employers/eeo-law-poster.

Select...
Select...
Select...
Select...
Select...
Select...