Back to jobs

Staff Research Engineer, Text to Speech, Innovation

SoundHound AI believes every person should be able to interact naturally with the products around them–by simply talking. With a global reach spanning two dozen languages, we build Voice AI products with conversational intelligence for cars, restaurant ordering, retail businesses, and more, allowing our customers to extend their brand in new and meaningful ways.

In this role, you will:

  • Conduct research on TTS
  • Work with language specialist and labelers to organize the collection and maintenance of necessary data
  • Implement and train State Of The Art TTS models
  • Work with the systems and infrastructure team to produce production grade implementation, integration and deployment of those models, serving millions of queries a month

We would love to hear from you if:

  • You have a proven track record of doing cutting edge research on TTS (published papers in top-tier conferences and journals, submitted patents, …)
  • You have at least ten years of working in academia or industry on TTS or related topics such as voice conversion/cloning, emotion recognition, style transfer, vocoders, ASR, voice biometrics
  • You have contributions to open-source TTS projects or communities
  • You possess excellent Python and C++ skills, and are familiar with the latest tools, standards and best practices
  • You have a holistic mindset, and thrive on owning the entire technical stack for products with research and production components
  • You enjoy working with cross-functional teams, including linguists, software engineers and SREs, and are able to communicate complex technical concepts clearly

We’d be especially excited if you:

  • Are familiar with cloud and MLOps technologies such as kubernetes, docker
  • Have experience in deploying ML models at scale and are driven by performance and cost efficiency
  • Are familiar with diffusion models

 

This role is available throughout France. We're also open to Germany. Employees within a 100-kilometer radius of our Paris or Berlin office are expected to work from the office on three pre-scheduled “core days” each month to encourage cross-team connection and in-person collaboration. Aside from these office-specific “core days,” this job allows for virtual/remote, hybrid, and in-office workplace setting options. In addition to salary and equity, you will receive comprehensive healthcare, paid time off, and other benefits. Our recruiting team will provide a specific salary range based on location and years of experience.

[Please note that if your application is advanced, the initial step will be an invitation to partake in a pre-assessment.]

___________________

SoundHound AI strives to be a values-driven company that is supportive of one another, open and honest, undaunted by challenges, nimble and focused, and determined to excel and win. Diversity, equity, inclusion, and belonging are key to who we are as a company. With a mission to build Voice AI for the world, creating a team with global perspectives is critical to our success. Learn more about our philosophy, benefits, and culture at https://www.soundhound.com/careers. 

We care deeply about fostering an environment where everyone is supported and can do their best work. SoundHound ensures that individuals with disabilities are provided reasonable accommodations to participate in the interview process, perform essential job functions, and receive other employment benefits.

To view our job applicant privacy policy, please visit https://static.soundhound.com/corpus/ta/applicantprivacynotice.html.

Come join our growing team and bring your unique voice to our mission!

#LI-REMOTE

#LI-MR1

Apply for this job

*

indicates a required field

Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf

Select...
Select...
Select...

Education

Select...
Select...
Select...
Select...
Select...

DE&I Voluntary Survey Questions

At SoundHound, we believe in fostering an environment where a diversity of perspectives can thrive as we build the future of voice AI together. This core value is a pillar of our business and critical to our success. Your responses, if you choose to share them, will be used (in aggregate only) to help us identify areas of improvement in our process. Your responses will not be associated with your specific application and will not in any way be used in a hiring decision.

Select...
Select...
Select...