Combined ASR-TTS systems and prosody prediction

This topic has 1 reply, 1 voice, and was last updated 5 years, 2 months ago by Evelyn W.

Author

Posts
- December 10, 2020 at 22:55 #13533
  Evelyn W
  Student
  Has any research been done on transferring (or using as context) recognised prosody in combined ASR-TTS systems?
  
  I’m thinking specifically about assistive voice technology, where an ASR component could extract prosodic features from a conversation partner and use them to improve prosody prediction in the synthesis component.
  
  I guess this could also apply to voice assistants. I had a look but couldn’t find anything related to this topic.
- December 11, 2020 at 11:31 #13534
  Evelyn W
  Student
  I found a paper from Interspeech 2020 that might be what I’m looking for: https://arxiv.org/abs/2009.01475