Combined ASR-TTS systems and prosody prediction
Has any research been done on transferring (or using as context) recognised prosody in combined ASR-TTS systems?
I’m thinking specifically about assistive voice technology, where an ASR component could extract prosodic features from a conversation partner and use them to improve prosody prediction in the synthesis component.
I guess this could also apply to voice assistants. I had a look but couldn’t find anything related to this topic.
I found this one paper from Interspeech 2020, which might be what I’m looking for: https://arxiv.org/abs/2009.01475