- This topic has 1 reply, 1 voice, and was last updated 5 years, 2 months ago by .
Viewing 1 reply thread
Viewing 1 reply thread
- You must be logged in to reply to this topic.
› Forums › General questions › Combined ASR-TTS systems and prosody prediction
Has any research been done on transferring (or using as context) recognised prosody in combined ASR-TTS systems?
I’m thinking specifically about assistive voice technology, where an ASR component could extract prosodic features from a conversation partner and use them to improve prosody prediction in the synthesis component.
I guess this could also apply to voice assistants. I had a look but couldn’t find anything related to this topic
I found this one paper from Interspeech 2020 which might be what I’m looking for https://arxiv.org/abs/2009.01475
Some forums are only available if you are logged in. Searching will only return results from those forums if you log in.
This is the new version. Still under construction.Copyright © 2026 · Balance Child Theme on Genesis Framework · WordPress · Log in