- This topic has 1 reply, 2 voices, and was last updated 4 years, 8 months ago by .
Viewing 1 reply thread
Viewing 1 reply thread
- You must be logged in to reply to this topic.
› Forums › Speech Synthesis › Unit selection › Why is a smaller inventory good?
Hi,
I was looking at the slides for the module 3, 4 and 5 videos. On page 60, it said that an advantage of having a smaller inventory of units is that there would be a ‘smaller set of possible unit type sequences for any given utterance to be synthesised (possibly a unique sequence; e.g., phonemes, diphones)’. Could you please explain why this is desired?
Thanks
This slide is talking about the most basic form waveform concatenation synthesis in which we store only one example of each unit type:
inventory = the set of stored waveform units
But similar arguments apply to unit selection:
inventory = the set of unique unit types
database = the stored waveforms (usually complete natural utterances) from which units are selected; multiple instances of each unit type are available
Unit selection involves search all possible candidate unit sequences to find the best-sounding sequence. Even using dynamic programming, this will involve significant computation.
As the database of speech to draw candidates from increases in size, the number of available candidates increases in proportion, but the number of possible sequences increases exponentially.
Some forums are only available if you are logged in. Searching will only return results from those forums if you log in.
Copyright © 2024 · Balance Child Theme on Genesis Framework · WordPress · Log in