boundary of diphone
According to the video, we should choose the mid-phone point as the boundary. I’m wondering whether every diphone contains the whole consonant waveform plus half of the vowel waveform. For example, in "kak", is the first diphone (k)+(half a) or (half k)+(half a)? Logically, I think it should be (whole k)+(half a), since a stop consists of silence first and then a burst with some aspiration. So how does the half-cut (mid-phone point) apply in that case?
We have joins in consonants too. So in the example /k ae t/, the diphones would be
sil_k k_ae ae_t t_sil
where sil is “silence” and is just another phoneme.
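To make the pairing concrete, here is a minimal sketch of how a phone sequence maps to diphone names. The function name `phones_to_diphones` and the `silence` parameter are made up for illustration and are not taken from any particular toolkit; the only thing assumed from the reply above is that the utterance is padded with sil on both sides.

```python
def phones_to_diphones(phones, silence="sil"):
    """Return diphone names for a phone sequence, padded with silence."""
    # Pad with silence at both ends, since sil is treated as just another phoneme.
    padded = [silence] + list(phones) + [silence]
    # Each diphone spans one pair of adjacent phones.
    return [f"{left}_{right}" for left, right in zip(padded, padded[1:])]


print(phones_to_diphones(["k", "ae", "t"]))
# ['sil_k', 'k_ae', 'ae_t', 't_sil']
```

Note that every phone, consonants included, appears in two diphones: once as the right half and once as the left half.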
You correctly spot that we might not want to place the cut point at exactly the centre (50% point) in all cases. In the case of stops, we will make the join in the closure portion.
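As a rough sketch of that rule: use the 50% point by default, but for a stop place the cut inside the closure. Everything here is an assumption for illustration: the `STOPS` set, the `burst_time` argument (a hypothetical label marking where the closure ends and the burst begins), and the choice of the middle of the closure as the exact cut position. The reply only says the join goes somewhere in the closure portion.

```python
# Hypothetical set of stop phones; the real set depends on the phone inventory.
STOPS = {"p", "b", "t", "d", "k", "g"}


def cut_point(phone, start, end, burst_time=None):
    """Return the time (in seconds) at which to cut a phone for diphone joins."""
    if phone in STOPS and burst_time is not None:
        # Assumption: cut in the middle of the closure, i.e. between the
        # start of the stop and the start of its burst.
        return start + 0.5 * (burst_time - start)
    # Default: the mid-phone (50%) point.
    return start + 0.5 * (end - start)


# A vowel from 0.0 s to 0.5 s is cut at its centre.
print(cut_point("ae", 0.0, 0.5))                   # 0.25
# A /k/ from 0.0 s to 0.5 s with the burst at 0.25 s is cut inside the closure.
print(cut_point("k", 0.0, 0.5, burst_time=0.25))   # 0.125
```

With a cut in the closure, the burst and aspiration of the stop stay together in the diphone that follows, so the join falls in a low-energy region.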