Diphone information from Festival
I am unsure about how to get information about the diphones and their context from the front end of Festival. Are there a few commands I could use to get this information for a given sentence to be used in the text selection algorithm?
What you need to do is pass your text through Festival’s front end, to obtain the pronunciation, from which you can easily determine the diphones.
You are already doing that as part of building the voice, in the step where you do forced alignment: look at the "Creating the initial labels" step of "Time-align the labels".
See also this topic for other ways to do this.
It’s important that, whichever method you use, you load the same phone set and pronunciation dictionary that your final voice will use (e.g., don’t use CMUdict).
Some of the posts in this topic may also be helpful.
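Once you have the pronunciation, going from phones to diphones is just a matter of pairing each phone with the next one. Here is a minimal sketch of that step in Python. It assumes you have already got the phone sequence for a sentence out of Festival's front end as a plain list of strings; the function name `phones_to_diphones`, the silence symbol `pau`, the `a_b` naming of diphones, and the example phone symbols are all assumptions for illustration, so substitute whatever your voice's phone set actually uses.

```python
from collections import Counter


def phones_to_diphones(phones, silence="pau"):
    """Return the diphones for one utterance's phone sequence.

    `phones` is assumed to be a list of phone names taken from the
    front end's output. `silence` is an assumed silence symbol.
    """
    padded = list(phones)
    # Make sure the utterance starts and ends with silence, so that the
    # silence-to-phone and phone-to-silence diphones are included.
    if not padded or padded[0] != silence:
        padded.insert(0, silence)
    if padded[-1] != silence:
        padded.append(silence)
    # Pair every phone with its right-hand neighbour.
    return [f"{left}_{right}" for left, right in zip(padded, padded[1:])]


# Hypothetical phone sequence for one short sentence (symbols are illustrative).
phones = ["pau", "h", "@", "l", "ou", "pau"]
diphones = phones_to_diphones(phones)

# Count diphone types, e.g. as input to a text selection algorithm.
diphone_counts = Counter(diphones)
print(diphones)
print(diphone_counts)
```

Running this over every candidate sentence gives you per-sentence diphone lists or counts, which is the kind of information a text selection algorithm needs when scoring sentences by the diphones they would add.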