Out-of-dictionary words
I have been running Festival’s script to check for out-of-dictionary words in the corpus I chose. It is still running after about six hours and seems to have found hundreds of out-of-dictionary words.
What should I do in this situation? Should I still use this corpus or try to find another one?
How large is your corpus?
What is the goal of finding the out-of-dictionary words?
If you wish to exclude all sentences that contain such a word, then you’ll have to find them all – you could do this using the provided Festival script (which might be slow for a very large corpus) or some other way (by writing your own code).
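If you go the "write your own code" route, something like the following might be a starting point. It is only a rough sketch, not the Festival script: it assumes you have already exported the dictionary's headwords into a Python set (how you do that depends on your lexicon format), and the names `tokenize` and `split_by_coverage` are made up for illustration. The tokenizer is deliberately naive (lowercase letters and apostrophes only).

```python
import re
from collections import Counter

# Naive tokenizer (an assumption): lowercase alphabetic words, apostrophes allowed.
WORD_RE = re.compile(r"[a-z']+")


def tokenize(sentence):
    """Split a sentence into lowercase word tokens."""
    return WORD_RE.findall(sentence.lower())


def split_by_coverage(sentences, vocabulary):
    """Return (sentences fully covered by the vocabulary, counts of OOV words).

    `vocabulary` is assumed to be a set of lowercase dictionary headwords.
    """
    covered = []
    oov_counts = Counter()
    for sentence in sentences:
        missing = [w for w in tokenize(sentence) if w not in vocabulary]
        if missing:
            oov_counts.update(missing)  # remember which words were missing
        else:
            covered.append(sentence)    # every word is in the dictionary
    return covered, oov_counts


if __name__ == "__main__":
    # Toy data, purely illustrative.
    vocabulary = {"the", "cat", "sat", "on", "mat"}
    sentences = [
        "The cat sat on the mat.",
        "The zyzzyva sat on the mat.",
    ]
    covered, oov = split_by_coverage(sentences, vocabulary)
    print(covered)
    print(oov.most_common())
```

Because each word is a single set lookup, this makes one pass over the corpus. Bear in mind that Festival normalises text before lookup (numbers, abbreviations and so on), so a simple tokenizer like this one will not agree with Festival on every sentence.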
But if your aim is to identify all the words you might need to add to the dictionary, then you are not expected to do that for the large source corpus. During the text selection phase, you will probably need to rely on letter-to-sound rules to provide pronunciations for words that are missing from the dictionary.
You should only manually write pronunciations for a modest number of words appearing in your (much smaller) recording script.