letter to sound alignment

letter to sound alignment – question 2

This topic has 1 reply, 2 voices, and was last updated 8 years, 12 months ago by Simon.

Viewing 1 reply thread

Author

Posts
- June 2, 2016 at 16:50 #3217
  Norbert G
  Student
  hello,
  
  I managed to do the alignment by implementing the ideas mentioned in the previous question. However, to build a voice I need utterance structures which in this case do not seem feasible, as no linguistic information is used.
  Is there anyway I can skip this step when building a DNN voice?
  
  thanks,
  Norbert
- June 5, 2016 at 10:39 #3223
  Simon
  Professor
  You don’t need utterance structures for the very simple case that you are trying at this point (treat letters as phonemes, and use no other linguistic information). To build a voice, you simply need to figure out how to create the input features for training the DNN. You need to use the Prepare the input labels steps of the DNN voice building exercise as your starting point, but replace some steps with your own scripts.
  
  For example, you do not need the step “Convert utterance structures to full context labels” – you need to create these full context labels using your own script (I suggest starting with a “full context” of triphones or quinphones).
  
  The “Convert label files to numerical values” will be essentially the same, but you’ll need to modify the questions so that they correctly query your labels.
  
  It’s well worth doing all of this with your own scripts (they are quite simple) because this will give you a deeper understanding of all the steps involved. Then, you could switch to the Ossian framework, which will automate some of this for you.
Author

Posts

Viewing 1 reply thread

You must be logged in to reply to this topic.

letter to sound alignment – question 2

Search the forums

Note

Latest Activity

Search the forums

Speech Synthesis