"for "pau"" features in HTS labels

This topic has 2 replies, 2 voices, and was last updated 9 years, 8 months ago by Simon King.

Viewing 2 reply threads

Author

Posts
- June 13, 2016 at 11:34 #3258
  Etienne D
  Student
  Hello,
  I am currently going through all the input labels, working on the temporary files created by utts_to_mlf.sh that serve as a basis for the awk scripts that convert it to monophones and full-context labels:
  # oo th 0 0 1 0 0 1 0 . . . 0 2 0 7 0 5 # @ 1.0700001 1.258
  
  I’m using the scheme script label.feats as a reference for the meaning of each feature, however there is a series of features (49 to 62) that I do not seem to understand:
  the script simply says “for “pau””, and each feature seems to give information about whether the previous/next syllable/word/phrase is stressed/accented, or give its length/part-of-speech…
  Some of the features seem to have almost the same values as previous (but identically named) features, but not quite. They Additionally, the pauses in HTS labels are marked ‘#’, not ‘pau’ (whether that matters or not).
  I have searched in quite a few of the scripts and the HTS website, but could not find relevant information.
  Could you please clarify what these “for “pau”” features stand for?
  
  Thank you.
- July 6, 2016 at 19:07 #3322
  Etienne D
  Student
  Hi,
  
  I went as far as possible with many other things, but I still need to know about those label features if I am to build a full front-end.
  Does anyone have any further information about how the values of these features are defined?
  
  Thanks.
- July 12, 2016 at 15:07 #3328
  Simon King
  Professor
  These features are apparently especially for short pauses (we think) but their origin is somewhat lost in time. I suspect you may be able to omit them with only a little degradation (try it and see for yourself though).
Author

Posts

Viewing 2 reply threads

You must be logged in to reply to this topic.

"for "pau"" features in HTS labels

Search the forums

Note

Latest Activity

Search the forums

Speech Synthesis