Record your own speech data

The recorded speech data comprises text-speech pairs from which we will train a model. The model will therefore be influenced by both the content (e.g., words, phonetic coverage) and speaking style.
  • The recording script

    You will record two sets of speech data. The first is "neutral read-text" and the second will be of your own design.

  • Skills: recording speech in the studio

    With our carefully chosen script, we now need to go into the recording studio and ask our voice talent to record it. Consistency is the key here, especially when the recording is done over multiple sessions.

  • Prepare the recordings

    Move your recordings into the workspace, convert the waveforms to the right format, and do some sanity checking.