Finish

You now have a complete picture of unit selection speech synthesis. The synthetic speech is created by concatenating pre-recorded waveform fragments. These fragments are found from a database of natural speech.

The quality of the synthetic speech is heavily reliant on two things, each of which is covered in more detail in the next part of the course:

  1. The target cost function
  2. The database of recorded natural speech