An overview of the complete process and some tips for success.
In this practical exercise, you’re going to build a neural text-to-speech synthesiser using recordings of your own voice.
Before starting, be a proper engineer and
- keep a logbook to record every single step
You’ll find this invaluable if you need to repeat any steps, and your notes will also be useful for writing up a lab report at the end.
To build your synthetic voice, you will follow step-by-step instructions and use a variety of existing tools. Currently, we only support the University of Edinburgh “Eddie” compute cluster, because some steps require GPUs.
Here are the main stages in this exercise:
- Get access to the necessary computing facility and set-up your environment
- Learn how to train the model, using some pre-existing data
- Create your own data
- Select or design a recording script
- Make the recordings in the studio
- Prepare the data for training the model
- Train the model on your own data
- Evaluate the model(s) you have trained
- Write up.
Read all the way through the instructions before you start!
Related forums
-
- Forum
- Topics
- Last Post
-
-
Speech Synthesis – Assignment
Please post general or theory type questions in the public forums under the category "Speech Synthesis". The forums here are for questions specific to the practical assignment.
- -
- 6 days, 5 hours ago
-
Speech Synthesis – Assignment