Introduction

An overview of the complete process and some tips for success.

In this practical exercise, you’re going to build a neural text-to-speech synthesiser using recordings of your own voice.

Before starting, be a proper engineer and

keep a logbook to record every single step

You’ll find this invaluable if you need to repeat any steps, and your notes will also be useful for writing up a lab report at the end.

To build your synthetic voice, you will follow step-by-step instructions and use a variety of existing tools. Currently, we only support the University of Edinburgh “Eddie” compute cluster, because some steps require GPUs.

Here are the main stages in this exercise:

Get access to the necessary computing facility and set-up your environment
Learn how to train the model, using some pre-existing data
Create your own data
- Select or design a recording script
- Make the recordings in the studio
- Prepare the data for training the model
Train the model on your own data
Evaluate the model(s) you have trained
Write up.

Read all the way through the instructions before you start!

Related forums

- Forum
- Topics
- Posts
- Last Post
- Speech Synthesis – Assignment
  Please post general or theory type questions in the public forums under the category "Speech Synthesis". The forums here are for questions specific to the practical assignment.
- -
- -
- 6 days, 5 hours ago