Build your own unit selection voice (2019-20 archive)

Archived version of 2019-20-specific material.

Click here to change the layout back to normal

Getting back on track with the assignment
Information about completing the assignment under the current circumstances, including revised expectations about what you will be able to achieve.
Revised assessment arrangements

In the light of an announcement from the School about changes to assessment, here is a definition of the assessment for Speech Synthesis this year, for both levels 10 and 11:
- The coursework is the single item of assessment. There will be no exam for level 10.
- ~~The coursework will receive a pass/fail mark but no grade.~~
- 2020-03-29 UPDATE: After initially announcing the change to pass/fail, the School was required by the University to revise its decision. The coursework will be graded on a simplified marking scale and feedback will be provided:
  - 2020-04-02 UPDATE: Level 10 – 35/45/55/65/75/85
  - 2020-04-02 UPDATE: Level 11 – 35/45/55/65/75/85
- The due date has been revised to Thursday 23rd April at 12 noon
- There is no change to the word limit or other requirements for the report.
The coursework is designed to support your learning, so we will be delighted to support those of you wishing to complete the coursework as originally designed, to a high standard, with a comprehensive and high-quality report. In return, we will provide comprehensive feedback on your work. Nevertheless, students facing external challenges may wish to scale back and submit a report that is adequate for a pass mark, by meeting the minimum expectations below.

2020-03-29 UPDATE: Please refer to guidance from the University and the School regarding the “no detriment” arrangements and the following course-specific advice for students needing to scale back and submit adequate work:

2020-03-29 UPDATE: Level 10 minimum expectations:
- A mark of 45 can be obtained by building one voice using your own recordings of ARCTIC A and performing a limited investigation of a small number of the design choices, with no listening test.
- A mark of 55 can be obtained by doing the above, plus one further voice using either your additional recordings, or the ‘slt’ ARCTIC A+B recordings and performing more investigation of the various design choices, with a very simple listening test (possibly with only yourself as listener).
- Students unable to complete either of the above should seek individual advice from the lecturer by email. You will be able to obtain a passing mark with a written report that focuses more on theory than practice.
2020-03-29 UPDATE: Level 11 minimum expectations:
- A mark of 55 can be obtained by building one voice using your own recordings of ARCTIC A, plus one further voice using either your additional recordings, or the ‘slt’ ARCTIC A+B recordings, and performing an investigation of a few design choices, with a very simple listening test (possibly with only yourself as listener).
- Students unable to complete the above should seek individual advice from the lecturer by email. You will be able to obtain a passing mark with a written report that focuses more on theory than practice.
Only you can decide what you wish to do, to achieve your personal goals.

What are your personal goals?

Some of you may wish to complete the course with the minimum amount of further practical work, perhaps because you have more important concerns at the moment. But others of you may wish to maximise your learning by attempting to complete the coursework as originally planned, possibly even going as far as making recordings away from Edinburgh.

Either of those positions, or anything in-between, is acceptable. There is a new forum to ask for help and clarification about what is achievable, or you can email the lecturer.

How good are your technical skills?

To do any further practical work, you need to get set up for remote working. This should be within the capabilities of most students, but if you cannot work remotely (e.g., you are on a very slow connection, have limited or no access to a suitable personal computer, do not have anywhere to work, or cannot get the software working) then you should seek individual advice from the lecturer by email.

How far have you got?

You will all be at different stages in the assignment, some ahead and some behind the milestones. I recommend the following options for completing the assignment, depending on which milestone you have reached. If you don’t fall into one of these categories, or have any doubts, please ask for help on the forum or by email to the lecturer.

Milestone C

You are behind schedule and should complete up to Milestone E, then follow the advice below, under “Milestone E”, but omit the following from the practical work:
- Omit the sub-task of Milestone E “Implemented your automatic script design algorithm, or manually created your additional script “
- Omit any other Milestone sub-tasks involving your own script and its recording
- Use footnotes to explain where you were unable to complete the practical work
and take the following approach to the written report:
- Address all sections of the structured marking scheme
- In cases where you have not completed the relevant practical work, provide a discussion of the issues instead. For example, for “Critical thinking – Data” you could discuss the importance of coverage and how it is typically achieved.
Milestone D

This is the milestone just before Flexible Learning Week and the strike. You are therefore on-track, but should now complete up to Milestone E, then follow the advice below, under “Milestone E”.

Milestone E

You are on-track but face the big decision of whether to attempt more recordings. Choose between “Default advice” and “Advice for those determined to record their own domain-specific script”.

Default advice

Do not make more recordings. Instead, reconfigure your hypotheses and experiments to work within the constraints of the data you have available, comprising your own ARCTIC A recordings and the ARCTIC A+B recordings of speaker ‘slt’.

If the voice built from your own ARCTIC A recordings sounds reasonable and is intelligible, then use it as far as possible, including in part or all of your listening test. However, if your voice is very poor quality (after checking nothing went badly wrong in voice building), then use voices built from the ‘slt’ ARCTIC A+B recordings, and especially in all parts of your listening test.

I recommend the following practical work:
- Use your own ARCTIC A recordings to explore the effect of design choices that don’t relate to quantity of data (e.g., pitch tracking settings), but fall back to using the ‘slt’ recordings if your own voice is very poor.
- Use the ‘slt’ ARCTIC A+B recordings to explore the effect of design choices that do relate to quantity of data (e.g., forced alignment).
- Complete the remaining milestones, omitting anything involving your domain script recordings
- Use voices built from either your own ARCTIC A or the ‘slt’ ARCTIC A+B recordings in the listening test. But do not attempt to compare across the two data sets: only make comparisons between voices built from the same speaker’s recordings.
- Run the listening test online (e.g., implemented in Qualtrics) and be flexible about listener demographics (e.g., do not insist on only using native speakers)
- Omit milestone “I – optional follow-on listening test”
and the following approach to the written report:
- Use your own ARCTIC A recordings to illustrate the voice building steps (the “Understanding” section of the structured marking scheme)
- Report your additional script design and use a footnote to explain that you were unable to a make recording of it (the “Critical thinking – Data” section of the structured marking scheme).
- Report listening test results for voices built from either your own ARCTIC A or the ‘slt’ ARCTIC A+B recordings. You will not be able to test any hypotheses related to domain, but there are plenty of other things to explore in a formal listening test, including some of the many other design choices.
Advice for those determined to record their own domain-specific script

This is entirely optional and only recommended for students who would be disappointed not to complete this aspect of the assignment. You should only proceed down this route if the voice built from your existing studio recordings is intelligible and sounds reasonably good.

It only makes sense to record your own domain script if you are also willing to re-record ARCTIC A, because they need to be closely matched, for two reasons: 1) comparing between a voice built only on ARCTIC A vs. a voice only built on your domain script will be unfair if the underlying recording quality is different; b) you can only combine the two sets of recordings in a single voice if they come from the same studio.

The SpeechRecorder tool used for making recordings is free, but is only available for Apple OS X. If you are on another operating system, making the recordings and dividing them into separate files would involve manual work and therefore cannot be recommended – sorry.

You will need a microphone to record with. A surprisingly good option is the microphone built in to wired Apple iPhone earbuds (the type with a 3.5mm 4-ring jack connector), which will plug in to a Mac’s headphone socket. Another option is a USB headset like you might use for Skype. The built-in microphone on a laptop is not ideal, because it’s not close-talking. If you’re in the market for a microphone for podcasting or videoconferencing, then the Blue Snowball iCE is excellent at that price point. Whatever you use, it needs to be used close to the mouth but not directly in front, just like in the recording studio.

Use a quiet place with lots of soft furnishings. Make a number of test recordings to perfect your setup and listen back carefully on headphones to check quality and recording level. You must minimise not only background noise but also reverberation.

You can now follow the advice under “Milestone F” below.

Milestone F or later

You are doing well and will be able to complete the assignment as originally planned, with the following modifications:
1. Run the listening test online (e.g., implemented in Qualtrics) and be flexible about listener demographics (e.g., do not insist on only using native speakers)
2. Omit milestone “I – optional follow-on listening test”
Remote working
You will use the lab computers remotely. This is easier to set up (and for us to support you) than installing Festival on your personal computer. You may still need to install some additional software.
Click here to change the layout back to normal
Log in
1. Getting started
  Install some software and learn how to log in to a lab computer.
  Setup your personal computer and install necessary software
  
  You will need a terminal to work in, ssh (secure shell) for logging in to a remote machine, scp and/or rsync for copying files between your personal computer and a remote machine, Praat and/or Wavesurfer for inspecting audio
  
  Apple OS X
  - Terminal, ssh, scp, rsync – built-in, nothing to install
  - Install Praat
  - Install Wavesurfer
  - Install a plain text editor such as Atom
  - Optionally, if you want to use ssh keys, then
    - Install Homebrew
    - sudo brew install ssh-copy-id
  Microsoft Windows
  - Terminal and ssh client – install Putty
  - scp client – install WinSCP
  - Install a plain text editor such as Atom
  - Install Praat
  - Install Wavesurfer
  Linux
  - terminal, ssh, scp, rsync – built-in, nothing to install
  - Install Praat
  - Install Wavesurfer
  Establish a connection to the University network
  
  Do one of the following
  - - Default choice: Connect to the University of Edinburgh VPN then open a terminal (Mac or linux) or Putty (Windows)
    - Alternative option: Open a terminal and ssh in to a DICE gateway machine (requires DICE account) such as student.ssh.inf.ed.ac.uk – this can be accessed without using the VPN, which may be faster when the VPN is in heavy use (experiment for yourself). You will be limited to workflows that either copy files to/from your personal computer, or work entirely on the lab computers.
  Tip from a Windows user: “set up the VPN connection as a private network rather than public / allow filesharing by changing security settings”.
  
  If using the VPN, note that all traffic from your computer to/from the internet now goes through the University network. That has several implications, such as
  - it’s more secure than a direct connection, if on unsecured WiFi
  - it will be slower than without the VPN
  - avoid heavy traffic over the VPN (e.g., watching YouTube while working) because you may slow it down for everyone
  - the University servers may log traffic, such as the names of any websites that you visit
  Choose a lab machine and log in
  
  Choose a server address from the ones here: https://resource.ppls.ed.ac.uk/whoson/atlab.php
  
  For this example, we’ll use ppls-atl-1079.ppls.ed.ac.uk, but you should choose a different one!
  
  If you’re on Mac or Linux, you can quickly test if that machine is responding before attempting to log in (if not, it’s probably booted into Windows and cannot be used):
```
$ ping ppls-atl-1079.ppls.ed.ac.uk
64 bytes from ppls-atl-1079.ppls.ed.ac.uk (129.215.204.197): icmp_seq=1 ttl=64 time=0.014 ms
64 bytes from ppls-atl-1079.ppls.ed.ac.uk (129.215.204.197): icmp_seq=2 ttl=64 time=0.022 ms
[use ctrl-C to terminate ping]
```
  Now attempt to log in. If using Putty on Windows, follow the instructions on how to start a session – it’s similar to the following, but using the Putty interface and not the command line.
```
$ ssh s1234567@ppls-atl-1079.ppls.ed.ac.uk

The authenticity of host 'ppls-atl-1079.ppls.ed.ac.uk (129.215.204.197)' can't be established.
ECDSA key fingerprint is SHA256:o5lK7XZUoJfF2IVanCfPbfguAxg6vYIoaoH1FjyTpr8.
Are you sure you want to continue connecting (yes/no)?
```
  type “yes” and hit Enter
```
Are you sure you want to continue connecting (yes/no)? yes

Warning: Permanently added 'ppls-atl-1079.ppls.ed.ac.uk,129.215.204.197' (ECDSA) to the list of known hosts.
s1234567@ppls-atl-1079.ppls.ed.ac.uk's password: 
Last login: Fri Oct 18 07:33:02 2019
/usr/bin/id: cannot find name for group ID 12345
$
```
  Your username is your UUN (e.g., s1234567) and your password is the same as you use in the lab (your Active Directory password). You are now logged in to a lab machine, which you can confirm like this:
```
$ hostname
ppls-atl-1079.ppls.ed.ac.uk
$
```
  If you can’t log in, try another random machine. If you wish, you can check and update computer status on this spreadsheet
  
  Optionally you can see if anyone else is using the machine, and if so whether it is busy. The machines are quite powerful, so there is no problem with multiple students using the same machine.
```
$ users
s1234567 s1234568 s1234569
$ top
[exit 'top' by pressing 'q']
```
2. Accessing and transferring files
  Access files from the lab on your personal computer, and transfer files back and forth.
  For all of these methods, your username is your UUN (e.g., s1234567) and your password is the same as you use in the lab (your Active Directory password).
  
  Option A: mount the remote filesystem
  
  To avoid copying files back and forth, you can access your lab home directory directly from your personal computer.
  - Advantages: direct access to files using your operating system’s graphical interface, in a terminal, or an editor, all running on your personal computer
  - Disadvantages: can be slow over the network, especially when browsing through folders containing many files or deeply-nested folders
  Start by connecting to the VPN. Then follow the instructions for your operating system:
  
  Apple OS X
  
  Open the Finder, press ⌘K then enter smb://fs1.ppls.ed.ac.uk, then navigate to your home directory.
  
  Microsoft Windows
  
  Type \\fs1.ppls.ed.ac.uk in the Windows Explorer address bar, then navigate to your home directory. Your username may need to be in the form s1234567@ed.ac.uk rather than just s1234567. This method has been reported as working for some, not for others. If it doesn’t work for you, use Option B instead, and try Option A again later.
  
  Linux
  
  There are several methods, but since you are smart enough to own a Linux machine, you surely know how to find out! Use the SMB protocol with the server mentioned above, fs1.ppls.ed.ac.uk.
  
  Option B: copy files to/from the remote filesystem
  
  If the above method isn’t working for you, or is very slow, then you can instead copy files between the lab computers and your personal computer (e.g., audio saved from Festival that you wish to listen to).
  - Advantages: once files are copied to your personal computer, access is instant and doesn’t use the network
  - Disadvantages: less convenient than Option A
  Start by connecting to the VPN. Then follow the instructions for your operating system:
  
  Apple OS X / Linux
  
  For example, you have created a directory called output in your ss directory, into which you have saved output from Festival. Instead of using scp to copy files, we’ll use rsync which is smart about only copying changed or new files. Do the following on your personal computer (not on the remote lab computer):
```
$ rsync -avul s1234567@scp1.ppls.ed.ac.uk:/home/s1234567/Documents/ss/output ~/somewhere/on/your/personal/computer
```
  Another example, this time you want a complete copy of your Documents folder on your personal computer:
```
$ mkdir ~/somewhere/you/choose/Documents/
$ rsync -avul s1234567@scp1.ppls.ed.ac.uk:/home/s1234567/Documents/ ~/somewhere/you/choose/Documents/
```
  The trailing “/” in rsync are meaningful when working with folders – don’t omit them. Imagine we now changed a file and wanted to send it back to the lab. It’s a bit risky synchronising your entire Documents back, so let’s just do part of it:
```
$ rsync -avul ~/somewhere/you/choose/Documents/ss/ s1234567@scp1.ppls.ed.ac.uk:/home/s1234567/Documents/ss/ 
```
  It is possible to run rsync without having to enter your password every time, if you set up ssh keys, which also allow you to log in without a password too.
  
  Importantly, ssh keys only work for lab computers (e.g., ppls-atl-0020.ppls.ed.ac.uk) and do not currently work for scp1.ppls.ed.ac.uk. So, once you know a lab machine is running linux (e.g., you are logged in to it!) you can also rsync to it
```
$ rsync -avul s1234567@ppls-atl-0020.ppls.ed.ac.uk:/home/s1234567/Documents/ss/output ~/somewhere/on/your/personal/computer
```
  Microsoft Windows
  
  On Windows, you will need to use an scp client, such as WinSCP. The server name is scp1.ppls.ed.ac.uk . Using keys (as described above) is also possible on Windows with Putty and WinSCP.
3. Using Festival
  Small changes needed when working with Festival on a remote machine, including how to play audio.
  
  Normally, Festival will try to play audio. When you are logged in remotely, it cannot do this and may actually crash when it is denied access to the audio device. You need to adjust the way audio is handled like this:
```
festival> (Parameter.set 'Audio_Method 'Audio_Command)
festival> (Parameter.set 'Audio_Required_Rate 16000)
festival> (Parameter.set 'Audio_Required_Format 'riff)
festival> (Parameter.set 'Audio_Command "cp $FILE ~/Documents/output.wav")
festival> (set! myutt (SayText "Example Sentence"))                
```
  after which you will find the file output.wav in your Documents folder. You can access this file from your personal computer in order to listen to it. (Mac tip: select the file in the Finder and use the space bar to play it; no need to open any application.)
  
  You may prefer to save the file somewhere other than your Documents folder. It may be best to make a special folder just for this purpose, which contains as few files as possible. Browsing folders with many files and sub-folders using your operating systems’ graphical interface can be slow over the network.
4. Workflows
  Recommended ways of working, including different options for editing files on the lab computers.
  Choose one of the following workflows:
  
  Recommended workflow: Use the lab computers for all computation; Edit files over the network on your personal computer
  
  All files remain only on the lab computers. You directly work with them over the network, including such things as opening them in an editor, making changes, and saving. With a good network connection to the lab, this is the simplest option and feels most similar to sitting in the lab.
  
  You will need to mount the remote filesystem (“Option A: mount the remote filesystem”) for this to be possible.
  
  Use a plain text editor (e.g., Atom) running on your local computer to edit files, and run all scripts and programs in a terminal logged in to a lab computer.
  
  Alternative workflow: Use the lab computers for all computation; Edit copies of files on your personal computer
  
  The previous method might not work if your connection is slow, drops, or hangs frequently. In this case, you’ll need to copy files to your personal computer for editing, then copy back to the lab computers before running programs on them.
  
  Here’s an example of how you would work in this way
  1. Use rsync to copy the necessary files or folders from the lab to your personal machine – I suggest doing that for an entire ss folder or even your entire Documents folder
  2. edit the local copy of a file (i.e., the one on your personal computer)
  3. rsync back to the lab computers
  4. run the scripts or program in a terminal logged in to a lab computer
  You only need to do step 1 once, then iterate steps 2+3+4 as you work.
  
  Do not combine “Recommended workflow” and “Alternative workflow” – you risk editing the wrong version of a file and generally getting confused.
  
  Advanced workflow: Use the lab computers only for voice building and Festival
  
  If you prefer to perform tasks such as running Python code (e.g., your script selection algorithm, or to analyse diphone coverage, etc) and shell scripts on your own computer, you might need to install additional software. If you want to do this, but don’t know how, ask on the forums.
  
  Old-school workflow: Work entirely on the lab computers; Listen to audio on your personal computer
  
  It is possible to work almost entirely in a terminal, including editing files (e.g., using nano). Those of you who choose this option probably will already know how to do it. If you want more details and suggestions, please ask.
  
  You will still need to either mount the remote filesystem or copy files, in order to listen to audio on your personal computer.
5. Getting help
  How to access help from the lecturer and tutors, for configuring remote working, then for the assignment itself.
  Virtual AT 4.02 PPLS computing lab
  
  In the “Labs & Recording Studio” section in Learn, you will find a link to a Blackboard Collaborate session. This session is always open, so you can use it to interact with other students at any time. You all have the role “presenter” to allow screen sharing etc.
  
  Tips for using the virtual computing lab
  - Chrome and Firefox are the recommended browsers (but Safari on iOS)
  - Upload a profile image and use your full name – it really helps others to see who is present
  - If there are more than 4 people in the room, mute your microphone (to reduce background noise) and optionally turn off video (to reduce bandwidth)
  - Use the ‘raise hand’ and chat features to get attention
  - You can use a shared whiteboard and also share your screen, or a single application (e.g., your terminal)
  Forums
  
  There is a new forum for help with remote working.
  
  Live help
  
  It is unclear whether live sessions at fixed times would be helpful, but we will try them to find out. See the weekly schedule for the schedule.
  
  With students across a wide range of timezones, access to live sessions may be tricky. Please use the forums to tell us what you find most helpful.
6. Advanced techniques
  Optional advanced techniques to work more quickly and efficiently. If you found the previous sections technically challenging, then don't attempt this part - it is not essential.
  Click here to change the layout back to normal
  Log in
  1. Avoiding a password
    How to avoid the need to enter your password every time you log in or rsync files.
    You will generate a pair of keys and place the public key on the remote machine, keeping the private key safe on your personal computer. Then, when you attempt to log in, the private and public keys are automatically compared and you are logged in without a password.
    
    You only need to set this up once.
    
    Apple OS X / Linux
    
    Generate a pair of keys on your personal computer
    
    $ ssh-keygen -t rsa
    
    accept the default location to save the keys by pressing Enter at the prompt
    
    Enter file in which to save the key (/Users/s1234567/.ssh/id_rsa): [press Enter]
    
    For this assignment, an empty passphrase is fine (you could then delete the public key from the lab computers when you’ve finished the assignment). Optionally, you can provide a passphrase of your choice, and make a note of it somewhere. You would have to enter this each time you use the keys, which is less convenient but more secure than an empty passphrase.
    
    Now we need to copy the public key to the lab computers. Connect to the VPN, then do
    
    $ ssh-copy-id s1234567@ppls-atl-0020.ppls.ed.ac.uk
    
    Note for Mac users: if you don’t want to install Homebrew just to get ssh-copy-id then you can manually install your public key on the lab computers by following that section of the Windows instructions below.
    
    Once you have ssh keys set up, you should find that both ssh and rsync now work for any lab computer (e.g., ppls-atl-0020.ppls.ed.ac.uk) without a password. The same pair of keys, created using the instructions above will work for all lab machines. You do not need to repeat this if you change to a different machine (e.g., ppls-atl-0021.ppls.ed.ac.uk).
    
    Currently, ssh keys do not work for rsync to scp1.ppls.ed.ac.uk so you will have to rsync to a lab machine, such as ppls-atl-0020.ppls.ed.ac.uk . Note that scp1.ppls.ed.ac.uk is always available (which is why we use that server when mounting the filesystem) whilst individual lab machines are only available if booted into Linux.
    
    Microsoft Windows
    
    This tutorial looks to be correct, or follow the instructions below.
    
    Read the instructions above to understand the concept of public/private keys. Then use the PuTTYgen utility (which was installed when you installed Putty) to generate keys.
    
    You now need to install your public key on the lab computers. Here’s one way to do that by logging in to any lab computer and doing this on the lab computer:
    
    $ cd $ mkdir .ssh $ cd .ssh $ touch authorized_keys $ nano authorized_keys $ chmod 600 authorized_keys
    
    now find your public key in PuTTYgen and copy it to the clipboard, paste it into nano (after any existing content), then exit the nano editor with control-x, answering “Y” to the prompt about saving the file.
  2. (Re)connecting
    How to create - and re-connect to - a persistent session that continues running on a lab computer even if the VPN drops, or your personal computer goes to sleep or is rebooted.
    under construction
    
    screen
  under construction…

Build your own unit selection voice (2019-20 archive)

Revised assessment arrangements

What are your personal goals?

How good are your technical skills?

How far have you got?

Milestone C

Milestone D

Milestone E

Milestone F or later

Setup your personal computer and install necessary software

Apple OS X

Microsoft Windows

Linux

Establish a connection to the University network

Choose a lab machine and log in

Option A: mount the remote filesystem

Apple OS X

Microsoft Windows

Linux

Option B: copy files to/from the remote filesystem

Apple OS X / Linux

Microsoft Windows

Recommended workflow: Use the lab computers for all computation; Edit files over the network on your personal computer

Alternative workflow: Use the lab computers for all computation; Edit copies of files on your personal computer

Advanced workflow: Use the lab computers only for voice building and Festival

Old-school workflow: Work entirely on the lab computers; Listen to audio on your personal computer

Virtual AT 4.02 PPLS computing lab

Forums

Live help

Apple OS X / Linux

Microsoft Windows