Weekly schedule

The weekly schedule shows which module we are covering each week. Timetabled ('synchronous') classes for modules 6 to 9 are also listed here, but tutorials are not. See the calendar in Teams to find (and join) classes and tutorials.

Each module leads you through the material in a specific order: simply follow the tabs from left to right. But do also explore the entire course content on your own, look ahead to what is coming later, and browse the rest of this website including the forums.

This course operates a policy of continuous improvement. You will find some content is not yet ready. Don’t worry, it will be ready in time for you to study it This year, we are making the following improvements:

New PHON videos.
New SIGNALS videos.
New TTS videos.
(The ASR videos will remain the same as last year.)
New Jupyter notebooks, focusing on PHON and SIGNALS.
More quizzes.

We have also made the following changes in the way the course is delivered:

Replace the weekly whole-class lecture and large-group computer lab with small-group tutorials.
For each tutorial group: two 50 minute tutorials per week.
A more focussed topic for each tutorial, with specific tasks for you to complete in advance (individual, pair or group work).

In response to mid-course feedback, we have additionally:

Added overview video(s) for modules 6 to 9. You should watch these before the live class.
Scheduled a live class every Tuesday 11.10 to 13.00 on Teams to go over that video in an interactive fashion.

The readings are specified in each module and are categorised as

Essential (read all, aiming to complete within the same week as the module)
Recommended (read as many as you can, catching up when you have spare time)
Extra (only read if you’re interested; some readings may be challenging; most are beyond the scope of the course)

Thursday, 19 September 2024

Speech Processing Lecture 1: Introduction to Speech Processing / Speech Production

Thursday, 19 September 2024 09:00 - 11:00

In the first lecture, we'll give an overview of the course and then make a start on understanding the relationship between speech and text through articulatory phonetics.

Before this class: complete Module 1

Lecturer: Catherine Lai

Wednesday, 25 September 2024

Speech Processing Lab - Group 1

Wednesday, 25 September 2024 09:00 - 10:50

Praat basics and phonetics exercises  . Before this lab: read Module 1 - Phonetics and Representations of Speech - Lab   Staff: Catherine Lai, Atli Sigurgeirsson
Speech Processing Lab - Group 2

Wednesday, 25 September 2024 16:10 - 18:00

Praat basics and phonetics exercises   Before this lab: read Module 1 - Phonetics and Representations of Speech - Lab   Staff: Catherine Lai, Ariadna Sanchez, Atli Sigurgeirsson

Thursday, 26 September 2024

Speech Processing Lecture 2: Acoustic phonetics

Thursday, 26 September 2024 09:00 - 10:50

Before this class: watch module 2 videos and complete essential readings   Lecturer: Catherine Lai

Wednesday, 2 October 2024

Speech Processing Lab - Group 1

Wednesday, 2 October 2024 09:00 - 10:50

Acoustic phonetics, speech segmentation   and variability Before this lab: go through Module 2 - Acoustics phonetics - Lab   Staff: Catherine Lai, Atli Sigurgeirsson
Speech Processing Lab - Group 2

Wednesday, 2 October 2024 16:10 - 18:00

Acoustic phonetics, speech segmentation   and variability Before this lab: go through Module 2 - Acoustics phonetics - Lab Staff: Catherine Lai, Ariadna Sanchez

Thursday, 3 October 2024

Speech Processing Lecture 3: Digital Speech Signals

Thursday, 3 October 2024 09:00 - 10:50

What are spectrograms really? An introduction to Digital Signal Processing and the Discrete Fourier Transform

Before this class: complete Module 3

Lecturer: Catherine Lai

Wednesday, 9 October 2024

Speech Processing Lab - Group 1

Wednesday, 9 October 2024 09:00 - 10:50

Interpreting the DFT and issues with digital signals (Jupyter Notebooks)   Before this lab: read Module 3 - Digital Speech Signals - Lab Staff: Catherine Lai, Atli Sigurgeirsson
Speech Processing Lab - Group 2

Wednesday, 9 October 2024 16:10 - 18:00

Interpreting the DFT and issues with digital signals (Jupyter Notebooks)   Before this lab: read Module 3 - Digital Speech Signals - Lab   Staff: Catherine Lai, Atli Sigurgeirsson

Thursday, 10 October 2024

Speech Processing Lecture 4: The Source-Filter Model

Thursday, 10 October 2024 09:00 - 10:50

Building on our understanding of the digital signal processing, we look at source-filter model from more of an engineering perspective

Before this class: complete Module 4

Lecturer: Catherine Lai

Wednesday, 16 October 2024

Speech Processing Lab - Group 1

Wednesday, 16 October 2024 09:00 - 10:50

Exploring the source-filter model (Jupyter notebooks).    Before this lab: read Module 4 - the Source-Filter Model - Lab   Staff: Catherine Lai, Ariadna Sanchez  (If there is time left over, you can start playing with Festival.)
Speech Processing Lab - Group 2

Wednesday, 16 October 2024 16:10 - 18:00

Exploring the source-filter model (Jupyter notebooks).   Before this lab: read Module 4 - the Source-Filter Model - Lab   Staff: Catherine Lai, Ariadna Sanchez   (If there is time left over, you can start playing with Festival.)

Thursday, 17 October 2024

Speech Processing Lecture 5: Speech Synthesis - Phonemes and the Front-End

Thursday, 17 October 2024 09:00 - 10:50

Pronunciation, including letter-to-sound models, and predicting prosody. All these tasks can be done with Classification And Regression Trees (CARTs).

Before this class: complete Module 5

Lecturer: Catherine Lai

Wednesday, 23 October 2024

Speech Processing Lab - Group 1

Wednesday, 23 October 2024 09:00 - 10:50

Start TTS assignment   Before this lab: read Module 5 - speech synthesis – phonemes and the front end - Lab    Work through the practical exercises “Getting Started” and “Step-by-step” in the assignment specification https://speech.zone/exercises/the-festival-text-to-speech-system/   Staff: Simon King, Atli Sigurgeirsson, Opeyemi Okasuade
Speech Processing Lab - Group 2

Wednesday, 23 October 2024 16:10 - 18:00

Start TTS assignment  

Before this lab: read Module 5 - speech synthesis – phonemes and the front end - Lab

  Work through the practical exercises “Getting Started” and “Step-by-step” in the assignment specification https://speech.zone/exercises/the-festival-text-to-speech-system/

  Staff: Simon King, Ariadna Sanchez, Opeyemi Osakuade

Thursday, 24 October 2024

Speech Processing Lecture 6: Speech Synthesis - Waveform Generation

Thursday, 24 October 2024 09:00 - 10:50

Manipulating recorded speech signals to create new utterances.

Before this class: complete Module 6

Lecturer: Catherine Lai

Wednesday, 30 October 2024

Speech Processing Lab - Group 1

Wednesday, 30 October 2024 09:00 - 10:50

Continue TTS assignment   Before this lab: read Module 6 - Speech Synthesis – waveform generation and connected speech Lab, and bring a writing sample with you to the lab   Staff: Simon King, Atli Sigurgeirsson, Opeyemi Okasuade
Speech Processing Lab - Group 2

Wednesday, 30 October 2024 16:10 - 18:00

Continue TTS assignment   Before this lab: read Module 6 - Speech Synthesis – waveform generation and connected speech Lab, and bring a writing sample with you to the lab   Staff: Simon King, Ariadna Sanchez, Opeyemi Osakuade

Wednesday, 6 November 2024

Speech Processing Lab - Group 1

Wednesday, 6 November 2024 09:00 - 10:50

Command line and shell scripting

Before this lab: Work through the material in the Intermission module then come to the lab to get help.

Staff: Atli Sigurgeirsson, Simon King
Speech Processing Lab - Group 2

Wednesday, 6 November 2024 16:10 - 18:00

Command line and shell scripting

Before this lab: Work through the material in the Intermission module then come to the lab to get help.

Staff: Simon King, Ariadna Sanchez

Thursday, 7 November 2024

Speech Processing Lecture 7: Speech Recognition - Pattern Recognition

Thursday, 7 November 2024 09:00 - 10:50

We now start on an introduction to Automatic Speech Recognition, starting with the concept of pattern recognition.

Before this class: complete Module 7

Lecturer: Catherine Lai

Wednesday, 13 November 2024

Speech Processing Lab - Group 1

Wednesday, 13 November 2024 09:00 - 10:50

Start ASR assignment   Before this lab: read Module 7 - Speech Recognition – Pattern matching - Lab   Staff: Simon King, Atli Sigurgeirsson
Speech Processing Lab - Group 2

Wednesday, 13 November 2024 16:10 - 18:00

Start ASR assignment   Before this lab: read Module 7 - Speech Recognition – Pattern matching Lab Staff: Simon King, Ariadna Sanchez

Thursday, 14 November 2024

Speech Processing Lecture 8: Speech Recognition - Feature Engineering

Thursday, 14 November 2024 09:00 - 10:50

To get the best out of machine learning, we can prepare features that reflect our knowledge of the problem, and suit our chosen model.

Before this class: complete Module 8

Lecturer: Catherine Lai

Thursday, 21 November 2024

Speech Processing Lecture 9: Speech Recognition - the Hidden Markov Model

Thursday, 21 November 2024 09:00 - 10:50

We now replace pattern matching with a generative model that is learned from data.

Before this class: complete Module 9

Lecturer: Catherine Lai

Thursday, 28 November 2024

Speech Processing Lecture 10: Speech Recognition - Connected speech & HMM training

Thursday, 28 November 2024 09:00 - 10:50

HMMs extend easily to connected speech so finally we put everything together to make a complete speech recognition system. We'll also learn how to train an HMM from data.

Before this class: complete Module 10

Lecturer: Catherine Lai

Monday, 2 December 2024

Speech Processing Milestone - assignment 2

Monday, 2 December 2024

If you've completed the rest of the assignment, create a language model for digit sequences. The techniques for this will have been taught in Module 10. Start building a system for digit sequences.

April 15, 2025	This video was Excellent Difficulty Just right Doing Text-to-Speech
April 15, 2025	This video was Excellent Difficulty Just right What is a Neural Network?
April 14, 2025	This video was Excellent Difficulty Just right Wrap-up
April 13, 2025	This video was Excellent Difficulty My brain hurts HMM speech synthesis, described as context-dependent modelling
April 13, 2025	This video was Excellent Difficulty My brain hurts HMM speech synthesis, described as context-dependent modelling

Weekly schedule

Search the forums

Speech Processing

In the forums…

Latest video ratings