Start

This module is being updated for 2024-25!

Welcome to Speech Processing!

In this first module/week, we will first give an overview of the course with a view to establishing the relevance of phonetics to speech technology (i.e., text-to-speech and automatic speech recognition). We’ll start to touch upon the following foundational questions in spoken language processing: What is text? What does it represent? How can you describe speech to a computer? How does that relate to phonetics?

After the course overview, we will start to make the connection between text and speech by looking at some visual representations of speech and relating them to the articulatory changes that take place in your mouth to create various speech sounds. We’ll also begin working with the speech annotation software Praat to annotate and analyse speech sound waves. We’ll briefly introduce the IPA and the concepts that relate the grid structure of the chart to the anatomical structures of human vocal tracts.

Since people are still choosing courses, we won’t assume that you will have watched this week’s videos or done the readings before the Thursday lecture. But if you are certain you are taking this course, we may want to get ahead on that (and on next week’s content).

Please note there is no lab in week 1! The first lab session will be in week 2 and will follow on from material from module 1. In general labs for a module are in the week after the lecture.

Lecture Slides

Lecture 1 part 1 slides (google slides) [updated 17/9/2024]

Lecture 1 part 2 slides (google slides) [updated 17/9/2024]

You download the slides in various formats from those links (go to File > Download and choose your file type)