This module is being updated for 2024-25!
Welcome to Speech Processing!
In this first module/week, we will first give an overview of the course with a view to establishing the relevance of phonetics to speech technology (i.e., text-to-speech and automatic speech recognition). We’ll start to touch upon the following foundational questions in spoken language processing: What is text? What does it represent? How can you describe speech to a computer? How does that relate to phonetics?
After the course overview, we will start to make the connection between text and speech by looking at some visual representations of speech and relating them to the articulatory changes that take place in your mouth to create various speech sounds. We’ll also begin working with the speech annotation software Praat to annotate and analyse speech sound waves. We’ll briefly introduce the IPA and the concepts that relate the grid structure of the chart to the anatomical structures of human vocal tracts.
Since people are still choosing courses, we won’t assume that you will have watched this week’s videos or done the readings before the Thursday lecture. But if you are certain you are taking this course, we may want to get ahead on that (and on next week’s content).
Please note there is no lab in week 1! The first lab session will be in week 2 and will follow on from material from module 1. In general labs for a module are in the week after the lecture.
Lecture Slides
Lecture 1 part 1 slides (google slides) [updated 17/9/2024]
Lecture 1 part 2 slides (google slides) [updated 17/9/2024]
You download the slides in various formats from those links (go to File > Download and choose your file type)
These videos give a concise introduction to articulatory phonetics and International Phonetic Alphabet (IPA). The first three videos comes from the UBC Visible Speech Channel, where you can also find a lot more demonstrations of how specific vocal tract articulations relate to speech sounds. You can also find a lot of further information on their eNunciate! project website. There’s also more audio/video resources in the readings tab.
The last two videos come from the Virtual Linguistics Campus youtube channel. That channel also has many useful videos on phonetics and linguistics more generally. We’ll watch some more of these in module 2.
Total video to watch in this section: 20 minutes
Since this is the first module, here’s a reminder that the readings (or sometimes other media) in each module are categorised as
- Essential (read all of these)
- Recommended (read if you want to go deeper)
- Extra (only read if you’re interested/have previous background; these readings may be challenging and can be considered beyond the scope of the course)
Only material in the essential readings is directly examinable, but the other readings may help you make the connections that get higher marks.
Module 1 notes
- There’s two chapters of essential reading this week, but the chapter on transcription is also a reading for module 2. So, if you don’t get through it all in week 1, that’s ok! Just make sure you look at all the phonetics essential readings by the end of module 2.
- The recommended “readings” here are actually audio/video resources. It’s usually a lot easier to grasp articulatory phonetics concepts by listening to actual sounds and seeing articulator differences. So, we’d highly recommend using these as you read through the texts (the first recommended link is the companion website for Ladefoged and Johnson’s textbook).
- You don’t have to memorize the IPA chart for this course, but exploring the interactive IPA chart (e.g. Seeing Speech) should help you get oriented with the layout quickly.
Reading
Ladefoged & Johnson – A course in phonetics – Chapter 1 – Articulation and Acoustics
An Introduction to articulatory phonetics and speech acoustics
Ladefoged & Johnson – A course in phonetics – Chapter 2 – Phonology and Phonetic Transcription
Basics of phonology and phonetic transcription. Read this over Speech Processing modules 1 and 2.
Ladefoged & Johnson – A course in phonetics – Online examples
An Introduction to articulatory phonetics and speech acoustics
Seeing Speech
Interactive IPA chart
Normal Speech Articulation
X-ray movies of speech
Introduction to the IPA from the Handbook of the International Phonetic Association
Describes the aims of the International Phonetic Alphabet and its various uses.
Practical Phonetics
Videos for the course Practical Phonetics
Analysing speech articulations
This is your first PHON lab! This lab will give a quick introduction to Praat phonetics software. We’ll then explore speech articulations via visualisations in Praat and your own practice!
Labs will start in week 2 and will be held in Appleton Tower room 4.02 (the PPLS computing lab). You should be assigned to one of the Wed 9-11am or 4-6pm labs.
You can find the lab materials for module 1 here: phon_lab_1
Remember, the labs for each module are in the week following the lecture. So, this week 2 lab will focus on module 1 material. Try to watch the videos and go through the readings before the lab (as well as looking at the lab instructions!).
Maths refresher
If you’re already quite familiar with Praat and phonetics, but not so much with maths, now might be a good time to check out some of Sharon Goldwater’s maths tutorials here:
https://homepages.inf.ed.ac.uk/sgwater/math_tutorials.html
If you’re looking for a more comprehensive textbook that goes back to basics, you might find Foundation Maths by Croft and Davidson helpful (available through the University of Edinburgh library).
For the signal processing bit of the course (coming up next), you might also want to brush up on some trigonometry and vectors (the trig stuff is mostly relevant for the signal processing modules, the vectors/linear algebra is relevant to machine learning more generally):
There’s more maths prep resources here: https://speech.zone/courses/prepare-for-study-in-speech-and-language-processing/brush-up-your-mathematics/
Or you could just get ahead with the assigned reading!
Lab Answers and Commentary
- Lab worksheet annotated with answers and commentary
- Sample textgrids: seashells, Peter Piper
Once you’ve attended the first lecture, it’s time to start watching the associated videos on phonetics and doing the readings listed in module 2. If you’re not enrolled and you want to take this course, please make sure that you enrol as soon as possible!
Remember: The lab for module 1 will be held in week 2.
What you should know from Module 1
Students often ask if they need to memorise the IPA chart and all the physiological terms and symbols mentioned in this module. The answer is no! The main skill you need is to be able to read and interpret the IPA Chart. For this course, you’ll always be able to refer to IPA chart in the tests (as in most real life IPA related scenarios!).
We don’t go that deep into articulatory phonetics in this course, but you should make sure that you have an understanding of the foundational concepts:
- Basic vocal anatomy:
- What are active versus passive articulators
- Systems in Speech Production: the respiratory system, the phonation system, and the articulation
system
- Consonants:
- Voicing: what the difference between voiced versus unvoiced sounds
- Place: You should be able to relate place columns in the IPA chart to points of constriction in
the voice tract. - Manner: You should know the general differences between stops, taps, trill, fricative, lateral fricatives, approximants in terms of how they are articulated.
- Why are pulmonic and non-pulmonic consonants listed separately on the IPA chart?
- Vowels:
- Height
- Frontness
- rounding
- What’s the difference between monophthongs and diphthongs
Key Terms
- Articulation
- Articulators
- Manner (of articulation)
- Place (of articulation)
- Voicing
- Consonant
- Vowel
- Vowel quality
- Phonetic
- Acoustic
- Phone
- Contrast
You may find some helpful discussion on the topics covered in this module on the Foundations of Speech part of the speech.zone forum.
- This category has 85 topics, 162 replies, and was last updated 2 months, 1 week ago by .
-
- Forum
- Topics
- Last Post
-
-
Acoustics
Sound, how it is produced and how it behaves
- 19
- 1 year, 2 months ago
-
Phonetics and speech science
How speech is produced and perceived
- 19
- 2 years, 2 months ago
-
Signal processing
Questions about feature extraction, time and pitch modification, or anything else we can do to speech waveforms.
- 46
- 2 months, 1 week ago
-
Acoustics