Module 2 – Acoustic Phonetics

We can analyze differences in the articulation of vowels and consonants in terms of acoustic phonetic features.

In module 2, we’ll look at specific patterns relating to consonants and vowels, and apply these patterns to the task of segmenting, annotating and extracting measures from various types of speech sounds. As you start to recognize the patterns of speech acoustics, keep thinking about the link between what you see in a visualisation of speech acoustics (e.g., a spectrogram) and what is going on with the physical articulators when people speak. If you’re starting to be able to recognize the acoustics of specific consonants and vowels, start thinking about how you might automate that process. What sort of phonetic transcription would be helpful for this?

This week you should try to watch the videos (in the ‘Videos’ tab for this module) before the lecture on Thursday. You can bring your questions to the lecture, or post them on the speech.zone forum.

Lecture slides

Lecture 2 slides (google slides) [updated 24/9/2024]

In these videos, we start to build up our knowledge of acoustics and link that to phonetics.  The first four videos are by Simon King and focus more on the acoustics/engineering side.  The last five videos are from the Virtual Linguistics Campus, introducing acoustic phonetics.  We’ll get our first glimpse at how we can use the frequency properties of speech sound waves (visualised as spectrograms) to figure out what someone said.  We’ll go into more depth about how we get spectrograms in module 3, but for now your task is to think about why using this sort of spectral representation of speech might be helpful for automatic speech recognition and synthesis.

Sound is a wave of pressure travelling through a medium, such as air. We can plot the variation in pressure against time to visualise the waveform.

This video just has a plain transcript, not time-aligned to the video.

Sound is a wave and it has to travel in a medium.
Here the medium's air.
So there's air in this space.
Sound is a pressure wave, so a wave of pressure is going to travel through this medium.
Let's make a simple sound: a hand clap.
When we do that, our hands trap some air between them.
That compresses the air: the pressure increases.
Then it escapes as a pulse of higher pressure air.
We can draw a picture of that high pressure air propagating as a wave through the medium.
That red line is indicating a higher pressure region of air.
So, this is our first representation of sound, its propagation through physical space where sound travels at a constant speed.
In air, that speed is about 340 metres per second, which means it takes about 3 seconds to travel a kilometre.
But rather than diagrams like this - of sound waves propagating through space and then disappearing - it's much more informative to make a record of that sound.
We can do that by picking a single point in space and measuring the variation in pressure at that point over time.
We make that measurement with a device and that device is a microphone.
So let's use a microphone to measure the pressure variation at a single point in space and then plot that variation against time.
So a plot needs some axes.
Here, the horizontal axis will be time and the vertical axis will be the amplitude of the pressure variation.
It's very important to label the axes of any plot with both the quantity being measured and its units.
This axis is 'time', so we label it with that quantity: time.
Time has only one unit.
The scientific unit of time is the second and that's written with just 's'.
On the vertical axis, we're going to measure the amplitude of the variation in pressure.
So I've put the quantity 'amplitude' and 0 is the ambient pressure.
But we don't actually have any units on this axis.
That's simply because our microphone normally is not a calibrated scientific instrument.
It just measures the relative variation in pressure and converts that into an electrical signal that is proportional to the pressure variation.
So we just mark the 0 amplitude point but don't normally specify any units.
Now we can make the measurement of our sound.
As a sound wave passes the microphone, the pressure at that point rises to be higher than normal and then drops to be lower than normal, and eventually settles back to the ambient pressure of the surrounding air.
Let's plot the output of the microphone and listen to the signal the microphone is now recording.
We're going to take the output of this microphone and we're going to record this signal - this electrical signal - on this plot.
Here's the plot we just made.
The plot is called a waveform and this is our first actually useful representation of sound.
This representation is in the time domain because the horizontal axis of the plot is time.
Later, we'll discover other domains in which we can represent sound, and we'll plot those using different axes.
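As a minimal sketch of this time-domain representation, here is how you might plot a waveform in Python, assuming a hypothetical recording called voice.wav (scipy and matplotlib are assumed to be installed):

```python
# A sketch of plotting a waveform: pressure variation (amplitude) against time.
import numpy as np
import matplotlib.pyplot as plt
from scipy.io import wavfile

fs, samples = wavfile.read("voice.wav")   # fs = sampling rate, in samples per second
t = np.arange(len(samples)) / fs          # time of each sample, in seconds

plt.plot(t, samples)
plt.xlabel("time (s)")     # always label the quantity and its units...
plt.ylabel("amplitude")    # ...except amplitude: the microphone is not calibrated
plt.show()
```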
The waveform is useful for examining some properties of sound.
For example, here's a waveform of a bell sound.
We can see that, for example, the amplitude is clearly decaying over time.
This is a waveform of speech: 'voice'.
It's the word 'voice' and some of the things we can measure from this waveform would be, again that the amplitude is varying over time in some interesting way, and that this word has some duration.
We could enlarge the scale to see a little bit more detail.
This particular part of the waveform has something quite interesting going on.
It clearly has a repeating pattern; that looks like it's going to be important to understand.
But in contrast, let's look at some other part of this waveform.
Maybe this part here.
It doesn't matter how much you zoom in here, you won't find any repeating pattern.
This is less structured: it's a bit more random.
That's also going to be important to understand.
So far, we've talked about directly plotting the output from a microphone.
Microphones essentially produce an electrical signal: a voltage.
That's an analogue signal: it's proportional to the pressure that's being measured.
But actually we're going to do all of our speech processing with a computer.
Computers can't store analogue signals: they are digital devices.
We're going to need to understand how to represent an analogue signal from a microphone as a digital signal that we can store and process in a computer.
We also already saw that sounds vary over time.
In fact, speech has to vary, because it's carrying a message.
So we'll need to analyse not whole utterances of speech, but just parts of the signal over short periods of time.
Speech varies over time for many reasons, and that's controlled by how it's produced.
So we need to look into speech production, and the first aspect of that that we need to understand is 'What is the original source of sound when we make speech?'
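Before moving on, here is a minimal sketch of the digital idea just mentioned: a computer stores a signal as a sequence of samples measured at discrete points in time. The numbers here (16,000 samples per second, a 100 Hz sine wave) are illustrative choices, not anything from the video:

```python
# A sketch of representing a signal digitally: sample its value at regular intervals.
import numpy as np

fs = 16000                          # sampling rate: 16,000 samples per second
t = np.arange(0, 0.01, 1 / fs)      # discrete sample times covering 10 ms
x = np.sin(2 * np.pi * 100 * t)     # a 100 Hz sine wave, now a list of numbers

print(len(x), "samples represent 0.01 s of signal")   # 160 samples
```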

Airflow from the lungs is the power source for generating a basic source of sound, either at the vocal folds or at a constriction made anywhere in the vocal tract.

This video just has a plain transcript, not time-aligned to the video.

We've seen speech already in the time domain by looking at the waveform.
But how is that speech made?
Well, we need some basic sound source and some way to modify that basic sound source.
The modification, for example, might make one vowel sound different from another vowel sound.
Here we're just going to look at the source of sound, and we'll see two possible sources that can make speech.
Here's someone talking.
He has a vocal tract; that also happens to be useful for breathing and eating, but here we're talking about speaking.
That's just a tube.
For our purposes, it doesn't matter that that's curved.
That's just to fit in our body.
We can think of it as a simple tube, like this.
So here it is, a simplified vocal tract.
At the top here, the lips; at the bottom, the lungs.
The lungs are going to power our sound source.
Airflow from the lungs comes into the vocal tract.
We can block the flow of air with a special piece of our anatomy called the vocal folds.
There they are.
As air keeps flowing from the lungs, the pressure will increase below the vocal folds.
We will get more and more air molecules packed into this tight space.
More tightly packed molecules in the same volume means an increase in pressure.
That's what pressure is: it's the force molecules exert on each other and on their container.
Eventually, the pressure is enough to force its way through the blockage, and the vocal folds burst open.
The higher pressure air from below moves up.
So we get a pulse of higher pressure air bursting through the vocal folds.
That releases the pressure below the vocal folds and they will close again.
Now we have a situation where there is a small region of higher pressure air just here, surrounded by lower pressure air everywhere else.
That's obviously not a stable situation.
This higher pressure air exerts a force on the neighbouring air and a wave of pressure moves up through the vocal tract.
It's important to understand that this wave of pressure is moving at the speed of sound, and that's quite different from the gentle air flow from your lungs: your breathing out.
You don't breathe out at the speed of sound!
Breathing is just the power source for the vocal folds.
The air flow in the vocal tract is much, much slower than the propagation of the pressure wave.
So we can neglect the airflow and just think about this pressure wave moving through air.
A pulse of high pressure has just been released by the vocal folds.
Let's make a measurement of that.
Imagine we could put a microphone just above the vocal folds and measure the pressure there.
The plot might look something like this: an increase in pressure as the pulse escapes, a dip as the pulse moves away, and then a gradual settling back to the ambient pressure.
We've created sound!
Sound is a variation in the pressure of air.
Let's listen to that one pulse - that glottal pulse.
Listen carefully because it's going to be very short.
Just sounds like a click.
Let's do that again.
That's the sound of a glottal pulse created in the glottis.
The glottis is a funny thing.
It's the anatomical name for the gap between the vocal folds.
Of course, if the lungs keep pushing air, the pressure will build up again.
After some short period of time, the vocal folds will burst open again and we'll get another pulse.
That will repeat for as long as the air is being pushed by the lungs.
Remember, the lungs are the power source of the system.
The actual signal will be a repeating sequence of pulses.
I'm going to play this pulse now, not in isolation, but I'm going to play it 100 times per second.
It sounds like this.
Well, it's not speech, but it's a start.
For our purposes, which eventually are going to be to build a model of speech production that we can use for various things, the actual shape of the pulse turns out to be not very important.
Let's try simplifying that down to the simplest possible pulse.
That's this signal here, that is zero everywhere and goes up to a maximum value instantaneously and then back down again.
Let's listen to that.
Again, listen carefully.
It sounds pretty similar to the other pulse, just like a click.
We can play a rapid succession of such clicks.
Let's start with a very slow rate of just 10 per second.
Perceptually that's still just a sequence of individual clicks, so I'll increase the rate now to 40 per second.
I can't quite make out individual clicks now.
It's starting to sound like a continuous sound.
If we go up to 100 per second, it's definitely a continuous buzzing sound.
So, although we're talking about speech production here, we've learned something interesting about speech perception already: that once the rate of these pulses is high enough, we no longer hear individual clicks but we integrate that into a continuous sound.
This pulse train signal is going to be a key building block for us.
It's going to be initially just for understanding speech.
That's what we're doing at the moment.
We're going to use it later actually, as the starting point for generating synthetic speech.
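If you want to recreate the pulse-train demo yourself, here is a minimal sketch, assuming a 16 kHz sampling rate and an illustrative output file name:

```python
# A sketch of the simplest possible pulse train: a single non-zero sample,
# repeated 'rate' times per second (try rate = 10, 40, 100).
import numpy as np
from scipy.io import wavfile

fs = 16000                    # sampling rate in Hz
rate = 100                    # pulses per second
x = np.zeros(fs)              # one second of silence
x[:: fs // rate] = 0.5        # place a pulse every fs/rate samples

wavfile.write("pulse_train.wav", fs, (x * 32767).astype(np.int16))
```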
There are other sources of sound.
We will just cover the second most important one, after voicing.
Again, here airflow from the lungs is the power source.
But this time, instead of completely blocking the flow at the vocal folds (which are at the bottom of the vocal tract) we'll force the airflow through a narrow gap somewhere in the vocal tract.
So let's make that constriction.
Air flows up from the lungs, and it's forced through this narrow gap.
When we force air through a narrow gap, it becomes turbulent.
The airflow becomes chaotic and random, and that means that the air pressure is varying chaotically and randomly.
And since sound is nothing more than pressure variation, that means we've generated sound!
So again, if we put a microphone just after that constriction and recorded that signal created by that chaotic, turbulent airflow, it looks something like this: random and without any discernible structure.
Certainly no repeating pattern.
That signal would sound like this.
It's noise.
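A minimal sketch of this noise source, under the same assumptions as the pulse-train sketch above:

```python
# A sketch of a frication-like source: random samples with no repeating
# pattern, however far you zoom in.
import numpy as np
from scipy.io import wavfile

fs = 16000
noise = np.random.uniform(-0.5, 0.5, fs)   # one second of random pressure variation
wavfile.write("noise.wav", fs, (noise * 32767).astype(np.int16))
```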
Why don't you try making a narrow constriction somewhere in your vocal tract and pushing air through it from your lungs? I wonder how many different sounds you could make that way.
You can change the sound by putting the constriction in a different place.
I'll give you a few to start with.
I'm sure you can come up with many more.
These then are the two principal sources of sound in speech.
On the left, voicing.
That's the regular vibration of the vocal folds.
On the right, frication, which is the sound caused by turbulent airflow at a narrow constriction somewhere in the vocal tract.
There are a few other ways of making sound, but we don't really need them at this point.
These are going to be enough for a model of speech that will be able to generate any speech sound.
We saw everything in the time domain here.
We plotted lots of waveforms.
We've been talking about the sound source, and we now know why a speech waveform sometimes has a repeating pattern.
It's because the sound source itself was repeating.
We call such signals 'periodic', and you'll find that whenever there's a repeating pattern in the waveform, that can only be caused by voicing: the periodic vibration of the vocal folds.
Whenever there is voicing, you will also perceive a pitch.
Perhaps you could call that a musical note or a tone.
Pitch is controlled by the speaker's rate of vibration of the vocal folds.
So we could use pitch to help convey a message in speech.

The vocal folds block air flow from the lungs, burst open under pressure to create a glottal pulse, then rapidly close. This repeats, creating a periodic signal.

This video just has a plain transcript, not time-aligned to the video.

The most important source of sound in speech is the periodic vibration of the vocal folds.
Here's a reminder of the two principal sources of sound when producing speech.
Now let's introduce some engineering terms to describe their general properties.
On the left we have voicing.
That's the phonetic term: voicing means the vibration of the vocal folds and that results in a periodic signal.
Periodic signals are predictable.
You could continue the plot on the left and tell me what happens next.
We call this type of signal 'deterministic'.
On the other hand, frication results in a signal that has no periodicity: it's very unpredictable.
So we could say that that is 'aperiodic' or 'non-periodic'.
(They mean the same thing.)
Aperiodic signals are not predictable.
You cannot guess what happens next in the plot on the right.
So we could use some engineering terms.
Periodic signals are 'deterministic': we know what happens next.
Aperiodic or non-periodic signals are 'stochastic': we don't know what happens next, they're random.
Periodic signals are so important, we're going to take a closer look now.
All periodic signals have a repeating pattern.
In this particular signal, it's really obvious what that repeating pattern is.
We can also see exactly how often it repeats: every 0.01 s or 1/100th of a second.
We can use some notation to denote that.
We use the term T0 to denote the fundamental period.
T0 has a dimension of time and a unit of seconds.
Here we can see what T0 is.
It's the time it takes for this signal to repeat.
Here's another signal.
This is a very special signal called a sine wave.
This one has a fundamental period of 0.1 s or 1/10th of a second.
So if we repeated this cycle 10 times, we'd fill up a duration of exactly 1 s.
Another way of talking about the signal is, instead of saying that the fundamental period is 0.1 s, we can say that it has a fundamental frequency of 10.
We'll use the notation F0 to denote fundamental frequency, and that's just going to be equal to 1/T0.
But what are the units of frequency?
10 is the number of periods per second and the units of time are seconds.
The units of frequency are 1/s (1 over seconds) or, in scientific notation, 'seconds to the minus 1'.
That's a little bit of an awkward unit.
Since frequency is so important, we don't normally write down this unit, which we could say out loud as 'per second'.
We give it its own units of Hertz.
These are all equivalent.
The scientific unit of frequency is Hertz.
But just remember, it always means precisely the same as '1 over seconds' or 'seconds to the minus 1'.
The old fashioned unit of frequency was actually very helpfully called 'cycles per second', but we don't use that anymore.
So what are the fundamental periods and the fundamental frequencies of these signals?
Sit down and work them out while you pause the video.
I hope you remembered to always give the units.
In the top row, we've got 10 cycles in one second.
So that's a fundamental period of 0.1 s and we've got then an F0 of 10 Hz.
I hope you got that right, with the units.
Top right, we've got a fundamental period of 0.01 and that gives us a frequency of 100.
But always write the units!
T0 = 0.01 s and F0 = 100 Hz.
Down on the bottom left, we've got a much higher frequency signal.
That's got a fundamental period of 0.0005 s and then it's got a fundamental frequency of 2000 Hz.
We could now use some more scientific notation because once we're into the thousands we could start saying a multiplier for the Hz.
Instead of writing 2000 Hz, we could write 2 kHz.
Those are the same thing.
Bottom right, it's a little bit trickier.
This is a speech signal, but it's pretty obvious what the fundamental period is here.
We can see a clear repeating pattern, and it's going to be from here to here, and so T0 is pretty close to 0.005 s to give us an F0 of 200 Hz.
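Here are those worked examples as a tiny calculation, using the relation F0 = 1/T0 defined above:

```python
# The four fundamental periods from the examples above, in seconds.
for T0 in (0.1, 0.01, 0.0005, 0.005):
    F0 = 1 / T0
    print(f"T0 = {T0} s  ->  F0 = {F0:g} Hz")   # 10, 100, 2000, 200 Hz
```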
Periodic signals are very important as a sound source in speech.
Thinking about speech perception and about getting a deeper understanding of speech as a means of communicating a message, we'll find that periodic signals are perceived as having a pitch, or a musical tone, and that can be employed by speakers to convey part of the message.
Pitch is part of a collection of other acoustic features that speakers use, which collectively we call prosody.
Thinking on the other hand about signal processing and about getting a deeper understanding of speech signals, for example, so we can make a model of them, we're going to need to move out of the time domain and into the frequency domain, where we'll see that this very special periodic nature in the time domain has an equally special, distinctive property in the frequency domain, and that is harmonics.

Periodic signals are perceived as having a pitch. The physical property of fundamental frequency relates to the perceptual quantity of pitch.

This video just has a plain transcript, not time-aligned to the video.

Periodic signals have a very important perceptual property: pitch.
That means that periodic signals are perceived as having a musical note: a tone.
Here are some signals that are periodic.
They all have a repeating pattern.
And so we predict, just by looking at them, that when we listen to them, there will be a pitch to perceive.
There's the simplified glottal waveform we've seen before.
Well, not very pleasant, but it certainly has a pitch.
Here's a sine wave: that's a very pure, simple sound, again, with a very clearly perceived pitch.
Finally a short clip of a spoken vowel.
I'll play that again.
Again, a clear pitch can be perceived.
Pitch is a perceptual phenomenon.
We need to establish the relationship between the physical signal property of F0 (fundamental frequency) and this perceptual property of pitch.
Let's do that by listening to some sine waves, some pure tones.
I'll play one at 220 Hz and then I'll play one at 440 Hz.
Hopefully, you have a musical enough ear to hear that that's an octave.
There's a clear musical relationship between the two.
The second one is perceived as having twice the pitch of the first.
So let's go up another 220 and see what happens.
No, that's definitely not an octave!
You don't need to be a musician to know that.
So let's go up again.
That sounds like it might be an octave above 440.
So let's listen to octaves.
We've discovered something really important: that the relationship between the physical signal property F0 and the perceptual property pitch is not linear.
To perceive the same interval change in pitch - an octave - we don't need to add a fixed amount to the frequency: we need to double the frequency.
So this relationship between F0 and pitch is actually logarithmic.
It's non-linear.
That non-linearity is one aspect of a much more general property of our auditory system as a whole.
It is, in general, non-linear.
We can probably make use of that knowledge later on.
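A minimal sketch of that logarithmic relationship: an octave is a doubling of frequency, and (a fact from music, not from this video) each octave spans 12 semitones:

```python
import math

def semitones(f1, f2):
    """Perceived pitch interval between two frequencies, in semitones."""
    return 12 * math.log2(f2 / f1)

print(semitones(220, 440))   # 12.0 -> an octave
print(semitones(440, 660))   # ~7.0 -> adding 220 Hz again is not an octave
print(semitones(440, 880))   # 12.0 -> doubling is
```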
So for speech, where the pitch is varying in interesting ways because it might be carrying part of the message, we would need to measure the local value of F0 and then plot how that changes against time.
Now, because there's a very simple relationship between F0 and pitch, you'll find the two terms actually used interchangeably in our field.
But that's not technically correct!
They are not the same thing.
F0 is a physical property: it's the rate of vibration of the vocal folds.
We could measure that if we had access to the speaker's vocal folds.
Or we could estimate it automatically from a signal.
Here's some software that will do that.
It's called Praat.
Other software can also do the same thing.
It will make that measurement of F0 for you.
In fact, Praat calls it pitch, even though it's estimating F0!
But it's very important to remember the software does not have access to the speaker's vocal folds.
It can only estimate F0 from the speech signal, using some algorithm.
That's a non-trivial estimation, so you must always be aware that there will be errors in the output of any F0 estimation algorithm.
This is not truth: this is an estimate.
'Nothing's impossible'
The term pitch really then is about the perceptual phenomenon.
It only exists in the mind of a listener, and so experiments about pitch would have to involve humans listening to speech.
Experiments about F0 could be done on speech signals analytically.
So speakers can control the fundamental frequency as well as the duration and the amplitude of the speech sounds they produce.
They can use all of those acoustic properties - and others - to convey parts of the message to a listener.
We use the term 'prosody' to refer collectively to the fundamental frequency, the duration, and the amplitude of speech sounds (sometimes also voice quality).
Later, then, when we attempt to generate synthetic speech, we'll have to give it an appropriate prosody if we want it to sound natural.
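If you want to try F0 estimation from Python rather than in the Praat GUI, here is a minimal sketch assuming the Parselmouth library (a Python interface to Praat) is installed and a hypothetical file voice.wav exists:

```python
# A sketch of estimating F0 with Praat's algorithm, via Parselmouth.
# Remember: this is an estimate from the signal, not ground truth.
import parselmouth

snd = parselmouth.Sound("voice.wav")
pitch = snd.to_pitch()                      # Praat calls the F0 track "pitch"
f0 = pitch.selected_array["frequency"]      # F0 in Hz; 0 where no voicing found
times = pitch.xs()                          # analysis frame times, in seconds
```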

The shape of the vocal tract results in different frequencies getting boosted or dampened.

Video on youtube

Spectrograms display the spectrum of frequencies in a recording over time. This sort of frequency representation is the basis of speech technologies such as automatic transcription and speech synthesis.

Video on youtube
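As a preview of module 3, here is a minimal sketch of computing a spectrogram in Python, assuming a hypothetical mono recording voice.wav:

```python
# A sketch of a spectrogram: the spectrum of frequencies over time.
import numpy as np
import matplotlib.pyplot as plt
from scipy.io import wavfile
from scipy.signal import spectrogram

fs, x = wavfile.read("voice.wav")
f, t, Sxx = spectrogram(x, fs)              # frequency bins, frame times, power

plt.pcolormesh(t, f, 10 * np.log10(Sxx + 1e-10))   # power in decibels
plt.xlabel("time (s)")
plt.ylabel("frequency (Hz)")
plt.show()
```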

More detail on how spectrograms map to vowel articulations.

Video on youtube

More detail on how spectrograms map to consonant articulations. Learning this type of mapping is how we do speech recognition and synthesis!

Video on youtube

A worked example of spectrogram reading, i.e. speech recognition. This sort of spectrogram reading exercise would be a "stretch" question in this speech processing course, but it's a foundational part of most phonetics courses. We'll focus on getting computers to do it instead!

Video on youtube

Readings for module 2 focus on the acoustic properties of consonants and vowels.

Reading

Ladefoged & Johnson – A course in phonetics – Chapter 2 – Phonology and Phonetic Transcription

Basics of phonology and phonetic transcription. Read this over Speech Processing modules 1 and 2.

Ladefoged & Johnson – A course in phonetics – Chapter 8 – Acoustic phonetics

Links the source-filter model to spectrograms and acoustic analysis of speech.

Peterson & Barney – Control Methods Used in a Study of the Vowels

Examines the production and perception of vowels. This is a classic paper that many other studies have built on.

Exploring Speech Acoustics

In the lab for module 2, you will continue to explore speech acoustics through visualisations in Praat.

You can find the lab instructions here: phon_lab_2

If you have taken LEL2b or a similar intro to phonetics course, you may find this material very familiar. If it is and you haven’t done much maths recently, it may be a good opportunity to spend some time on maths revision (see notes in the Module 1 lab tab). Or you could just get ahead with the readings/videos for upcoming modules!

Lab Answers and Commentary

That completes our main modules on articulatory and acoustic phonetics. You should now have a basic understanding of how vowels and consonants are produced in terms of the vocal tract and its articulators. You should also have seen that we can “see” evidence of these articulations in speech acoustics, as represented in a spectrogram. These acoustic cues can be used to “read” spectrograms, i.e. to tell what someone has said just by looking at a spectrogram. This is in essence what automated transcription systems attempt to do! So, it’s important to know which acoustic properties of the speech waveform are important for identifying what has been said for speech recognition. For speech generation, we want to make sure we generate the right acoustic features so that the waveform is understood as speech.

The next two modules will look at aspects of acoustic phonetics from more of an engineering point of view. We’ll come back to more phonetics issues in later weeks, as we learn more about TTS and ASR. In particular, we’ll look at the source-filter model from both theoretical and engineering points of view.

Making connections between the phonetics material and the speech technologies we’ll look at in the coming weeks will help you be an active learner. Just now, you probably have an understanding of issues in phonetics that will feed into how we design speech technologies, but only a vague idea of the ‘big picture’: the ideas may not yet be well-organised in your mind. Keep connecting and organising, and you’ll find that it does all join together.

What you should know from Module 2

Note: we’ll continue to discuss a lot of the ideas around the frequency domain, resonance and the source-filter model in modules 3 and 4.

What does a speech waveform (i.e. in the time-domain) represent?

  • Time versus amplitude graphs
  • Oscillation cycle
  • Period T and wavelength λ (we’ll revisit this in the next few modules)
  • Frequency (F=1/T)
  • What are “Hertz”?
  • How to calculate the frequency of a waveform by measuring pitch periods (example in the “waveform” video)

Types of waveform:

  • Simple versus complex waves
  • Periodic versus aperiodic waves
  • Continuous versus transient waves
  • Fundamental Period (T0)
  • Fundamental frequency (F0)

Spectrum:

  •  The spectrum as a representation of waveform frequency components
  •  What is the spectral envelope?
  •  Why do we consider F0 and harmonics to be “source” characteristics?
  •  What is the relationship between formants and resonance?
  •  F0 is not a formant!

Spectrogram:

  •  What do the x and y axes of a spectrogram represent (e.g. in Praat)?

Acoustics of Vowels:

  •  What is the general relationship between formants (acoustics) and tongue position (articulation):
    •  F1 and vowel height
    •  F2 and vowel frontness
  •  Acoustic vowel space:
    • You don’t need to know the specific formants associated with different vowels, but if you understand the relationship between height/frontness and formants you should be able to deduce this!

 Acoustics of Consonants:

  •  What does voicing look like on a spectrogram or spectrum?
  •  Identify basic acoustic characteristics (i.e. on a spectrogram) of consonant manners: plosives (i.e., stops), aspiration (for plosives), fricatives, nasals, approximants.
  • Clues for place of articulation: stops, fricatives

Vowel space

  • How can you interpret an F1 vs F2 plot of vowel measurements? (See the sketch after this list.)
  • Relate this to vowel characteristics/the IPA vowel chart (e.g. tongue height and frontness)
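Here is a minimal sketch of such an F1 vs F2 plot, using rough average formant values for three corner vowels in the spirit of Peterson & Barney; treat the numbers as illustrative, not definitive:

```python
import matplotlib.pyplot as plt

# Approximate (F1, F2) averages in Hz for adult male speakers.
vowels = {"i": (270, 2290), "a": (730, 1090), "u": (300, 870)}

for v, (f1, f2) in vowels.items():
    plt.plot(f2, f1, "o")
    plt.annotate(v, (f2, f1))

plt.gca().invert_xaxis()    # front vowels (high F2) on the left...
plt.gca().invert_yaxis()    # ...close/high vowels (low F1) at the top, like the IPA chart
plt.xlabel("F2 (Hz)")
plt.ylabel("F1 (Hz)")
plt.show()
```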

Key Terms

  • waveform
  • amplitude
  • sine wave
  • period
  • frequency
  • wavelength
  • Hertz
  • fundamental frequency
  • harmonics
  • spectrum
  • spectrogram
  • spectral envelope
  • Fourier transform
  • formant
  • vowel height
  • vowel frontness
  • open vowel
  • close vowel
  • plosive
  • voicing
  • voice onset time
  • aspiration
