Total video to watch in this module: 59 minutes
This course assumes no particular prior knowledge, but my experience is that students do much better when they come from particular backgrounds.
Now we look at the speech chain to understand the scope of this course and to set the context for the two main applications that we will consider: text-to-speech synthesis, and automatic speech recognition.
Sound is a pressure wave that travels through a medium such as air.
It is often more useful to investigate sound in the frequency domain, rather than the time domain.
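To make the time-versus-frequency idea concrete, here is a minimal sketch that builds a signal from two sine waves and then finds them again in the frequency domain. The sample rate, duration and the two frequencies (100 Hz and 250 Hz) are arbitrary values chosen for illustration, and the naive discrete Fourier transform below is written for readability, not speed.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Magnitude of each DFT bin from 0 Hz up to the Nyquist frequency.

    Naive O(N^2) implementation, fine for short illustrative signals.
    """
    n = len(samples)
    magnitudes = []
    for k in range(n // 2 + 1):
        total = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(samples)
        )
        magnitudes.append(abs(total))
    return magnitudes


# Illustrative values (assumptions, not taken from the course).
SAMPLE_RATE = 800  # samples per second
N = 80             # 0.1 seconds of signal

# Time domain: the two components are hard to tell apart by eye.
signal = [
    math.sin(2 * math.pi * 100 * t / SAMPLE_RATE)
    + 0.5 * math.sin(2 * math.pi * 250 * t / SAMPLE_RATE)
    for t in range(N)
]

# Frequency domain: each component shows up as a separate peak.
mags = dft_magnitudes(signal)
bin_width = SAMPLE_RATE / N  # Hz per DFT bin
strongest = sorted(range(len(mags)), key=lambda k: mags[k], reverse=True)[:2]
peak_freqs = sorted(k * bin_width for k in strongest)
print(peak_freqs)
```

In the waveform the two sine waves are mixed together at every sample, but in the magnitude spectrum they sit in separate bins, which is why the frequency domain is often the more convenient view.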
We'll start on the journey towards understanding speech production with this basic concept.
A tube full of air can resonate, as sound waves propagate along it.
A resonating system selectively amplifies certain frequencies: this is called filtering.
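One way to see filtering in action is to pass pure tones through a simple digital resonator and compare how much of each comes out. The sketch below uses a standard two-pole resonator; the centre frequency (500 Hz), the sample rate, the pole radius `r` and the helper names are all assumptions made for this illustration, not something defined in the course.

```python
import math


def resonator(signal, centre_hz, sample_rate, r=0.95):
    """Two-pole resonator: y[n] = x[n] + a1*y[n-1] + a2*y[n-2].

    The pole radius r (0 < r < 1) controls how sharp the resonance is.
    """
    theta = 2 * math.pi * centre_hz / sample_rate
    a1 = 2 * r * math.cos(theta)
    a2 = -r * r
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = x + a1 * y1 + a2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def steady_state_peak(freq_hz, centre_hz=500.0, sample_rate=8000, n=2000):
    """Peak output amplitude for a unit-amplitude sine wave input.

    Only the second half of the output is measured, so that the
    start-up transient has died away.
    """
    tone = [math.sin(2 * math.pi * freq_hz * t / sample_rate) for t in range(n)]
    out = resonator(tone, centre_hz, sample_rate)
    return max(abs(v) for v in out[n // 2:])


gain_at_resonance = steady_state_peak(500.0)
gain_off_resonance = steady_state_peak(2000.0)
print(gain_at_resonance > gain_off_resonance)
```

Both inputs have the same amplitude, yet the tone at the resonant frequency comes out much larger than the tone far away from it. That frequency-dependent change in amplitude is what we mean by filtering.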
Using visual reasoning, we can calculate a resonant frequency of this simple tube.
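As a worked example of the kind of calculation involved, the sketch below assumes a tube that is closed at one end and open at the other, so that a quarter of a wavelength fits along its length. That configuration, the tube length of 0.17 m and the speed of sound of 340 m/s are assumptions chosen for illustration; the tube used in the video may differ.

```python
# Assumed values for illustration only.
SPEED_OF_SOUND = 340.0  # metres per second, approximate value in air
TUBE_LENGTH = 0.17      # metres


def lowest_resonance(length_m, speed=SPEED_OF_SOUND):
    """Lowest resonant frequency of a tube closed at one end, open at the other.

    A quarter of a wavelength fits in the tube, so the wavelength is
    four times the tube length, and frequency = speed / wavelength.
    """
    wavelength = 4 * length_m
    return speed / wavelength


f1 = lowest_resonance(TUBE_LENGTH)
print(round(f1, 1))
```

Under these assumptions the lowest resonance comes out at about 500 Hz. A shorter tube gives a shorter wavelength and so a higher resonant frequency.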