Taylor – Section 12.3 – The cepstrum

By using the logarithm to convert a multiplication into a sum, the cepstrum separates the source and filter components of speech.

Taylor – Section 10.2 – Digital signals

Going digital involves approximations in the way an original analogue signal is represented.

Taylor – Section 10.1 – Analogue signals

It’s easier to start by understanding physical signals – which are analogue – before we then approximate them digitally.

Taylor – Chapter 8 – Pronunciation

Including how the lexicon is stored, letter-to-sound, and compressing the lexicon.

Taylor – Chapter 6 – Prosody prediction from text

Predicting phrasing, prominence, intonation and tune, from text input.

Taylor – Chapter 5 – Text decoding

Complementary to Jurafsky & Martin, Section 8.1.

Taylor – Chapter 4 – Text Processing

Complementary to Jurafsky & Martin, Section 8.1.

Taylor – Chapter 3 – The text-to-speech problem

Discusses the differences between spoken and written forms of language, and describes the structure of a typical TTS system.

Tanner et al. – Multidimensional acoustic variation in vowels across English dialects

Looks at acoustics characteristics of vowels across many dialects of English.

Sharon Goldwater: Vectors and their uses

A nice, self-contained introduction to vectors and why they are a useful mathematical concept. You should consider this reading ESSENTIAL if you haven’t studied vectors before (or it’s been a while).

Sharon Goldwater: Basic probability theory

An essential primer on this topic. You should consider this reading ESSENTIAL if you haven’t studied probability before or it’s been a while. We’re adding this the readings in Module 7 to give you some time to look at it before we really need it in Module 9 – mostly we need the concepts of conditional probability and conditional independence.

Seeing Speech

Interactive IPA chart