speech processing module 09

Jurafsky & Martin (3rd Ed) – Hidden Markov models

An overview of Hidden Markov Models, the Viterbi algorithm, and the Baum-Welch algorithm

Furui et al: Fundamental Technologies in Modern Speech Recognition

A complete issue of IEEE Signal Processing Magazine. Although a few years old, this is still a very useful survey of current techniques.

Holmes & Holmes – Chapter 9 – Stochastic Modelling

May be helpful as a complement to the essential readings.

Holmes & Holmes – Chapter 11 – Improving Speech Recognition Performance

We mitigate the over-simplifications of the model using ever-more-complex algorithms.

Young et al: Token Passing

My favourite way of understanding how the Viterbi algorithm is applied to HMMs. Can also be helpful in understanding search for unit selection speech synthesis.

Jurafsky & Martin – Section 9.4 – Acoustic Likelihood Computation

To perform speech recognition with HMMs involves calculating the likelihood that each model emitted the observed speech. You can skip 9.4.1 Vector Quantization.

Jurafsky & Martin – Section 9.2 – The HMM Applied to Speech

Introduces some notation and the basic concepts of HMMs.

Jurafsky & Martin (3rd Ed) – Hidden Markov models

Furui et al: Fundamental Technologies in Modern Speech Recognition

Holmes & Holmes – Chapter 9 – Stochastic Modelling

Holmes & Holmes – Chapter 11 – Improving Speech Recognition Performance

Young et al: Token Passing

Jurafsky & Martin – Section 9.4 – Acoustic Likelihood Computation

Jurafsky & Martin – Section 9.2 – The HMM Applied to Speech

Search this site

Posts

Latest Activity

Search the forums