Estimating the parameters of an HMM (called “training the model”) will come a little later. I think it’s better to understand the recognition algorithm first, because it is simpler.
Reading
Jurafsky & Martin – Section 9.5 – The lexicon and language model
Simply mentions the lexicon and language model and refers the reader to other chapters.
Jurafsky & Martin – Section 9.6 – Search and Decoding
Important material on efficiently computing the combination of the acoustic model's likelihood and the language model's probability.
Conditional probability & Bayes' rule
We can combine a probabilistic model of our prior beliefs with a probabilistic model of the signal being classified.
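Written out as a sketch, using W for the word sequence and O for the observation sequence (matching the notation in the next two sections):

```latex
% Bayes' rule for recognition: posterior = likelihood x prior / evidence.
% P(O) does not depend on W, so it can be dropped when maximising over W.
\begin{align}
  P(W \mid O) &= \frac{P(O \mid W)\, P(W)}{P(O)} \\
  \hat{W}     &= \operatorname*{arg\,max}_{W}\; P(O \mid W)\, P(W)
\end{align}
```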
Computing P(O|W) with an HMM
This term is called the "likelihood": the conditional probability that the HMM for a particular word sequence (W) generated the given observation sequence (O).
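Purely as an illustration, here is a minimal forward-algorithm sketch for a discrete-observation HMM (NumPy assumed, with made-up toy parameters). A real acoustic model uses continuous output densities and works in the log domain, but the sum-over-all-paths recursion that gives P(O|W) is the same.

```python
import numpy as np

def forward_likelihood(A, B, pi, observations):
    """Compute P(O | model) with the forward algorithm.

    A:  (N, N) transition probabilities, A[i, j] = P(next state j | state i)
    B:  (N, M) emission probabilities,  B[i, k] = P(observation k | state i)
    pi: (N,)   initial state probabilities
    observations: sequence of integer observation symbols
    """
    # Initialise: start in each state and emit the first observation
    alpha = pi * B[:, observations[0]]
    # Recurse: sum over all paths reaching each state at each time step
    for o_t in observations[1:]:
        alpha = (alpha @ A) * B[:, o_t]
    # Terminate: sum over all states at the final time step
    return alpha.sum()

# Toy example: 2 states, 3 observation symbols (values are invented)
A  = np.array([[0.7, 0.3],
               [0.4, 0.6]])
B  = np.array([[0.5, 0.4, 0.1],
               [0.1, 0.3, 0.6]])
pi = np.array([0.6, 0.4])
print(forward_likelihood(A, B, pi, [0, 1, 2]))  # P(O | this HMM)
```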
Computing P(W) with a language model
This is the "prior" probability of W. It doesn't involve the observation sequence O, so it can be computed without looking at the speech to be recognised.
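To make that concrete, here is a toy bigram language model sketch; the corpus, function names, and the lack of smoothing are all my own simplifications. The point is that P(W) is estimated from text alone and never touches the observation sequence.

```python
from collections import Counter

def train_bigram_lm(corpus):
    """Estimate bigram probabilities P(w_n | w_{n-1}) from a toy text corpus."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        words = ["<s>"] + sentence.split() + ["</s>"]
        unigrams.update(words[:-1])
        bigrams.update(zip(words[:-1], words[1:]))
    return {bg: count / unigrams[bg[0]] for bg, count in bigrams.items()}

def prior_probability(lm, sentence):
    """P(W) as a product of bigram probabilities (no smoothing, so unseen
    bigrams get probability zero)."""
    words = ["<s>"] + sentence.split() + ["</s>"]
    p = 1.0
    for bigram in zip(words[:-1], words[1:]):
        p *= lm.get(bigram, 0.0)
    return p

corpus = ["the cat sat", "the dog sat", "the cat ran"]
lm = train_bigram_lm(corpus)
print(prior_probability(lm, "the dog sat"))  # computed without any acoustics
```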