Complementary to Jurafsky & Martin, Section 8.1.
Taylor – Chapter 3 – The text-to-speech problem
Discusses the differences between spoken and written forms of language, and describes the structure of a typical TTS system.
Jurafsky & Martin – Section 3.5 – FSTs for Morphological Parsing
in Dan Jurafsky and James H. Martin “Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition”, 2009, Pearson Prentice Hall, Upper Saddle River, N.J., Second edition, ISBN 0135041961
Taylor – Chapter 8 – Pronunciation
Covers how the lexicon is stored, letter-to-sound conversion, and lexicon compression.
Taylor – Chapter 4 – Text Processing
Complementary to Jurafsky & Martin, Section 8.1.
Jurafsky & Martin (2nd ed) – Section 8.2 – Phonetic Analysis
Each word in the normalised text needs a pronunciation. Most words will be found in the dictionary, but for the remainder we must predict pronunciation from spelling.
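As a rough illustration of the lookup-then-predict idea described above, here is a minimal Python sketch. The lexicon entries, the phone symbols, and the one-letter-to-one-phone fallback table are all invented for this example; a real system would use a full pronunciation dictionary and a trained letter-to-sound model rather than this naive mapping.

```python
# Toy pronunciation lexicon; entries and phone symbols are illustrative only.
LEXICON = {
    "the": ["dh", "ax"],
    "cat": ["k", "ae", "t"],
}

# Deliberately naive fallback: one phone per letter (a hypothetical table,
# standing in for a proper letter-to-sound model).
LETTER_TO_PHONE = {
    "a": "ae", "b": "b", "d": "d", "g": "g", "i": "ih", "k": "k",
    "m": "m", "n": "n", "o": "aa", "p": "p", "s": "s", "t": "t",
}


def pronounce(word):
    """Return (phones, source) for a word.

    The lexicon is consulted first; only words missing from it fall
    back to prediction from spelling.
    """
    key = word.lower()
    if key in LEXICON:
        return LEXICON[key], "lexicon"
    # Letters without an entry in the toy table are silently skipped.
    phones = [LETTER_TO_PHONE[ch] for ch in key if ch in LETTER_TO_PHONE]
    return phones, "letter-to-sound"


if __name__ == "__main__":
    for w in ["cat", "bat"]:
        print(w, pronounce(w))
```

Returning the source alongside the phones makes it easy to see which words were covered by the dictionary and which had to be predicted.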
Jurafsky & Martin (2nd ed) – Section 8.1 – Text Normalisation
We need to normalise the input text so that it contains a sequence of pronounceable words.
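A minimal Python sketch of what normalisation into pronounceable words might look like is given below. The abbreviation table and the digit-by-digit reading of numbers are simplifying assumptions made for this example; they ignore ambiguity (for instance "St." as "saint" versus "street") and the many other non-standard word types that the reading covers.

```python
import re

# Hypothetical abbreviation table; real abbreviations are often ambiguous.
ABBREVIATIONS = {"dr.": "doctor", "st.": "street"}

DIGIT_WORDS = ["zero", "one", "two", "three", "four",
               "five", "six", "seven", "eight", "nine"]


def normalise(text):
    """Turn raw text into a list of lower-case, pronounceable words."""
    words = []
    for token in text.lower().split():
        if token in ABBREVIATIONS:
            # Expand known abbreviations to full words.
            words.append(ABBREVIATIONS[token])
        elif re.fullmatch(r"[0-9]+", token):
            # Simplification: read numbers digit by digit.
            words.extend(DIGIT_WORDS[int(d)] for d in token)
        else:
            # Strip punctuation, keeping letters and apostrophes.
            cleaned = re.sub(r"[^a-z']", "", token)
            if cleaned:
                words.append(cleaned)
    return words


if __name__ == "__main__":
    print(normalise("Dr. Smith lives at 42 Elm St."))
```

Each output word can then be passed on to the pronunciation stage.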