speech.zone

Simon November 15, 2014

Token passing

slownormalfast

Token passing is a really nice way to understand (and even to implement) Viterbi search for Hidden Markov Models. Here we see token passing in action, and you can look at the spreadsheet to see the calculations.

To keep things simple, we are ignoring transition probabilities in this example. It would be simple to add them – tokens would just multiply their likelihood by the transition probability every time they went down an arc.

To learn more, read this tech report.

Filed Under: Models, Recognition Tagged With: HMMs, spreadsheet, video

Simon February 1, 2015

Autocorrelation for estimating F0

Most methods for estimating F0 start from autocorrelation. The idea is pretty simple: we are just looking for a repeating pattern in the waveform, which corresponds to the periodic vocal fold activity. For some waveforms, it might be possible to do that directly in the time domain, but in general that doesn’t work very well. […]

Filed Under: Signals Tagged With: spreadsheet, video

Simon October 30, 2015

Wave propagation on the surface of water

At the Alhambra (Granada, Spain) I saw this nice example of waves from a point source propagating in all directions at a fixed speed.

Filed Under: Signals Tagged With: video

Simon October 11, 2014

A simple synthetic vowel

Using Praat, we synthesise a simple vowel-like sound, starting with a pulse train, which we pass through a filter with resonant peaks.

Filed Under: Synthesis Tagged With: Harmonics, praat, Source-filter model, Spectral envelope, video

Simon October 11, 2014

Pipeline architecture for TTS

Most text-to-speech systems split the problem into two main stages. The first stage is called the front end and contains many separate processes which gradually build up a linguistic specification from the input text. The second stage typically uses language-independent techniques (although they still require a language-specific speech corpus) to generate a waveform. Here we see those two […]

Filed Under: Synthesis Tagged With: front end, video, waveform generation

Simon October 11, 2014

Sampling and quantisation

Is digital better than analogue? Here we discover that there are limitations when storing waveforms digitally. We learn that the consequence of sampling at a fixed rate is an upper limit on the frequencies that can be represented, called the Nyquist frequency. In addition to the limitations of sampling, storing each sample of the waveform as a […]

Filed Under: Signals Tagged With: Digital signal, video, Wavesurfer

Simon October 11, 2014

Spectrum and spectrogram

The spectrum and the spectrogram are much more useful ways of analysing speech signals than the waveform. We look at how to create them using Wavesurfer and what effect the analysis window size has on what we see.

Filed Under: Signals Tagged With: Frequency domain, video, Wavesurfer

Simon October 12, 2014

Entropy: understanding the equation

The equation for entropy is very often presented in textbooks without much explanation, other than to say it has the desired properties. Here, I attempt an informal derivation of the equation starting from uniform probability distributions. A good way to think about information is in terms of sending messages. In the video, we send messages […]

Filed Under: Probability Tagged With: entropy, equations, video

Simon October 11, 2014

Windowing

When we say that a signal is non-stationary we mean that its properties, such as the spectrum, change over time. To analyse signals like this, we need to first assume that these properties do not change over some short period of time, called the frame. We can then analyse individual frames of the signal, one at a […]

Filed Under: Signals Tagged With: Short-term analysis, video, Wavesurfer

Simon February 6, 2012

My inaugural lecture

I talk about how speech synthesis works, in what I hope is a non-technical and accessible way, and finish off with an application of speech synthesis that gives personalised voices to people who are losing the ability to speak. I also try to mention bicycles as many times as possible. For a more up-to-date, slightly more technical, […]

Filed Under: Synthesis Tagged With: lecture, video

Simon November 1, 2022

Bitrate

The bitrate (or bit rate) of a signal is the number of bits required to store, or transmit, 1 s of that signal. A bit is a binary number: either 0 or 1. Let’s calculate the bitrate of a digital waveform. First you should revise the concepts of sampling and quantisation from this module of the […]

Filed Under: Signals Tagged With: Digital signal

Simon October 31, 2015

The speed of sound

At the Parque de las Ciencias in Granada, Spain there is this long tube, open at the end nearest you and closed at the far end. We can calculate the length of this tube just from the audio recording, because we know the speed of sound. Here’s the waveform of part of the recording, showing […]

Filed Under: Signals Tagged With: video, Wavesurfer

Token passing

Autocorrelation for estimating F0

Wave propagation on the surface of water

A simple synthetic vowel

Pipeline architecture for TTS

Sampling and quantisation

Spectrum and spectrogram

Entropy: understanding the equation

Windowing

My inaugural lecture

Bitrate

The speed of sound

Search this site

Posts

Latest Activity