speech.zone

Simon October 11, 2014

Aliasing

In sampling and quantisation we saw that sampling a signal at a fixed rate means that there is an upper limit on the frequencies that can be represented. This limit is called the Nyquist frequency. Before sampling a signal, we must remove all energy above the Nyquist frequency, and here we will see what would […]

Filed Under: Signals Tagged With: Digital signal, video

Simon October 11, 2014

Pipeline architecture for TTS

Most text-to-speech systems split the problem into two main stages. The first stage is called the front end and contains many separate processes which gradually build up a linguistic specification from the input text. The second stage typically uses language-independent techniques (although they still require a language-specific speech corpus) to generate a waveform. Here we see those two […]

Filed Under: Synthesis Tagged With: front end, video, waveform generation

Simon July 9, 2024

CUI 2024 video available

The video and slides from Simon’s keynote are now online under Courses > One-off events.

Filed Under: Uncategorized

Simon October 11, 2014

Windowing

When we say that a signal is non-stationary we mean that its properties, such as the spectrum, change over time. To analyse signals like this, we need to first assume that these properties do not change over some short period of time, called the frame. We can then analyse individual frames of the signal, one at a […]

Filed Under: Signals Tagged With: Short-term analysis, video, Wavesurfer

Simon October 11, 2014

Classification and regression trees (CART)

A quick introduction to a very simple but widely-applicable model that can perform classification (predicting a discrete label) or regression (predicting a continuous value). The tree is learned from labelled data, using supervised learning. Before watching this video, you might want to check that you understand what Entropy is.

Filed Under: Models Tagged With: Classification, Decision tree, Learning decision trees, supervised learning, video

Simon October 11, 2014

Spectrum and spectrogram

The spectrum and the spectrogram are much more useful ways of analysing speech signals than the waveform. We look at how to create them using Wavesurfer and what effect the analysis window size has on what we see.

Filed Under: Signals Tagged With: Frequency domain, video, Wavesurfer

Simon October 11, 2014