Waveform manipulation

In speech synthesis, it may be necessary to modify the F0 (fundamental frequency), duration, or other properties of recorded speech.
  • TD-PSOLA

    Time-domain pitch-synchronous overlap-and-add is a remarkably simple but effective way to independently modify the duration and F0 of speech.

  • Linear prediction

    Linear prediction is a more powerful technique than TD-PSOLA: it separates speech into a source and a filter, so that each can be manipulated independently.
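
To make the TD-PSOLA idea more concrete, here is a minimal sketch in Python. It is an illustration under simplifying assumptions, not a reference implementation: pitch marks (one per pitch period) are assumed to be given already, each segment is a Hann-shaped window spanning one local period on either side of its mark, and no amplitude normalisation is applied when the overlap changes. The function name `td_psola` and the parameters `f0_scale` and `dur_scale` are hypothetical names chosen for this example.

```python
import math

def td_psola(signal, marks, f0_scale=1.0, dur_scale=1.0):
    """Overlap-add pitch-synchronous segments at a new spacing.

    signal    : list of float samples
    marks     : increasing sample indices of pitch marks (assumed given)
    f0_scale  : > 1 raises F0, < 1 lowers it
    dur_scale : > 1 lengthens the signal, < 1 shortens it
    """
    out = [0.0] * int(round(len(signal) * dur_scale))

    # Local pitch period at each mark; the last mark reuses the previous period.
    periods = [marks[i + 1] - marks[i] for i in range(len(marks) - 1)]
    periods.append(periods[-1])

    t = marks[0] * dur_scale                  # first synthesis mark (output time)
    while t <= marks[-1] * dur_scale:
        # Map the output time back to the input and pick the nearest analysis mark.
        src = t / dur_scale
        i = min(range(len(marks)), key=lambda j: abs(marks[j] - src))
        p = periods[i]
        centre = int(round(t))

        # Copy a windowed segment of about two periods, centred on the mark.
        for k in range(-p, p + 1):
            n_in, n_out = marks[i] + k, centre + k
            if 0 <= n_in < len(signal) and 0 <= n_out < len(out):
                w = 0.5 * (1.0 + math.cos(math.pi * k / p))   # Hann-shaped window
                out[n_out] += w * signal[n_in]

        # Spacing between synthesis marks sets the new F0.
        t += p / f0_scale
    return out
```

In this sketch, duration is changed by repeating or skipping segments (the mapping from output time back to the nearest analysis mark), while F0 is changed by the spacing at which segments are overlap-added, which is why the two can be controlled independently. With both scale factors left at 1.0, adjacent windows sum to one between the first and last marks, so the input is reproduced there.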

Reading

Taylor – Section 10.1 – Analogue signals

It's easier to start by understanding physical signals, which are analogue, before approximating them digitally.

Taylor – Section 10.2 – Digital signals

Going digital involves approximations in the way an original analogue signal is represented.
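
As a rough illustration of the kind of approximation involved, the sketch below samples a sine wave at discrete times and rounds each sample to a limited number of amplitude levels. It assumes a unit-amplitude sine and a simple uniform quantiser without clipping; the function name `sample_and_quantise` and its parameters are hypothetical, chosen only for this example, and are not taken from Taylor.

```python
import math

def sample_and_quantise(f_hz, fs_hz, n_samples, bits):
    """Sample a unit-amplitude sine wave and quantise it uniformly.

    f_hz      : frequency of the sine wave in Hz
    fs_hz     : sampling rate in Hz (approximation in time)
    n_samples : number of samples to generate
    bits      : bit depth (approximation in amplitude)
    """
    step = 2.0 / (2 ** bits)          # spacing between levels over the range -1 to 1
    exact = [math.sin(2 * math.pi * f_hz * n / fs_hz) for n in range(n_samples)]
    quantised = [round(x / step) * step for x in exact]
    return exact, quantised

exact, quantised = sample_and_quantise(100.0, 8000.0, 80, bits=8)
worst = max(abs(a - b) for a, b in zip(exact, quantised))
print("largest quantisation error:", worst)
```

The largest error can never exceed half the step size, so each additional bit halves the worst-case rounding error in this sketch.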