- This topic has 1 reply, 2 voices, and was last updated 7 years, 10 months ago by .
Viewing 1 reply thread
Viewing 1 reply thread
- You must be logged in to reply to this topic.
› Forums › Foundations of speech › Phonetics and speech science › Source-filter model in current research
How is the source-filter model incorporated into modern speech synthesis technologies like HMM and DNN systems? Do these systems choose from a library of individual, pre-generated acoustic sounds and then arrange the sounds to improve things like intonation, stress, and rhythm?
This is a little beyond the Speech Processing course, but is covered fully in the more advanced Speech Synthesis course.
The short answer is that most statistical parametric speech synthesisers (whether HMM or DNN) use a source-filter model to generate the waveform. The HMM or DNN predicts the parameters of the source (e.g., F0) and of the filter (e.g., its frequency response).
Some forums are only available if you are logged in. Searching will only return results from those forums if you log in.
Copyright © 2024 · Balance Child Theme on Genesis Framework · WordPress · Log in