eliminating F0
Hi,
As I understand it, the filterbank removes any evidence of F0 if the filters are wide enough. But if so, why is there still information about F0 if we go beyond 12 MFCCs?
Thanks
You are right that using a filterbank with suitably wide filters should eliminate all evidence of F0 in the resulting set of filter outputs. That’s the theory…
…however, even with wide filter bandwidths, there will still be some correlation between a filter’s output and F0: the amount of energy in a filter’s output will vary a little up/down as more/fewer harmonics happen to fall within its pass band.
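To make that concrete, here is a rough sketch (a toy model, not a real front end). It assumes an idealised voiced spectrum with equal-amplitude harmonics at multiples of F0 and a single triangular filter; the function name, centre frequency and half-width are made up for illustration. The harmonic weights are summed and scaled by F0, so the result would be constant if the filter were truly blind to F0.

```python
import numpy as np

def triangular_filter_energy(f0, centre=1000.0, half_width=200.0, f_max=4000.0):
    """Output of one triangular filter for a flat harmonic spectrum.

    Toy assumption: harmonics at k*f0 all have equal amplitude. Scaling the
    sum by f0 approximates the area under the filter (= half_width), so any
    deviation from that value is caused purely by where the harmonics fall.
    """
    harmonics = np.arange(1, int(f_max // f0) + 1) * f0
    # Triangular weighting: 1 at the centre, falling linearly to 0 at +/- half_width
    weights = np.clip(1.0 - np.abs(harmonics - centre) / half_width, 0.0, None)
    return f0 * weights.sum()

f0_values = np.arange(80.0, 261.0, 10.0)
energies = np.array([triangular_filter_energy(f0) for f0 in f0_values])

for f0, energy in zip(f0_values, energies):
    print(f"F0 = {f0:5.0f} Hz  ->  filter output = {energy:6.1f}")
```

The printed values move around the filter's area rather than sitting exactly on it, and the movement depends only on F0. Increasing `half_width` in this sketch should shrink the relative fluctuation, but it does not remove it for every F0.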
So, even with wide filters, the main motivation for taking the cepstrum is still to obtain a decorrelated representation, and we should still truncate it: this discards the higher cepstral coefficients, which are the ones that correlate most with F0, and also reduces dimensionality.
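As an illustration of why truncation helps, here is a small sketch under strong simplifying assumptions. The log filterbank outputs are modelled as a slowly varying envelope plus a small, rapidly varying ripple across filters, which stands in for the F0-related fluctuation. The 26 filters, the shapes of the two components and the choice to count the first 12 coefficients from index 0 are all assumptions for the example. The DCT is written out by hand in NumPy rather than taken from a library.

```python
import numpy as np

def dct_ii(x):
    """Unnormalised DCT-II: c[k] = sum_m x[m] * cos(pi * (m + 0.5) * k / N)."""
    n = len(x)
    m = np.arange(n)
    k = np.arange(n)[:, None]
    basis = np.cos(np.pi * (m + 0.5) * k / n)
    return basis @ x

n_filters = 26                      # hypothetical filterbank size
m = np.arange(n_filters)

# Slow variation across filters: stand-in for the spectral envelope
smooth_envelope = 2.0 * np.cos(np.pi * (m + 0.5) * 2 / n_filters)
# Fast variation across filters: stand-in for residual F0-related ripple
ripple = 0.3 * np.cos(np.pi * (m + 0.5) * 20 / n_filters)

log_energies = smooth_envelope + ripple
cepstrum = dct_ii(log_energies)

n_keep = 12                         # keep only the low-order coefficients
truncated = cepstrum[:n_keep]

print("kept coefficients     :", np.round(truncated, 2))
print("discarded coefficients:", np.round(cepstrum[n_keep:], 2))
```

In this toy case the envelope lands entirely in a low-order coefficient and the ripple entirely in a high-order one, so truncation removes the ripple and keeps the envelope. Real speech will not separate this cleanly, but the tendency is the same.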