- This topic has 1 reply, 2 voices, and was last updated 4 years, 9 months ago by .
Viewing 1 reply thread
Viewing 1 reply thread
- You must be logged in to reply to this topic.
› Forums › Automatic speech recognition › Features › Trading Off Time and Frequency Resolution
The slides detail the different properties of long windows vs. short windows. Please could you explain a little bit more about what time resolution and frequency resolution mean?
I infer frequency resolution to be not having enough frequencies to accurately reproduce a signal, almost like how the trapezium rule with too few trapezoids is not an accurate estimation for an integral (though I’m not sure). But I cannot understand what time resolution might mean.
Thanks you in advance.
The Fourier transform is invertible, which means that you get as many data points out as you put in. This means that using a longer analysis window in the time domain gives you more points (often referred to as “FFT bins”) in the frequency domain magnitude spectrum (we will ignore phase). Those points (“bins”) are equally spaced from 0 Hz up to the Nyquist frequency.
So, a longer analysis frame in the time domain means that the spectrum has more detail: higher frequency resolution.
The Fourier spectrum is effectively an average over the entire analysis frame (formally, we are making an assumption that the spectrum is constant throughout the frame). A longer time window means averaging over a longer duration of the signal and thus being less precise in the time domain: lower time resolution.
There is a trade-off between time and frequency resolution. In practical terms, this means we have to choose an analysis frame length that is appropriate for our signal. For speech, 25 ms is a common choice.
Some forums are only available if you are logged in. Searching will only return results from those forums if you log in.
Copyright © 2024 · Balance Child Theme on Genesis Framework · WordPress · Log in