- This topic has 1 reply, 2 voices, and was last updated 7 years, 9 months ago by .
Viewing 1 reply thread
Viewing 1 reply thread
- You must be logged in to reply to this topic.
› Forums › Automatic speech recognition › Dynamic Time Warping (DTW) › Best path/local distance
To clarify, is the local distance in DTW the geometric distance between the vectors at each point in the grid? And this would mean that the unknown word and reference word that are most similar would have the most similar vectors, and therefore the shortest local distances/total distance?
Is DTW repeated on the unknown word for each reference word until a best path is found?
Correct, the local distance in DTW is the geometric distance between the pair of feature vectors at a given point in the grid.
We hope that the total distance (usually denoted D), which is the sum of local distances, will be lowest for the template that actually corresponds to what was said in the unknown word.
For a single, given unknown word, DTW is repeated once for every template. In each case, DTW finds the best path that aligns the unknown word with the current template being tried. This results in a separate value for D for each template. We then compare all those D values and pick the lowest.
Some forums are only available if you are logged in. Searching will only return results from those forums if you log in.
Copyright © 2024 · Balance Child Theme on Genesis Framework · WordPress · Log in