By comparing the recogniser's output with the hand-labelled test data, we can compute the Word Error Rate (WER).
Now we are ready to run the recogniser on some test data. You should run this on some existing isolated test digits (one digit per wav file) and the MFCC files from them. We run the Viterbi decoder using HVite. The script recognise_test_data
does this. The output is stored in the rec
directory. Look at the recogniser output and compare it to the correct answers. Calculate the Word Error Rate (WER) using the results
script.
Again, you’ll need to edit the scripts to use a specific user (e.g. simonk
) and the full data
directory, rather than `whoami`
and the data_upload
directory.