Evaluation

By comparing the recogniser's output with the hand-labelled test data, we can compute the Word Error Rate (WER).

Now we are ready to run the recogniser on some test data. You should run this on some existing isolated test digits (one digit per wav file) and the MFCC files from them. We run the Viterbi decoder using HVite. The script recognise_test_data does this. The output is stored in the rec directory. Look at the recogniser output and compare it to the correct answers. Calculate the Word Error Rate (WER) using the results script.

Again, you’ll need to edit the scripts to use a specific user (e.g. simonk) and the full data directory, rather than `whoami` and the data_upload directory.