Forum Replies Created

Viewing 15 posts - 1 through 15 (of 1,084 total)

1 2 3 … 71 72 73 →

Author

Posts
February 20, 2025 at 13:02 in reply to: More space needed #18275
Simon
Professor
You can ignore these warnings – it’s OK if some things in the cache cannot be deleted (e.g., because they are still in use by running programs such as the linux desktop manager).

It looks like you have large files elsewhere. Send me a direct email or Teams message with the output of the du commands in this post above.

(Also, try to avoid screenshots – they are not searchable for others using the forums – it’s better to copy-paste the text into a post.)
February 19, 2025 at 13:32 in reply to: Disk quota exceeded #18272
Simon
Professor
You need to figure out what is taking up all the space, then delete some of it.

This post, and the thread below it, will help you learn how to do that.
December 3, 2024 at 15:52 in reply to: Unable to update train models in hmm1 directory #18220
Simon
Professor
You are telling HRest to load models/proto/$PROTO whereas it should start with the models just created by HInit, although this won’t actually cause an error in HTK.

Look for an error message from HRest – if it is failing to create any models, it should report an error.

Remember to always wipe all models (from hmm0 and hmm1) before every experiment: this will help you catch errors.
November 30, 2024 at 17:33 in reply to: Recognise Test data #18200
Simon
Professor
You should use either a loop (around all the files to be recognised), or the -S option (to pass the name of a file, in which all files to be recognised are listed), but not both.

The -S option will generally be faster(*). Why is that?

(*) although you might not notice the difference on the lab computers, because network speed typically dominates the run-time.
November 27, 2024 at 12:51 in reply to: MFCCs + Decoding #18176
Simon
Professor
Yes, there are a total of 39 elements in the feature vectors. 12 of them are the MFCCs. You have 12+13+13=38 though.

The language model computes P(W) where W is the word sequence of one utterance to be recognised. For the digit recogniser, can you locate and inspect the language model that is being used?

The acoustic model computes P(O|W) where O is the observation sequence. How does O relate to the MFCCs?

How are P(O|W) and P(W) combined to calculate P(W|O), and why do we need to do that?
November 25, 2024 at 17:33 in reply to: AI Use for Coding #18166
Simon
Professor
You should also exercise extreme caution in uploading data to external AI tools, since they are likely to retain this data and potentially include it in the training data for a future update of the tool.

You do not have permission to share any of the data for this assignment outside the University.
November 25, 2024 at 17:30 in reply to: AI Use for Coding #18165
Simon
Professor
Please read “Please briefly describe if you use any external Artificial Intelligence (AI) related tools in doing this assignment…” which applies to all use of such tools.

Whilst professional programmers certainly use AI “co-pilots” to write code faster, I strongly recommend against this for beginners: for one thing, you do not yet have the skill to judge whether the resulting code is correct!

You will learn a lot more, and build more confidence, doing it yourself from scratch.
November 20, 2024 at 08:53 in reply to: Access to Atli’s Notion Notes #18125
Simon
Professor
These notes are not currently available – there is everything you need here on speech.zone and the forums.
November 1, 2024 at 08:08 in reply to: Using Tables or Figures from Cited Work #18087
Simon
Professor
Reproducing a figure from another source may not be the best way to show your understanding.

If you really want to do this, then you need to cite the source in the caption of the figure, in the same way that you would cite the source of a direct text quote.

It’s up to you whether to include the original figure, or redraw it.
October 31, 2024 at 12:12 in reply to: In-Text Citation #18067
Simon
Professor
No, they are not. I’ve clarified that in the Formatting instructions.
October 29, 2024 at 08:47 in reply to: Access to labs #18048
Simon
Professor
You can check when AT 4.02 is available (i.e., whenever there is no class scheduled) by visiting timetables then searching for 4.02 and selecting the one in Appleton Tower. This link should take you directly there (although you may need to authenticate with EASE first).
May 13, 2024 at 13:09 in reply to: Axes of a plotted filter #17738
Simon
Professor
Key points

You are correct that the x-axis (horizontal) will be frequency, and will be labelled in units of Hertz (Hz).

The vertical axis is magnitude, which is most commonly plotted on a logarithmic scale and is therefore labelled in decibels (dB).

Additional detail

Magnitude is a ratio (in this case, of filter output to its input), and therefore has no units: formally, we say it is dimensionless. So dB are not actually a unit, but a scale.
April 12, 2024 at 08:03 in reply to: emulabel #17730
Simon
Professor
emulabel is an outdated program mentioned in some old documentation on festvox.org

In Pitchmark the speech, the command make_pmlab_pm creates label files from the pitchmarks, and places them in the pm_lab directory. These can be viewed in the same way as any other label file (such as the aligned phone labels), e.g., using wavesurfer.
April 10, 2024 at 13:36 in reply to: Upload Audio Files to Qualtrics #17724
Simon
Professor
You can use Qualtrics to build the survey, but host your audio files somewhere else, then enter their URLs into Qualtrics.

You can host the audio files anywhere that is able to provide a URL for the file. For example, a free github page, which might give you URLs like this:
```
https://jonojace.github.io/IS19-robustness-audio-samples/figure3/g_100clean.wav
```
April 10, 2024 at 11:03 in reply to: About abstract and introduction #17720
Simon
Professor
Yes, you need both an abstract an an introduction.
Author

Posts

Viewing 15 posts - 1 through 15 (of 1,084 total)

1 2 3 … 71 72 73 →

Simon

Forum Replies Created

Search the forums

Note

Latest Activity

Search the forums

Speech Synthesis