Forum Replies Created
Getting closer… some more clues:
The lowest value of k (apart from 0) is 1. That’s the lowest frequency DFT “bin” and corresponds to the lowest frequency basis function. The lowest frequency basis function makes just one cycle within the analysis frame. So its frequency in Hertz must be 1 / “duration of the analysis frame in seconds”.
We can calculate the duration of the analysis window in seconds if we know how many samples are in the frame and what the sampling frequency is: there is a simple relationship between these three values.
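That relationship, and the bin frequencies that follow from it, can be sketched in a few lines of Python. The function names and the example values (16 kHz sampling rate, 480-sample frame) are assumptions chosen for illustration, not values from the question.

```python
def frame_duration_seconds(num_samples, sampling_rate_hz):
    """Duration of the analysis frame: number of samples / samples per second."""
    return num_samples / sampling_rate_hz


def bin_frequency_hz(k, num_samples, sampling_rate_hz):
    """Frequency of DFT bin k: basis function k makes k cycles within the frame."""
    return k / frame_duration_seconds(num_samples, sampling_rate_hz)


# Example values (assumptions): 16 kHz sampling rate, 480-sample frame.
sampling_rate_hz = 16000
num_samples = 480

print("frame duration (s):", frame_duration_seconds(num_samples, sampling_rate_hz))
for k in range(1, 4):
    print("bin", k, "->", bin_frequency_hz(k, num_samples, sampling_rate_hz), "Hz")
```

Each bin is the same distance from its neighbours, and that spacing equals the frequency of bin k=1.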
Now that we know the frequency of the first bin, k=1, we can work out the frequency of the bin k=2, and so on all the way up to K.
In the magnitude spectrum (i.e., the frequency domain), we see the frequencies present in whatever signal we are analysing: the analysis frame. In the video at that time point, the analysis frame is the visible portion of waveform on the left, which has a duration of 30 ms and contains multiple pitch periods.
You can use the DFT to analyse any amount of waveform (= the analysis frame) that you wish. If you analyse only one pitch period, you will not see any evidence of periodicity. If you analyse a waveform that contains multiple pitch periods (as in the video at that time point) then you will see evidence of periodicity in the magnitude spectrum: harmonics.
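A small experiment can make this visible. The sketch below is only an illustration under strong assumptions: an impulse train stands in for voiced speech, the pitch period of 80 samples is made up, and the DFT is computed directly rather than with an FFT library.

```python
import cmath


def dft_magnitudes(frame):
    """Magnitudes of DFT bins 0..N/2 for a real-valued frame (direct O(N^2) DFT)."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        # Zero-valued samples contribute nothing, so they are skipped.
        total = sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                    for t, x in enumerate(frame) if x != 0.0)
        mags.append(abs(total))
    return mags


def impulse_train(num_samples, period):
    """One unit impulse at the start of every pitch period, zeros elsewhere."""
    return [1.0 if t % period == 0 else 0.0 for t in range(num_samples)]


PERIOD = 80  # samples per pitch period (assumed value)

one_period = dft_magnitudes(impulse_train(PERIOD, PERIOD))
six_periods = dft_magnitudes(impulse_train(6 * PERIOD, PERIOD))

# Bins whose magnitude is not (numerically) zero.
nonzero_one = [k for k, m in enumerate(one_period) if m > 1e-6]
nonzero_six = [k for k, m in enumerate(six_periods) if m > 1e-6]

print("one period, non-zero bins: ", nonzero_one)
print("six periods, non-zero bins:", nonzero_six)
```

With a single pitch period in the frame, every bin has the same magnitude, so nothing in the spectrum reveals the period. With six periods in the frame, energy appears only at every sixth bin: those regularly spaced peaks are the harmonics.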
You have two separate problems. The solution to the choppy sound issue is probably this.
The “Feature Word not defined” error means that either
a) you are not using the correct voice (which is loaded by config.scm), or
b) you omitted a step in the pipeline.
Don’t change the PATH back to what it was before – this will pick up the old version of Festival, which will not work for the assignment.
Try also looking in the VMware settings for this virtual machine – there are some options to fit the VM to the window and so on – experiment with those.
The other possibility is that your host computer has a very high resolution screen – see if you can adjust its settings to something lower resolution, although that will affect the host operating system too. This does the trick for me on my Mac.
Yes, nearly. In TD-PSOLA, our representation of the filter is the impulse response. This is a single pitch period. A diphone is a sequence of pitch periods.
Why can’t we store a single pitch period to represent the filter for a complete diphone?
total_entropy = ( sum(left_counts)*entropy(left_counts) + sum(right_counts)*entropy(right_counts) ) / (sum(left_counts) + sum(right_counts))
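As a runnable sketch of that formula: the `entropy` function below is an assumed implementation (entropy in bits, computed from a list of class counts) and may differ from the one in the notebook.

```python
import math


def entropy(counts):
    """Entropy in bits of the distribution implied by a list of class counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    # Classes with zero count contribute nothing to the entropy.
    probs = [c / total for c in counts if c > 0]
    return sum(-p * math.log2(p) for p in probs)


def total_entropy(left_counts, right_counts):
    """Entropy of the two partitions, weighted by how many items each contains."""
    n_left, n_right = sum(left_counts), sum(right_counts)
    return (n_left * entropy(left_counts)
            + n_right * entropy(right_counts)) / (n_left + n_right)


# A perfect split, a useless split, and something in between.
print(total_entropy([4, 0], [0, 4]))
print(total_entropy([2, 2], [2, 2]))
print(total_entropy([4, 0], [2, 2]))
```

Weighting by the number of items means a large, impure partition counts for more than a small one.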
Rename (or delete) the notebook and ‘git pull’ to get a fresh copy of the original.
Which cell of which notebook are you referring to?
At the end of the part you want to select, click and hold the left mouse button. Then move the mouse up – you’ll start selecting text and eventually the window will scroll upwards until you’ve selected everything you need.
(There should be a keyboard shortcut, but I can’t get it to work.)
Make sure you activate the slp Python virtual environment before installing these packages, and then make sure you run jupyter notebook with that environment activated.
You can tell that an environment is active because the shell prompt changes to include “(slp)”.
The source-filter model is a conceptual model to help us understand speech signals, by creating a model of how they are produced.
We can use this model in many different ways, to do many different things with speech signals. Sometimes we actually implement the model directly, other times we use it as motivation and use it indirectly.
A direct application would be to use the model itself to process speech. We would fit the filter to natural speech, thus finding suitable values for the coefficients of the difference equation defining the filter. Then, we might generate synthetic speech at an F0 and duration of our choice by constructing an appropriate impulse train as the excitation signal to input to the filter.
In this direct application, we manipulate source features (F0, duration) by changing the excitation signal. We can also independently manipulate the filter’s resonances (formant frequencies) by adjusting the coefficients of the difference equation. We have total control over the signal.
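A minimal sketch of this direct use of the model follows. It is an illustration only: the filter has a single resonance whose coefficients are set by hand (500 Hz centre frequency, 100 Hz bandwidth, 16 kHz sampling rate, all assumed values) rather than fitted to natural speech, and all function names are hypothetical.

```python
import math


def impulse_train(num_samples, period):
    """Excitation signal: one unit impulse every `period` samples."""
    return [1.0 if n % period == 0 else 0.0 for n in range(num_samples)]


def resonator_coefficients(resonance_hz, bandwidth_hz, sampling_rate_hz):
    """Coefficients (a1, a2) of a two-pole filter with a single resonance."""
    r = math.exp(-math.pi * bandwidth_hz / sampling_rate_hz)   # pole radius
    theta = 2 * math.pi * resonance_hz / sampling_rate_hz      # pole angle
    return -2 * r * math.cos(theta), r * r


def all_pole_filter(excitation, a1, a2):
    """Difference equation: y[n] = x[n] - a1*y[n-1] - a2*y[n-2]."""
    y = []
    for n, x in enumerate(excitation):
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(x - a1 * y1 - a2 * y2)
    return y


FS = 16000  # sampling rate in Hz (assumed)
a1, a2 = resonator_coefficients(500.0, 100.0, FS)

# Same filter, two different sources: F0 of 100 Hz and of 200 Hz.
low = all_pole_filter(impulse_train(1600, FS // 100), a1, a2)
high = all_pole_filter(impulse_train(1600, FS // 200), a1, a2)
print(len(low), len(high), a1, a2)
```

Changing the impulse spacing changes F0 and leaves the resonance alone. Changing the arguments to `resonator_coefficients` moves the resonance and leaves F0 alone.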
But we can also use the model indirectly – that is, without ever writing down the difference equation or inputting an excitation signal into an actual filter. TD-PSOLA is an example of this way of using the source-filter model. Instead of using the difference equation to define the filter, we instead represent the filter as its impulse response (which is a waveform). TD-PSOLA only offers partial control over the signal. We cannot use it to manipulate the filter’s resonances because the impulse response is a waveform, and not a set of coefficients in an equation.
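The overlap-add idea behind this can be sketched as below. This is a deliberately simplified illustration, not a TD-PSOLA implementation: it omits pitch marking and windowing, and the `unit` waveform is a made-up toy impulse response rather than a pitch period taken from natural speech.

```python
def overlap_add(unit, period, num_periods):
    """Place a copy of `unit` every `period` samples and sum where copies overlap."""
    length = period * (num_periods - 1) + len(unit)
    out = [0.0] * length
    for p in range(num_periods):
        start = p * period
        for i, value in enumerate(unit):
            out[start + i] += value
    return out


# Toy impulse response (made-up values), standing in for the filter.
unit = [1.0, 0.5, 0.25, 0.125]

lower_f0 = overlap_add(unit, period=6, num_periods=3)   # copies far apart
higher_f0 = overlap_add(unit, period=2, num_periods=3)  # copies close together

print(lower_f0)
print(higher_f0)
```

The spacing of the copies sets F0 and the number of copies sets duration, but the waveform of each copy is only ever reused as it is, which is why the resonances cannot be adjusted this way.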
In the same directory you started Festival from.
I’ve not seen this error, but it does sound related to disk space. If you hit serious problems, or think the VM has been corrupted, you can delete the VM entirely (and empty your Trash) and download a fresh copy.
Copy-paste should work fine both within the VM and between the host and the VM, but note that Linux may use different keystrokes for this compared to your personal computer.
You can always use the mouse right-click menu to copy and paste, or the menu in the terminal window.
The error above suggests that your VM isn’t getting a network connection from the host computer. Can you browse websites using Firefox within the VM? If so, ask Google “What’s my IP” and report that here.
Also try this and report the results:
$ ping scp1.ppls.ed.ac.uk
This should work even without the VPN. You should see something like
64 bytes from 129.215.204.88: icmp_seq=0 ttl=46 time=26.379 ms
Terminate the process with ctrl-C.
Try rebooting the VM – this often clears up network connection problems.
That error indicates that it is looking for the file /home/atlab/¬/.bashrc. I think you are typing the incorrect tilde character. The tilde (~) is a special character for the shell and is expanded to your home directory, which is /home/atlab/.
If you can’t find the correct tilde on your keyboard, either cut-and-paste it from the instructions, or use the full path to the file instead:
$ gedit /home/atlab/.bashrc
or do it from your home directory:
$ cd
$ gedit .bashrc