When we look at a waveform, what we are seeing is amplitude on the vertical axis. Intensity, which is a measure of the energy the waveform is carrying, is proportional to amplitude squared.
We don’t need to get hung up on this. Although amplitude does have units (it is the sound pressure, which has units of Newtons per square metre), we don’t usually write these units on the vertical axis of a waveform. That’s because our microphone and soundcard are not calibrated.
It’s also important to remember that neither amplitude nor intensity is the same thing as loudness, which is a perceptual phenomenon and varies with the frequency of the sound.
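Because intensity is proportional to amplitude squared, a ratio of amplitudes maps directly to a ratio of intensities. Here is a minimal sketch of that relationship (the helper name `relative_intensity_db` is made up for illustration):

```python
import math

def relative_intensity_db(a1, a2):
    """Relative intensity, in decibels, between two amplitudes.

    Intensity is proportional to amplitude squared, so the intensity
    ratio is (a2/a1)**2, which in decibels is 20 * log10(a2/a1).
    """
    return 20.0 * math.log10(a2 / a1)

# Doubling the amplitude quadruples the intensity: about +6 dB
print(round(relative_intensity_db(1.0, 2.0), 2))  # 6.02
```

Note that only the *ratio* matters here, which is consistent with the point above: since the microphone and soundcard are not calibrated, absolute values are not meaningful anyway.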
You are right that some vowels have what is called “intrinsic pitch” (which should really be “intrinsic F0”). The effect is small.
This article by Ohala & Eukel lays out some explanations for this in terms of vocal tract physiology.
I’m not sure how perceptually relevant this effect is. In a unit selection system, the effect will implicitly be taken care of because the system uses natural recordings of speech.
For words not in the dictionary, the letter-to-sound model (a classification tree) is used to predict the pronunciation. For each letter in the word, the classification tree predicts the phoneme (or epsilon, or two phonemes).
The predictors are the letter currently being considered and some context around that (e.g., +/- 3 letters); the predictee is the phoneme.
Let’s assume that your example word “fine” is not in the lexicon. When predicting the sound for the letter “i”, the predictors will be:
null null f i n e null
so we can see that the word-final “e” is one of the predictors, and so is available to the classification tree when predicting the sound of the letter “i”. For your other example word “fin”, the predictors will be
null null f i n null null
and since the predictors are different, the classification tree is able to separate the two cases using the question
Is the next-next letter = “e”
which has the answer YES for “fine” and NO for “fin”.
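The window extraction above can be sketched in a few lines. This is not Festival’s actual code, just an illustration of how a +/- 3 letter window padded with “null” is built (`letter_context` is a made-up helper name):

```python
def letter_context(word, i, width=3):
    """Predictors for the letter at position i: the letter itself plus
    'width' letters of context on each side, padded with 'null' beyond
    the word boundaries."""
    padded = ["null"] * width + list(word) + ["null"] * width
    # position i in the word is position i + width in the padded list;
    # take the window centred on it
    return padded[i : i + 2 * width + 1]

print(letter_context("fine", 1))  # ['null', 'null', 'f', 'i', 'n', 'e', 'null']
print(letter_context("fin", 1))   # ['null', 'null', 'f', 'i', 'n', 'null', 'null']
```

The two windows differ at the “next-next letter” position, which is exactly what the classification tree’s question inspects.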
Yes, the words in the training set are hand-labelled with the pronunciation: this is just a dictionary. See this topic.
At synthesis time, the dictionary will be used in preference to the letter-to-sound model for all words in the dictionary. The letter-to-sound model will only be used for words not in the dictionary.
A1: Pitch is often described on a musical scale. This is a relative scale in which 1 octave corresponds to a doubling in fundamental frequency, and an octave is divided into 12 semitones. This musical scale is effectively log F0. It is therefore common to use log F0 instead of actual F0 when modelling it (e.g., for speech synthesis). Another unit that is widely used to describe frequencies on a perceptual scale is the Mel scale. For more about this, see Automatic Speech Recognition.
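The relationship between the musical scale and F0 can be made concrete with a one-line formula: the interval in semitones between two F0 values is 12 times the log (base 2) of their ratio. A small sketch (the helper name `semitones` is made up for illustration):

```python
import math

def semitones(f0_a, f0_b):
    """Musical interval, in semitones, from f0_a up to f0_b.

    One octave (a doubling of F0) is 12 semitones, so the interval
    is 12 * log2(f0_b / f0_a). This is why log F0 is the natural
    domain for modelling pitch."""
    return 12.0 * math.log2(f0_b / f0_a)

print(round(semitones(100.0, 200.0), 1))  # 12.0 (one octave)
print(round(semitones(100.0, 150.0), 1))  # 7.0 (approximately a perfect fifth)
```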
A2: Pitch is the perceptual consequence of F0. Pitch is qualitative (i.e., we need human listeners to describe their perceptions) and F0 is quantitative (i.e., we can measure it objectively from a signal). In speech, they are directly related, and for our purposes it is fine to state that our perception of pitch depends only on F0.
A3: F0 mainly depends on suprasegmental properties of an utterance and not the individual phones in it. Any vowel can be spoken at any F0 (within reason) and still be perceived as that same vowel.
A4: Correct: the quality of a vowel is determined by its formant frequencies and not its F0. See A3 above.
This algorithm is for preparing the training set for a letter-to-sound model (e.g., a classification tree). The end result of the algorithm is a single alignment between letters and phonemes, for each word in the training set (i.e., a pre-existing pronunciation dictionary).
It’s important to realise that, across the whole training set, a particular letter (e.g., “c”) might align with different phonemes (sometimes /k/, sometimes /ch/, etc.) in different words. It won’t necessarily align with the same phoneme every time.
So, how do we get to that single alignment? We use a simple unigram model of the probability of each letter aligning with each phoneme. Most of the probabilities in this model will be zero, and the only non-zero probabilities are for those letter-phoneme pairs given in the allowables list.
The key machine learning concept to understand in this algorithm is that of first initialising this unigram model and then iteratively improving the model.
To initialise, and then to improve the model, we need an alignment for all words in the training set, so that we can count how many times each phoneme aligns with each letter. The allowables lists are used to find the first alignment. The model is then updated, and then this improved model is used to find a better alignment.
If the allowables list for a particular letter only contained a single phoneme, then that letter would always have to align with that phoneme. But in general, the allowables lists will have many phonemes for each letter.
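The model-update step described above can be sketched as follows. This is a simplified illustration, not Festival’s actual alignment code: `update_model` and the toy alignments are made up, and “_” stands for epsilon (a letter aligning with no phoneme).

```python
from collections import defaultdict

def update_model(alignments):
    """One model-update step: count how often each letter aligns with
    each phoneme across the whole training set, then normalise the
    counts into unigram probabilities P(phoneme | letter)."""
    counts = defaultdict(lambda: defaultdict(float))
    for pairs in alignments:  # one aligned word = list of (letter, phoneme)
        for letter, phoneme in pairs:
            counts[letter][phoneme] += 1.0
    model = {}
    for letter, ph_counts in counts.items():
        total = sum(ph_counts.values())
        model[letter] = {ph: c / total for ph, c in ph_counts.items()}
    return model

# Toy training set: "cat" -> /k ae t/ and "cell" -> /s eh l/ ("_" = epsilon)
alignments = [
    [("c", "k"), ("a", "ae"), ("t", "t")],
    [("c", "s"), ("e", "eh"), ("l", "l"), ("l", "_")],
]
model = update_model(alignments)
print(model["c"])  # {'k': 0.5, 's': 0.5}
```

In the real algorithm, this update alternates with a re-alignment step: the improved probabilities are used to score candidate alignments and pick the best one for each word, and the cycle repeats.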
You are correct – we measure the entropy of each half of the split, and sum these values (weighted by occupancy) to get the total entropy value for that question.
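As a sketch (with hypothetical helper names, not any particular toolkit’s code), the occupancy-weighted total entropy for a question can be computed like this:

```python
import math

def entropy(labels):
    """Entropy, in bits, of the label distribution in one partition."""
    total = len(labels)
    counts = {}
    for x in labels:
        counts[x] = counts.get(x, 0) + 1
    e = 0.0
    for c in counts.values():
        p = c / total
        e -= p * math.log2(p)
    return e

def split_entropy(left, right):
    """Total entropy for a question: the entropy of each half of the
    split, weighted by occupancy (the fraction of data points that
    fall into that half)."""
    n = len(left) + len(right)
    return (len(left) / n) * entropy(left) + (len(right) / n) * entropy(right)

# A question that separates the data perfectly has total entropy 0
print(split_entropy(["k", "k"], ["s", "s"]))  # 0.0
```

The tree-building algorithm picks the question with the lowest total entropy (equivalently, the biggest reduction in entropy relative to the parent node).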
This is well beyond the scope of what we need to know about POS tagging for speech synthesis! We just need to know that
– POS tagging is very accurate for languages where a large corpus of hand-tagged data is available to train the tagger
– a typical method is HMMs + n-grams, with the model being trained on that corpus

The numbers 0, 1, 2 appended to phoneme names refer to lexical stress:
0 = unstressed
1 = primary stress
2 = secondary stress

For phoneme sequences predicted by letter-to-sound, sometimes only 0 and 1 are used, so you might find more than one syllable with stress of “1”.
Syllables are indicated by the bracketing: in the example above, the first syllable is (k ae t) and it has a stress of “1”
Festival has its own algorithm for syllabifying phoneme sequences that have been predicted by the letter-to-sound module. This algorithm does not follow the “maximum onset” principle, and exactly what the method does is a bit obscure to me (I didn’t write that part of Festival).
The method for old diphones voices is different to that for newer unit selection voices (such as the one you will be using for the assessed practical).
A tube that is closed at both ends will have resonances at all (both odd and even) multiples of the lowest resonant frequency. The sound wave travels as follows:
- starts at the closed end (vocal folds)
- travels to the other closed end
- reflects back
- travels back to the first closed end
- meets the next pulse from the vocal folds, is in phase with it, and so gets bigger
The wave has to travel 2 times the length of the tube before meeting the next pulse, in order to be “in step” with it.
For a tube that is closed at one end and open at the other, it’s a little different. The sound wave reflects from the closed end, just as in the other case, but when a wave is reflected from the open end, it is inverted. Therefore, the wave needs to travel as follows before it will be “in step” (i.e., in phase) with the next wave:
- starts at the closed end (vocal folds)
- travels to the open end
- reflects back, but is inverted in the process
- the inverted wave travels to the closed end
- is reflected and remains inverted
- travels to the open end
- reflects back, and is inverted again (so is back to ‘normal’)
- travels back to the closed end
- meets the next pulse from the vocal folds, is in phase with it, and so gets bigger
You see that the wave needs to traverse 4 times the length of the tube this time.
The higher frequency resonances of these tubes occur by putting pulses in more frequently. Try to work that out using the same reasoning as above.
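To make the arithmetic concrete, here is a sketch that turns the travel distances described above into resonant frequencies. It assumes a speed of sound of roughly 350 m/s (a common approximation in acoustic phonetics); the function names are made up for illustration.

```python
def resonances_closed_closed(length_m, n=3, c=350.0):
    """First n resonances of a tube closed at both ends: the wave must
    travel 2L to be in step with the next pulse, so the lowest resonance
    is F1 = c / (2L), and ALL integer multiples of F1 resonate."""
    f1 = c / (2.0 * length_m)
    return [k * f1 for k in range(1, n + 1)]

def resonances_closed_open(length_m, n=3, c=350.0):
    """First n resonances of a tube closed at one end and open at the
    other: the wave must travel 4L, so F1 = c / (4L), and only ODD
    multiples of F1 resonate."""
    f1 = c / (4.0 * length_m)
    return [(2 * k - 1) * f1 for k in range(1, n + 1)]

# A 17.5 cm tube, roughly an adult male vocal tract:
print([round(f) for f in resonances_closed_open(0.175)])    # [500, 1500, 2500]
print([round(f) for f in resonances_closed_closed(0.175)])  # [1000, 2000, 3000]
```

The closed-open case gives the familiar approximate formant frequencies of a uniform vocal tract (schwa-like vowel): 500, 1500, 2500 Hz.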
In principle, yes, this is possible in Festival. But it would require advanced Scheme programming skills and a deep understanding of how Festival is implemented.
Instead, try using Praat to modify F0 (and optionally also duration) of either a natural sentence, or a synthetic waveform saved from Festival. What do you need to modify to move the perceived stress from one syllable to another?
Radiation is what happens at the lips as sound waves within the vocal tract are propagated out into the free air.
A detailed understanding of the physics is a little beyond our needs, but here’s a simple way to understand what is happening at the lips:
The sound pressure variations inside the vocal tract are due to waves propagating up and down the tube and being reflected back at both the ends. The air within the vocal tract is approximately, on average, stationary (forget about the flow caused by breathing – it’s very slow compared to the speed of sound).
The radiation effect is what happens when this trapped “piston” of air in the vocal tract causes the air in the free field outside the lips to move, creating sound waves that propagate out from the lips.
The effect is to differentiate the signal, which has the same effect as imposing a filter that boosts higher frequencies, as in Figure 7.7 of “Elements of Acoustic Phonetics” by Ladefoged.
Because this is a constant effect (independent of the settings of the articulators and of F0), it is common to omit this filter and include the effect in the source spectrum. Or, if the source is a simple pulse train with a flat spectrum, the vocal tract filter will include the lip radiation effect.
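In discrete time, differentiation can be approximated by a first-difference filter, y[n] = x[n] − x[n−1], whose gain grows with frequency (roughly +6 dB per octave at low frequencies). A small sketch, not taken from any particular textbook or toolkit:

```python
import math

def diff_gain(freq_hz, fs=16000.0):
    """Magnitude response of the first-difference filter
    y[n] = x[n] - x[n-1], which approximates differentiation:
    |H(f)| = 2 * sin(pi * f / fs)."""
    return 2.0 * math.sin(math.pi * freq_hz / fs)

# The gain roughly doubles with each octave at low frequencies,
# i.e., higher frequencies are boosted by about +6 dB/octave
for f in (125.0, 250.0, 500.0, 1000.0):
    print(round(diff_gain(f), 4))
```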
Don’t think of the instantaneous amplitude (i.e., the value of one sample of the waveform) as how loud the sound will be. That is not the case. The cochlea detects variations in pressure, not absolute pressure. So, it’s the movement of the waveform “up and down” that is important, and not the actual value of individual samples.
The pulse train could just as well be written as having the value -1 almost all the time and then going to 0 for a single sample at each pulse. It would sound the same.
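This equivalence is easy to check numerically: a pulse train going from 0 up to 1 and one going from −1 up to 0 differ only by a constant offset (DC), so their sample-to-sample variations, which are what the ear detects, are identical. A toy sketch:

```python
# Two versions of the same pulse train over two periods of 8 samples:
# train_a pulses from 0 up to 1; train_b pulses from -1 up to 0.
period = 8
train_a = [1.0 if n % period == 0 else 0.0 for n in range(16)]
train_b = [0.0 if n % period == 0 else -1.0 for n in range(16)]

# The "up and down" movement: successive sample-to-sample differences
diffs_a = [train_a[n] - train_a[n - 1] for n in range(1, 16)]
diffs_b = [train_b[n] - train_b[n - 1] for n in range(1, 16)]
print(diffs_a == diffs_b)  # True: the variations are identical
```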
Yes, what we are plotting on a waveform are deviations from the average pressure. These deviations can be positive (compression = air molecules are closer together than average) or negative (rarefaction = air molecules are further apart than average).