F0 tracking and Pitch marks
Hello, I’m unable to locate any information on this: is the F0 tracking algorithm used for this assignment the cross-correlation-based RAPT? As for pitch marking, is it safe to assume that pitch marks are obtained by differentiating the waveform and locating the zero crossings of the differentiated signal?
The F0 estimation tool is part of CSTR’s own Speech Tools library and is an implementation of this algorithm:
Y. Medan, E. Yair and D. Chazan, “Super resolution pitch determination of speech signals,” in IEEE Transactions on Signal Processing, vol. 39, no. 1, pp. 40-48, Jan. 1991, DOI: 10.1109/78.80763.
with improvements described in:
Paul C. Bagshaw, Steven Hiller and Mervyn A. Jack. “Enhanced Pitch Tracking and the Processing of F0 Contours for Computer Aided Intonation Teaching” in Proc. EUROSPEECH’93, pp 1003-1006.
At the heart of the algorithm is the same idea as RAPT: the correlation between the waveform and a time-shifted copy of itself (variously called autocorrelation or cross-correlation).
For the purposes of the assignment, you may assume that the algorithm is essentially the same as RAPT, since that is the one taught in the course.
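To make the correlation idea concrete, here is a minimal sketch in Python. It is not the Speech Tools implementation and not RAPT itself: it only shows the core step of correlating one frame with a time-shifted copy of itself and picking the lag with the highest normalised correlation. The function names, the 60 to 400 Hz search range, and the synthetic sine-wave input are all assumptions made for illustration.

```python
import math


def normalized_cross_correlation(frame, lag, window):
    """Correlate frame[0:window] with the copy shifted by `lag` samples.

    The result is normalised by the energy of both segments, so it lies
    in [-1, 1] and is 1 when the two segments are identical.
    """
    a = frame[:window]
    b = frame[lag:lag + window]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0 else 0.0


def estimate_f0(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Return an F0 estimate in Hz for one frame (hypothetical helper).

    The search range f0_min..f0_max is an assumed, illustrative choice.
    """
    min_lag = int(sample_rate / f0_max)  # shortest period considered
    max_lag = int(sample_rate / f0_min)  # longest period considered
    window = len(frame) - max_lag        # samples available at every lag
    if window <= 0:
        raise ValueError("frame is too short for the requested f0_min")
    best_lag = max(
        range(min_lag, max_lag + 1),
        key=lambda lag: normalized_cross_correlation(frame, lag, window),
    )
    return sample_rate / best_lag


# Synthetic test signal: 50 ms of a 100 Hz sine sampled at 16 kHz.
sample_rate = 16000
frame = [math.sin(2 * math.pi * 100 * n / sample_rate) for n in range(800)]
print(estimate_f0(frame, sample_rate))  # 100.0
```

A real tracker does considerably more than this: it has to decide which frames are voiced, avoid octave errors (a lag of two periods correlates just as well as one), and smooth the contour across frames.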
The pitchmarking method is indeed based on finding negative-going zero-crossings of the differentiated waveform. Here’s the code if you want to read it (not required!).
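As a rough illustration of that idea only (this is not the Speech Tools code, which does more than this), the sketch below differentiates a waveform with a first-order difference and marks every point where the difference goes from positive to non-positive, which corresponds to a local peak of the waveform. The function name and the sine-wave input are assumptions made for the example.

```python
import math


def pitchmarks_from_waveform(samples, sample_rate):
    """Return pitch mark times in seconds (hypothetical helper).

    Marks are placed at negative-going zero-crossings of the
    first-order difference of `samples`.
    """
    # Differentiate: diff[n] approximates the slope between n and n + 1.
    diff = [samples[n + 1] - samples[n] for n in range(len(samples) - 1)]
    marks = []
    for n in range(1, len(diff)):
        # Slope was rising into sample n and is flat or falling after it,
        # so sample n is a local peak of the original waveform.
        if diff[n - 1] > 0 and diff[n] <= 0:
            marks.append(n / sample_rate)
    return marks


# Three periods of a 100 Hz sine sampled at 16 kHz.
sample_rate = 16000
wave = [math.sin(2 * math.pi * 100 * n / sample_rate) for n in range(480)]
marks = pitchmarks_from_waveform(wave, sample_rate)
print([round(t * sample_rate) for t in marks])  # [40, 200, 360]
```

On real speech the raw waveform has many small peaks within each pitch period, so applying this directly would produce far too many marks; some form of filtering before differentiation and clean-up of the marks afterwards would be needed.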