F0 tracking and Pitch marks

This topic has 1 reply, 2 voices, and was last updated 2 years, 10 months ago by Simon.

- April 10, 2022 at 20:28 #15876
  Manas M
  Student
  Hello, I’m unable to locate any information on this: is the F0 tracking algorithm used for this assignment the cross-correlation-based RAPT? As for pitch marking, is it safe to assume that pitch marks are obtained by differentiating the waveform and locating the zero crossings of the differentiated signal?
- April 11, 2022 at 09:52 #15878
  Simon
  Professor
  The F0 estimation tool is part of CSTR’s own Speech Tools library and is an implementation of this algorithm:
  
  Y. Medan, E. Yair and D. Chazan, “Super resolution pitch determination of speech signals,” in IEEE Transactions on Signal Processing, vol. 39, no. 1, pp. 40-48, Jan. 1991, DOI: 10.1109/78.80763.
  
  with improvements described in:
  
  P. C. Bagshaw, S. Hiller and M. A. Jack, “Enhanced Pitch Tracking and the Processing of F0 Contours for Computer Aided Intonation Teaching,” in Proc. EUROSPEECH’93, pp. 1003-1006.
  
  At the heart of the algorithm is the same idea as RAPT: the correlation between the waveform and a time-shifted copy of itself (variously called autocorrelation or cross-correlation).
  
  For the purposes of the assignment, you may assume that the algorithm is essentially the same as RAPT, since that is the one taught in the course.
  
  The pitchmarking method is indeed based on finding negative-going zero-crossings of the differentiated waveform. Here’s the code if you want to read it (not required!).
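
To make the two ideas in this thread concrete, here is a minimal Python sketch. It is not the Speech Tools code and not RAPT or the super resolution algorithm itself: it only illustrates the shared core idea (correlating a frame with a time-shifted copy of itself and picking the best lag) and the basic pitch-marking idea (negative-going zero-crossings of the differentiated waveform, which are the local maxima of the waveform). The function names, the 60-400 Hz search range, and the lack of any filtering, voicing decision, or smoothing are assumptions made purely for illustration.

```python
import numpy as np


def estimate_f0(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Return (f0_hz, score) for one frame using normalised correlation.

    Illustrative only: the search range is an assumed default, and the
    frame must be longer than sample_rate / f0_min samples.
    """
    frame = np.asarray(frame, dtype=float)
    frame = frame - frame.mean()          # remove any DC offset
    min_lag = int(sample_rate / f0_max)   # shortest period considered
    max_lag = int(sample_rate / f0_min)   # longest period considered

    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        a = frame[:-lag]                  # the waveform
        b = frame[lag:]                   # the same waveform shifted by `lag`
        denom = np.sqrt(np.dot(a, a) * np.dot(b, b))
        if denom == 0.0:                  # silent frame: nothing to correlate
            continue
        score = np.dot(a, b) / denom      # normalised correlation in [-1, 1]
        if score > best_score:
            best_lag, best_score = lag, score

    if best_lag == 0:                     # no positive correlation was found
        return 0.0, 0.0
    return sample_rate / best_lag, best_score


def pitch_marks(waveform, sample_rate):
    """Return times (s) of negative-going zero-crossings of the derivative.

    d[n] = x[n+1] - x[n]; a change from d > 0 to d <= 0 marks a local
    maximum of x. A real tool would normally filter the signal first.
    """
    x = np.asarray(waveform, dtype=float)
    d = np.diff(x)
    # Index i has d[i] > 0 and d[i+1] <= 0, so the peak is at sample i + 1.
    peaks = np.where((d[:-1] > 0) & (d[1:] <= 0))[0] + 1
    return peaks / sample_rate
```

On a clean periodic signal, `estimate_f0` picks the lag equal to one period and `pitch_marks` returns one mark per period. On real speech, every small local maximum would also be marked, which is why practical pitch markers filter the signal and post-process the marks.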