[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:00081] Re: f0 extraction using pda on raw sound files

Subject: [hts-users:00081] Re: f0 extraction using pda on raw sound files
From: "Heiga Zen (Byung-Ha Chun)" <zen@xxxxxxxxxxxxxxxx>
Date: Tue, 07 Dec 2004 00:29:56 -0500
Organization: Nagoya Institute of Technology, Japan
User-agent: Mozilla Thunderbird 0.8 (Windows/20040913)

Hi Anders,

Anders Lundgren wrote:

For instance, what low/high freq boundaries should be setwhen extracting a male voice?


Such kind of parameters depend on data.

Also, are there any other parameters (ievoiced/voiceless treshold) that could affect HTS performance?


They would affect the performance.

I always check the quality of analysis/synthesis (mel-cepstral vocoder)speech of original data to determine f0 extraction parameters.

I created som f0 files using this utility and packed them to a binaryfloat little endian, but I receive the "ViterbiAlign: No path found in8'th segment" when training reaches "sil" (silence). I have successfullytrained using the exact same data, but with f0 contours taken from theKTH "Snack" f0 extraction tool. The problem then is that many segmentsin sentence-final position becomes partially unvoiced though there is noevidence for this in the training data.

Could you count the number of voiced/unvoiced frames assigned to segment"sil" in whole training data?


Best regards,

Heiga Zen (Byung-Ha Chun)

--
 ------------------------------------------------
  Heiga Zen     (in Japanese pronunciation)
  Byung-Ha Chun (in Korean pronunciation)

  Department of Computer Science and Engineering
  Graduate School of Engineering
  Nagoya Institute of Technology
  Japan

  e-mail: zen@xxxxxxxxxxxxxxxx
     web: http://kt-lab.ics.nitech.ac.jp/~zen
 ------------------------------------------------

References
: [hts-users:00080] f0 extraction using pda on raw sound files, Anders Lundgren

Prev by Subject: [hts-users:00080] f0 extraction using pda on raw sound files
Next by Subject: [hts-users:00082] how to set window data for dynamic features?
Previous by thread: [hts-users:00080] f0 extraction using pda on raw sound files
Next by thread: [hts-users:00082] how to set window data for dynamic features?