
[hts-users:00080] f0 extraction using pda on raw sound files

Subject: [hts-users:00080] f0 extraction using pda on raw sound files
From: Anders Lundgren <anderslu@xxxxxxxxxxxxx>
Date: Sun, 05 Dec 2004 21:00:41 +0100
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.4.3) Gecko/20041004

Hi all,

I tried to extract f0 contours using the pda utility from the Edinburgh Speech Tools, but I'm not sure which command-line parameters work best with HTS. For instance, what low/high frequency boundaries should be set when extracting from a male voice? Also, are there any other parameters (i.e. the voiced/voiceless threshold) that could affect HTS performance?

I created some f0 files using this utility and packed them as little-endian binary floats, but I receive the error "ViterbiAlign: No path found in 8'th segment" when training reaches "sil" (silence). I have successfully trained using the exact same data, but with f0 contours taken from the KTH "Snack" f0 extraction tool. The problem then is that many segments in sentence-final position become partially unvoiced, though there is no evidence for this in the training data.
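For reference, the packing step can be sketched roughly as follows. This is a minimal illustration and not the exact script used: the helper names `pack_f0` and `unpack_f0` are hypothetical, the values are assumed to be one f0 value in Hz per frame, and unvoiced frames are assumed to be marked with 0.0.

```python
import struct

def pack_f0(values):
    """Pack per-frame f0 values as little-endian 32-bit floats.

    Assumption: one value per frame, with 0.0 marking an unvoiced frame.
    """
    return struct.pack("<%df" % len(values), *values)

def unpack_f0(data):
    """Read little-endian 32-bit floats back into a list (inverse of pack_f0)."""
    n = len(data) // 4  # 4 bytes per float32; any trailing partial value is ignored
    return list(struct.unpack("<%df" % n, data[:4 * n]))

# Hypothetical five-frame contour: two unvoiced, two voiced, one unvoiced.
f0 = [0.0, 0.0, 110.0, 112.5, 0.0]
blob = pack_f0(f0)
```

Reading a file back with `unpack_f0` and comparing the number of values with the expected number of frames is a quick way to rule out a byte-order or frame-count mistake in the packing itself.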

Am I right in believing this might be a threshold problem for the voiced/unvoiced parts of speech? Or could it be that some frames have been truncated from the f0 files created with KTH Snack?
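One way to tell these two explanations apart is to compare the frame counts and voicing statistics of the two contours for the same utterance. The sketch below is only an illustration under the same assumptions as before (one value per frame, 0.0 for unvoiced); the function names and the sample contours are made up for the example.

```python
def voicing_summary(f0, unvoiced_value=0.0):
    """Count total, voiced and unvoiced frames in an f0 contour."""
    voiced = sum(1 for v in f0 if v != unvoiced_value)
    return {"frames": len(f0), "voiced": voiced, "unvoiced": len(f0) - voiced}

def trailing_unvoiced(f0, unvoiced_value=0.0):
    """Number of consecutive unvoiced frames at the end of the contour."""
    count = 0
    for v in reversed(f0):
        if v != unvoiced_value:
            break
        count += 1
    return count

# Hypothetical contours for the same utterance from two extractors.
pda_f0 = [0.0, 0.0, 110.0, 112.0, 0.0, 0.0, 0.0]
snack_f0 = [0.0, 0.0, 110.0, 112.0, 111.0]

pda_stats = voicing_summary(pda_f0)
snack_stats = voicing_summary(snack_f0)
frame_difference = pda_stats["frames"] - snack_stats["frames"]
```

A non-zero `frame_difference` would point towards truncated or differently framed files, while equal frame counts with very different voiced/unvoiced counts would point towards the voicing threshold.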

Follow-Ups
: [hts-users:00081] Re: f0 extraction using pda on raw sound files, Heiga Zen (Byung-Ha Chun)
