[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:01819] Re: HTS voice built with Festvox 2.1 - HHed failure during duration model clustering

Subject: [hts-users:01819] Re: HTS voice built with Festvox 2.1 - HHed failure during duration model clustering
From: Esther Klabbers <klabbers@xxxxxxxxxxxx>
Date: Wed, 3 Dec 2008 08:29:25 -0800
Delivered-to: hts-users@xxxxxxxxxxxxxxx

I wish there was more information available on how to use filesgenerated in HTS-demo in Festival. I have been able to run HTS-demofor the SLT voice but when I compare the output .win, .inf and .pdffiles they are different than the ones distributed in thenitech_cmu_slt_arctic_hts/hts directory.- For one the standard HTS-demo script produces mgc.pdf instead ofmcep.pdf. Are these the same but with a different name? Or are theydifferent. If they are different how can you change the scheme filesfor Festival to work with the different format.- The .win files are ascii in HTS-demo but something else in thefestival voice. How do you convert these?


Thanks, Esther Klabbers

On Dec 3, 2008, at 6:47 AM, Heiga Zen (Byung Ha CHUN) wrote:

Hi,

Daniel Tihelka wrote:
first of all I have found small bug on 64-bit platform: in src/hts_build/data/mkdata.pl (written by Heiga Zen), line 114, theformat of first two items stored into tmp.head file are formattedby -al in command x2x. It will write two long numbers - in 32-bitplatform they are 4B long, as expected by HTK. However, when SPTK(I use 3.1 version) is compiled at 64bit platform, the numbers are8B long and HTK (also 64bit version) cannot of course read it.Quick workaround is to use -ai switch; it should work on both 32and 64bit.
It doesn7t happen in the current HTS-demo. The HTS-demo no longeruses SPTK to append HTK headers. It uses addhtkheader.pl to appendthis. You can find the following statement in addhtkheader.pl:
# number of frames in long
$NFRAME = pack("l", $nframe);
it packs number of frames in "l". In Perl, packing a value in "l"peforms packing the value in "signed 32-bit interger value". So itdoesn't depend on 32/64-bit.
What troubles me, however, is that Training.pl script from src/hts_build/ does not build HTS voice correctly (checked on CMUARCTIC AWB 0.90 and CMU US SLT ARCTIC 0.95, on both the same typeof failure occurs). No changes were made both in scripts and voice!
Festvox's training script is out of date. I wrote it about 6 yearsago. I don't recommend you to use that. It cannot use the latesttechnologies and fixes.
What may be wrong? I do not expect a problem be a 64bit platform.All scripts and voice files were also used without any change(except -x switch in the build script and -ai switch for x2x,which, I think, are unlikely to cause the error). And the awb andslt HTS voices are available for Festival (and working), so theywere build somehow - and I suppose that by build_hts script. Is thescript up-to-date (the last date there is May 2003)? Or is thereanother preferred way how to build HTS voice for Festival?
Please do not use training script included in Festvox.
Any advice will be really appreciated. May I give you anyadditional information?
Please use the latest one released on the HTS website.

Regards,

Heiga ZEN (Byung Ha CHUN)

--
--------------------------
Heiga ZEN (Byung Ha CHUN)
Speech Technology Group
Cambridge Research Lab
Toshiba Research Europe
phone: +44 1223 436975

______________________________________________________________________
This email has been scanned by the MessageLabs Email Security System.
For more information please visit http://www.messagelabs.com/email______________________________________________________________________


Esther Klabbers
Assistant Professor,
Center for Spoken Language Understanding (CSLU),
Division of Biomedical Computer Science (BMCS)
Oregon Health & Science University (OHSU)

20000 NW Walker Road / Beaverton, OR 97006
Office: +1-503-748-3005 / Fax: +1-503-748-1306
http://www.cslu.ogi.edu/people/klabbers

Follow-Ups
: [hts-users:01824] Re: HTS voice built with Festvox 2.1 - HHed failure during duration model clustering, Daniel Tihelka

References
: [hts-users:01817] HTS voice built with Festvox 2.1 - HHed failure during duration model clustering, Daniel Tihelka; [hts-users:01818] Re: HTS voice built with Festvox 2.1 - HHed failure during duration model clustering, Heiga Zen (Byung Ha CHUN)

Prev by Subject: [hts-users:01818] Re: HTS voice built with Festvox 2.1 - HHed failure during duration model clustering
Next by Subject: [hts-users:01820] Re: utt and lab files for Portuguese
Previous by thread: [hts-users:01818] Re: HTS voice built with Festvox 2.1 - HHed failure during duration model clustering
Next by thread: [hts-users:01824] Re: HTS voice built with Festvox 2.1 - HHed failure during duration model clustering