Hallo again,
I just tried to use HTS to build the SLT voice (from file
HTS-demo_CMU-ARCTIC-SLT.tar.bz2) - make works perfectly without any
visible
problems, synthetic examples in ..../gen/qst001/ver1/hts_engine
sound as I
would expect, files in ..../voices/qst001/ver1/ were generated.
One note: option -w is not supported in sox 14.1.0 (and probably
even in some
little bit earlier version), use -2 instead, when called from
Training.pl.
Now the question is, how to "map" the files generated into Festival
voice. I
tried the following mapping, but without success - the voice was
loaded
correctly, but SayText("Hallo, this is the first try") command
consumed about
1.5GB memory and failed with message WALLOC: failed to malloc
671088648 bytes.
I just copied the files in lib/voices/us/voice_name/festvox (in
Festival) from
the SLT voice package cmu_us_slt_arctic-0.95-release.tar.bz2, for
the files
in lib/voices/us/voice_name/hts I have used the following mapping
(in order
required by Festival: --> provided by HTS's Training.pl):
- duration.pdf --> voices/qst001/ver1/dur.pdf
simply renamed
- feat.list --> voices/qst001/ver1/label.feats
simply renamed
- lf0_dyn.win --> voices/qst001/ver1/lf0.win2
3 floats need to be converted to binary form, e.g. 'x2x' from SPTK
(I have read it somewhere, but I cannot find it anymore ...)
- lf0_acc.win --> voices/qst001/ver1/lf0.win3
3 floats need to be converted to binary form, e.g. 'x2x' from SPTK
- lf0.pdf --> voices/qst001/ver1/lf0.pdf
used without changes
- mcep_dyn.win --> voices/qst001/ver1/mgc.win2
3 floats need to be converted to binary form, e.g. 'x2x' from SPTK
- mcep_acc.win --> voices/qst001/ver1/mgc.win3
3 floats need to be converted to binary form, e.g. 'x2x' from SPTK
- mcep.pdf --> voices/qst001/ver1/mgc.pdf
simply renamed
- p_mcep.pdf --> voices/qst001/ver1/????
does not exist in HTS, but required by Festival (in case that
'mgc.pdf' file
would be use for this, what about 'mcep.pdf' then?)
I just linked it to mgc.pdf for the test ...
- trees-dur.inf --> voices/qst001/ver1/tree-dur.inf
simply renamed
- trees-lf0.inf --> voices/qst001/ver1/tree-lf0.inf
simply renamed
- trees-mcep.inf --> voices/qst001/ver1/tree-mgc.inf
simply renamed
As I said, it did not work. Unfortunately.
So I would like to ask you for some additional hints. HTS training
works
perfectly, but how to convert the result into Festival? How the
voice package
cmu_us_slt_arctic-0.95-release.tar.bz2 has been built?
Thank you very much. Best regards,
Dan
On Wednesday 03 of December 2008, Esther Klabbers wrote:
I wish there was more information available on how to use files
generated in HTS-demo in Festival. I have been able to run HTS-demo
for the SLT voice but when I compare the output .win, .inf and .pdf
files they are different than the ones distributed in the
nitech_cmu_slt_arctic_hts/hts directory.
- For one the standard HTS-demo script produces mgc.pdf instead of
mcep.pdf. Are these the same but with a different name? Or are they
different. If they are different how can you change the scheme files
for Festival to work with the different format.
- The .win files are ascii in HTS-demo but something else in the
festival voice. How do you convert these?
Thanks, Esther Klabbers