Hi,make[1]: Leaving directory `/home/xiexu/Desktop/hts/HTS-demo_CMU-ARCTIC-SLT/data'
Thanks very much for your reply. Errors appear again.
The current version of SPTK is SPTK-3.2. Should I use the SPTK-3.1 version? When the training is running, can I use the computer to do other things or should I leave it alone?
Anyone has the similar experience? Please check it. The following is the details. I highlight the error and invalid parts with the red color. Thanks a lot.
Last few lines of Terminal:
# Generating a generation label script
rm -f scp/gen.scp
for lab in /home/xiexu/Desktop/hts/HTS-demo_CMU-ARCTIC-SLT/data/labels/gen/*.lab; do \
echo ${lab} >> scp/gen.scp; \
done
echo "Running a training/synthesis perl script (Training.pl) in background...."
Running a training/synthesis perl script (Training.pl) in background..../usr/bin/perl scripts/Training.pl scripts/Config.pm > log 2>&1 &.lf0...mglsadf : Invalid option 'g'!
xiexu@xiexu-laptop:~/Desktop/hts/HTS-demo_CMU-ARCTIC-SLT$
Last few lines of the "log" file:
======================================================================
Start synthesizing waveforms (1mix) at Mon Nov 9 18:51:00 NZDT 2009
======================================================================
Processing directory /home/xiexu/Desktop/hts/HTS-demo_CMU-ARCTIC-SLT/gen/qst001/ver1/1mix/0:
Synthesizing a speech waveform from alice01.mgc and alice01You can check the mglsadf in the current version3.2 SPTK mannual. There is no '-g' option, so you should use SPTK3.1 instead.SoX doesn't support a single '-w' option.
mglsadf - MGLSA digital filter for speech synthesis
usage:
mglsadf [ options ] mgcfile [ infile ] > stdout
options:
-m m : order of mel-generalized cepstrum [25]
-a a : alpha [0.35]
-c c : gamma = -1 / (int) c [1]
-p p : frame period [100]
-i i : interpolation period [1]
-t : transpose filter [FALSE]
-k : filtering without gain [FALSE]
-P P : order of Pade approximation [4]
-h : print this message
infile:
filter input (float) [stdin]
stdout:
filter output (float)
mgcfile:
mel-generalized cepstrum (float)
notice:
if c==0, MLSA filter is used, P should be 4 or 5
SPTK: version 3.2
CVS Info: $Id: mglsadf.c,v 1.23 2008/11/06 15:40:51 tatsuyaito Exp $
/usr/bin/sox: invalid option -- w
Failed: invalid optionIt refers to the reverberation effect or the window selection options.So I guess '-w' means the data length of the waveform is a 'word'. You can change it to '-2'(means 2 Byte) or '-4'(means 4 Byte).This occurs because of the two errors above.
/usr/bin/sox: SoX v14.2.0
Usage summary: [gopts] [[fopts] infile]... [fopts] outfile [effect [effopts]]...
SPECIAL FILENAMES (infile, outfile):
- Pipe/redirect input/output (stdin/stdout); use with -t
-d, --default-device Use the default audio device (where available)
-n, --null Use the `null' file handler; e.g. with synth effect
-p, --sox-pipe Alias for `-t sox -'
SPECIAL FILENAMES (infile only):
"|program [options] ..." Pipe input from external program (where supported)
http://server/file Use the given URL as input file (where supported)
GLOBAL OPTIONS (gopts) (can be specified at any point before the first effect):
--buffer BYTES Set the size of all processing buffers (default 8192)
--combine concatenate Concatenate multiple input files (default for sox, rec)
--combine sequence Sequence multiple input files (default for play)
--effects-file FILENAME File containing effects and options
-h, --help Display version number and usage information
--help-effect NAME Show usage of effect NAME, or NAME=all for all
--help-format NAME Show info on format NAME, or NAME=all for all
--input-buffer BYTES Override the input buffer size (default: as --buffer)
--interactive Prompt to overwrite output file
-m, --combine mix Mix multiple input files (instead of concatenating)
-M, --combine merge Merge multiple input files (instead of concatenating)
--plot gnuplot|octave Generate script to plot response of filter effect
-q, --no-show-progress Run in quiet mode; opposite of -S
--replay-gain track|album|off Default: off (sox, rec), track (play)
-R Use default random numbers (same on each run of SoX)
-S, --show-progress Display progress while processing audio data
--version Display version number of SoX and exit
-V[LEVEL] Increment or set verbosity level (default 2); levels:
1: failure messages
2: warnings
3: details of processing
4-6: increasing levels of debug messages
FORMAT OPTIONS (fopts):
Input file format options need only be supplied for files that are headerless.
Output files will have the same format as the input file where possible and not
overriden by any of various means including providing output format options.
-v|--volume FACTOR Input file volume adjustment factor (real number)
-t|--type FILETYPE File type of audio
-s/-u/-f/-U/-A/-i/-a/-g Encoding type=signed-integer/unsigned-integer/floating-
point/mu-law/a-law/ima-adpcm/ms-adpcm/gsm-full-rate
-e|--encoding ENCODING Set encoding (ENCODING in above list)
-b|--bits BITS Encoded sample size in bits
-1/-2/-3/-4/-8 Encoded sample size in bytes
-N|--reverse-nibbles Encoded nibble-order
-X|--reverse-bits Encoded bit-order
--endian little|big|swap Encoded byte-order; swap means opposite to default
-L/-B/-x Short options for the above
-c|--channels CHANNELS Number of channels of audio data; e.g. 2 = stereo
-r|--rate RATE Sample rate of audio
-C|--compression FACTOR Compression factor for output format
--add-comment TEXT Append output file comment
--comment TEXT Specify comment text for the output file
--comment-file FILENAME File containing comment text for the output file
AUDIO FILE FORMATS: 8svx aif aifc aiff aiffc al amb au avr caf cdda cdr cvs cvsd dat dvms f4 f8 fap flac fssd gsm hcom htk ima ircam la lpc lpc10 lu mat mat4 mat5 maud nist ogg paf prc pvf raw s1 s2 s3 s4 sb sd2 sds sf sl smp snd sndfile sndr sndt sou sox sph sw txw u1 u2 u3 u4 ub ul uw vms voc vorbis vox w64 wav wavpcm wv wve xa xi
PLAYLIST FORMATS: m3u pls
AUDIO DEVICE DRIVERS: alsa
EFFECTS: allpass band bandpass bandreject bass bend chorus compand contrast dcshift deemph delay dither earwax echo echos equalizer fade filter flanger gain highpass ladspa loudness lowpass mcompand mixer noiseprof noisered norm oops pad phaser pitch rate remix repeat reverb reverse riaa silence spectrogram speed splice stat stretch swap synth tempo treble tremolo trim vol
EFFECT OPTIONS (effopts): effect dependent; see --help-effect
Error in /usr/local/SPTK-3.2/bin/excite -p 80 /home/xiexu/Desktop/hts/HTS-demo_CMU-ARCTIC-SLT/gen/qst001/ver1/1mix/0/alice01.pit | /usr/local/SPTK-3.2/bin/mglsadf -m 24 -p 80 -a 0.42 -g 0 /home/xiexu/Desktop/hts/HTS-demo_CMU-ARCTIC-SLT/gen/qst001/ver1/1mix/0/alice01.mgc | /usr/local/SPTK-3.2/bin/x2x +fs | /usr/bin/sox -c 1 -s -w -t raw -r 16000 - -c 1 -s -w -t wav -r 16000 /home/xiexu/Desktop/hts/HTS-demo_CMU-ARCTIC-SLT/gen/qst001/ver1/1mix/0/alice01.wav
Many thanks
Xu2009/11/9 那兴宇 <nxy-yzqs@xxxxxxx>
Hi,Yes, the training process is running background so you cannot shutdown your computer and cygwin.You can check if it is over by typing "ps -ef". It is like the task manager in windows.
发件人: 谢旭发送时间: 2009-11-09 06:32:28收件人: hts-users抄送:主题: [hts-users:02279] Re: Questions aboutHTS-demo_CMU-ARCTIC-SLTHi,Thanks very much for 那兴宇 and Sébastien's replys. I talk about the the training part. I have checked the 'gen' folder, and there is no waveform.I am doing the training again now. The following is the last few lines of the Terminal:echo "Running a training/synthesis perl script (Training.pl) in background...."Running a training/synthesis perl script (Training.pl) in background..../usr/bin/perl scripts/Training.pl scripts/config.pm > log 2>&1 &xiexu@xiexu-laptop:~/Desktop/hts/HTS-demo_CMU-ARCTIC-SLF$Does this mean that the training is still in process but in background and that I can not shut down the computer until the training finishes? If it is as I just said, how to check whether the training is over?Thanks a lotXu2009/11/9 那兴宇 <nxy-yzqs@xxxxxxx>
Have you checked the background running of the demo?If so, check the generated waveforms. I don't think there is any waveform in your 'gen' directory.Then you can see the log file to locate where error occured.
发件人: 谢旭发送时间: 2009-11-08 18:29:13收件人: hts-users抄送:主题: [hts-users:02276] Questions about HTS-demo_CMU-ARCTIC-SLTHi all,
I have installed the demo and no errors has occurred. But the installation took about 30 minutes, not like the INSTALL file says "After composing training data, HMMs are estimated and speech waveforms are synthesized. It takes about 6 to 12 hours :-)".
If the installation is right, what should I do next? Is there a way like festival SayText?
Many thanks
Xu