[hts-users:03671] voice building error
- Subject: [hts-users:03671] voice building error
- From: Andrei Barbos <andreibarbos.banc@xxxxxxxxx>
- Date: Wed, 13 Mar 2013 12:14:59 +0200
- Delivered-to: hts-users@xxxxxxxxxxxxxxx
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:date:message-id:subject:from:to :content-type; bh=gCJmTk2ArS7AZL+HwOBDeMClwWaUkJr+Xl84l22wcak=; b=sLk5h5qEeYCXUUk4Eo1PkWn8gpdUzGsJYIMmHuUU/K1CTugi4RFZj+s5MZF00cl+ou H3+4ftdXd3gFRWD1bE3z+Dsp5qznr1uzcaGkk0L6gibb8M2W1vED6W6pg6Wp9MWxDtiH FaBSjtNzXSVWBWIiQiSjO0OvAlozZzBRXrVxOgKN3aHjYdj8IUgybNwia99c69Zaphk6 7n88VLVJtOlC7dWR3KpnriF7hFJFlfTNf6F2F4YNO+b/H/RLwXHm6mLw409f/x2L10TH Hft1rmK7PpNpwjDzQPbi0h3oHQHARc1QxjXKwhbm/IQeM+Xv4T8Tv3xEfJIKC/0ucjKQ 6EbQ==
Hi everybody,
I am using HTS2.2 and I want to build a voice for Romanian but I just can't complete the voice building process because of an error issued by HHEd. The strange thing is that it just stops working without any error message as seen in the following excerpt from the log file.
--------------------------------------------------
Clustering state 2 lsf
--------------------------------------------------
/home/andrei/Apps/CSTRVoiceClone/bin/HHEd -A -B -C /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//config/general.conf -C /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//config/clust.conf -D -V -H /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//training/EmbeddedTraining/2/full.cmp.mmf.lsf -T 1 -i -m -a 5.00000000000000000000 -p -r 1 -s -w /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//training/ContextClustering-lsf/2/clustered.cmp.mmf /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//training/ContextClustering-lsf/cluster.lsf.hed.2 /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//list/full.lst
HTK Configuration Parameters[9]
Module/Tool Parameter Value
# SHRINKOCCTHRESH Vector 6 1488.0 1100.0 600.0 2000.0 2000.0 2000.0
# MINLEAFOCC 5
# MAXSTDDEVCOEF 10
# DURVARFLOORPERCENTILE 1.000000
# APPLYDURVARFLOOR TRUE
# VFLOORSCALESTR Vector 6 0.01 0.01 0.01 0.01 0.01 0.01
# APPLYVFLOOR TRUE
# NATURALWRITEORDER TRUE
# NATURALREADORDER TRUE
HTK Version Information
Module Version Who Date : CVS Info
HHEd 3.4.1 CUED 12/03/09 : $Id: HHEd.c,v 1.105 2011/06/16 05:18:47 uratec Exp $
HShell 3.4.1 CUED 12/03/09 : $Id: HShell.c,v 1.15 2011/06/16 04:18:29 uratec Exp $
HMem 3.4.1 CUED 12/03/09 : $Id: HMem.c,v 1.11 2011/06/16 04:18:29 uratec Exp $
HLabel 3.4.1 CUED 12/03/09 : $Id: HLabel.c,v 1.12 2011/06/16 04:18:29 uratec Exp $
HMath 3.4.1 CUED 12/03/09 : $Id: HMath.c,v 1.15 2011/06/16 04:18:29 uratec Exp $
HSigP 3.4.1 CUED 12/03/09 : $Id: HSigP.c,v 1.8 2011/06/16 04:18:29 uratec Exp $
HWave 3.4.1 CUED 12/03/09 : $Id: HWave.c,v 1.11 2011/06/16 04:18:29 uratec Exp $
HAudio 3.4.1 CUED 12/03/09 : $Id: HAudio.c,v 1.9 2011/06/16 04:18:28 uratec Exp $
HVQ 3.4.1 CUED 12/03/09 : $Id: HVQ.c,v 1.8 2011/06/16 04:18:29 uratec Exp $
HModel 3.4.1 CUED 12/03/09 : $Id: HModel.c,v 1.42 2011/06/16 05:07:56 bonanza Exp $
HParm 3.4.1 CUED 12/03/09 : $Id: HParm.c,v 1.15 2011/06/16 04:18:29 uratec Exp $
HUtil 3.4.1 CUED 12/03/09 : $Id: HUtil.c,v 1.31 2011/06/16 04:18:29 uratec Exp $
HAdapt 3.4.1 CUED 12/03/09 : $Id: HAdapt.c,v 1.64 2011/06/16 04:15:50 uratec Exp $
HHEd
12915/12915 Models Loaded [7 states max, 2 mixes max]
RO 0.00 ''
Setting outlier threshold for clustering
RO->LS /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//training/EmbeddedTraining/cmp.sts
and loading state occupation stats
Stats loaded for 12916 models
Mean Occupation Count = 4.260795
TR 1
Adjusting trace level
--------------------------------------------------
Realtime 0:00:01
--------------------------------------------------
Error in this command : /home/andrei/Apps/CSTRVoiceClone/bin/HHEd -A -B -C /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//config/general.conf -C /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//config/clust.conf -D -V -H /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//training/EmbeddedTraining/2/full.cmp.mmf.lsf -T 1 -i -m -a 5.00000000000000000000 -p -r 1 -s -w /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//training/ContextClustering-lsf/2/clustered.cmp.mmf /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//training/ContextClustering-lsf/cluster.lsf.hed.2 /home/andrei/AppData/nsr_data/Spannish-v1/test_voice/acoustic_model//list/full.lst.
I managed to build voices using HTS2.2 but only from small subsets of larger corpora( the corpora used consisted of high quality recordings downsampled to 16 kHz and 16 bit per sample) but when I increased the dimension of the subsets the above presented error occurred. As a note I am not using STRAIGHT for the vocoding, but I consider that to be irrelevant since I got the same error even when using STRAIGHT.
I'm stuck at this point so any help/suggestions will be welcomed. And if you need more info regarding the setup please let me know.
Kindest regards,
Andrei Barbos