[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:03173] Problem of building regression tree (average voice model)


Hi,

I've been trying to build a regression tree for speaker adaptation. I am using HTS 2.1.1. I've trained a average voice model from 4000 sentences (about 2.5 hrs) of 20 speakers. It was successful to generate voice using the average voice model. I wanted to apply speaker adaptation on this average voice model, so I tried to build a regression tree with the command below: 
/usr/local/HTS-2.1.1/bin/HHEd -A -B -C /home/jay/TTS/try/AST_female_20_speakers_2/configs/trn.cnf -D -T 1 -p -i -H /home/ jay /TTS/try/AST_female_20_speakers_2/models/cmp/re_clust
ered.mmf -M /home/ jay /TTS/try/AST_female_20_speakers_2/models/cmp/regTrees /home/ jay /TTS/try/AST_female_20_speakers_2/edfiles/cmp/reg.hed /home/ jay /TTS/try/AST_female_20_speaker
s_2/data/lists/full.list

The problem was that splitting of nodes did finish. It seemed to be in a loop, and the score stayed the same. So the HHEd command cannot stop.  The log file shows as below:

HTK Configuration Parameters[10]
  Module/Tool     Parameter                  Value
#                 MINDUR                         5
#                 MAXSTDDEVCOEF                 10
#                 APPLYDURVARFLOOR              TRUE
#                 DURVARFLOORPERCENTILE          1.000000
#                 SHRINKOCCTHRESH  Vector 4 500.0 100.0 100.0 100.0
#                 VFLOORSCALESTR  Vector 4 0.01 0.01 0.01 0.01
#                 MINLEAFOCC                     0
#                 NATURALWRITEORDER              TRUE
#                 NATURALREADORDER              TRUE
#                 APPLYVFLOOR                 TRUE

// construct regression class tree
RC 32 reg
 Building regression tree with 32 terminals (4 streams)
Creating regression class tree with ident reg.tree and baseclass reg.base
Splitting Node 1, score 1.000000e+10
(Stream splitting)
Splitting Node 3, score 1.000000e+10
(Stream splitting)
Splitting Node 5, score 1.000000e+10
(Stream splitting)
Splitting Node 7, score 1.000000e+10
(MSD splitting)
Splitting Node 6, score 1.000000e+10
(MSD splitting)
Splitting Node 10, score 8.998759e+10
(Stream=3, vSize=1)
Splitting Node 13, score 2.999760e+10
(Stream=3, vSize=1)
Splitting Node 4, score 1.000000e+10
(MSD splitting)
Splitting Node 15, score 9.997541e+09
(Stream=3, vSize=1)
Splitting Node 19, score 9.997541e+09
(Stream=3, vSize=1)
Splitting Node 21, score 9.997541e+09
(Stream=3, vSize=1)
...
...
...
Splitting Node -16495, score 9.997541e+09
(Stream=3, vSize=1)
Splitting Node -16493, score 9.997541e+09
(Stream=3, vSize=1)
Splitting Node -16491, score 9.997541e+09
(Stream=3, vSize=1)

Could you do me a favor to help the problem? My questions are:
1: What could be the reason or problem result in this endless splitting node situation.
2:Could it be the problem with the average modeling? Is there any option to enable average modeling? I trained the average model just as speaker dependent model with the same scripts, except the training data from different people.

Thank you.

Regards,
Jay

Follow-Ups
[hts-users:03174] Re: Problem of building regression tree (average voice model), Keiichiro Oura