Hi,
Did you calculate the GV (global variance) models on the adaptation data
for the target speaker? GV models calculated for the average voice are
sometimes too large for some speakers. (This would be crucial in the
log-gain case.)
I don't remember how HTS-demo calculates this, but please check this
first.
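
For reference, a rough sketch of what "GV calculated on the adaptation
data" could look like. This is not taken from HTS-demo; the function
names, the list-of-frames input layout, and the use of population
variance are all assumptions made for illustration only.

```python
from statistics import fmean, pvariance

def utterance_gv(frames):
    """Per-dimension variance over the frames of one utterance.

    frames: list of feature vectors (hypothetical layout, one per frame).
    """
    return [pvariance(dim) for dim in zip(*frames)]

def gv_stats(utterances):
    """Mean and variance of the per-utterance GVs, per dimension."""
    gvs = [utterance_gv(u) for u in utterances]
    per_dim = list(zip(*gvs))
    means = [fmean(d) for d in per_dim]
    variances = [pvariance(d) for d in per_dim]
    return means, variances

def gv_mean_ratio(average_voice_means, target_means):
    """Ratio of average-voice GV mean to target-speaker GV mean.

    Values well above 1.0 in a dimension (for example C0 or log gain)
    would suggest the average-voice GV is too large for this speaker.
    """
    return [a / t for a, t in zip(average_voice_means, target_means)]
```

Comparing the two sets of GV means dimension by dimension should show
quickly whether the average-voice GV is the likely cause.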
It might then also be good to normalize the amplitude level of the
adaptation data to that of the training data, to avoid a bad
transformation of the C0/gain terms.
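
A minimal sketch of such a level normalization, assuming the waveforms
are available as lists of 16-bit sample values and that matching RMS
level is an acceptable definition of "amplitude level". The function
names and the peak limit are hypothetical, not part of HTS.

```python
import math

def rms(samples):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def match_level(adapt_samples, train_rms, peak_limit=32767.0):
    """Scale adaptation samples so their RMS matches train_rms.

    The gain is capped so that the largest sample stays within
    peak_limit (assumed 16-bit range), which avoids clipping.
    """
    current = rms(adapt_samples)
    if current == 0.0:
        return list(adapt_samples)  # silence: nothing to scale
    gain = train_rms / current
    peak = max(abs(s) for s in adapt_samples)
    gain = min(gain, peak_limit / peak)
    return [s * gain for s in adapt_samples]
```

Here train_rms would be measured once over the training data, and the
features for adaptation re-extracted from the scaled waveforms.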
Regards,
Junichi Yamagishi
CSTR
On 12 Aug 2009, at 22:22, Tóth Bálint wrote:
Hi,
I am trying to adapt HTS to a new voice. The SAT average voice is ok:
http://alpha.tmit.bme.hu/~toth.b/hts_samples/SAT.wav
but after adaptation the volume overshoots:
http://alpha.tmit.bme.hu/~toth.b/hts_samples/SAT_dec_feat3.wav
The volume of the adaptation data is normal, there are no overshoots.
The adaptation of other voices works well.
Could you please help me find out what the problem might be?
Thanks in advance!
Best Regards,
Balint Toth