[hts-users:03365] Re: Regarding GV for spectrum
- Subject: [hts-users:03365] Re: Regarding GV for spectrum
- From: Kwan Lisa <lisakwan1102@xxxxxxxxx>
- Date: Mon, 25 Jun 2012 16:54:47 +0800
- Delivered-to: hts-users@xxxxxxxxxxxxxxx
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=vouncAscHaPPq0NMBixFblQpSCGPLgvTvQaCK0hWdms=; b=Rqa2pMEcaFQPzdZHJje6fpPMZB2m0vG+Y0lI3u0r9DgHEZ4uPbBVRgtRWrRIpPb2N2 UUKeH8HuNFfBbo92avxb3qt5SmuSMFsm0bcRgslbaFxIBrdEwTnpmqJNm3YFGzT/5Htz 4OGt5OyAi8N4S9HAgraqqSMqhqcounwVHMdpajusvLqsQBYECXDt7lUvp3WmslOLj1oA rPgDdAHp0VsZtXT5uPWZdP5y0zFiAfwdLmn85uQO24saQCQV/fftTvfw+LLPdyN3HTR9 5s0K15J4dAFFiAy8d7KGvNM9C4uj3mrx68Qfbo9GoODvB0AWRELf4yHepvR25uL0iqCu 3+wg==
Hi,
I thought I was using the GV model of the target speaker. But I am not
sure what kind of GV model I was using.
I found that when I use HMGenS to generate the voice for the speaker
independent(SI) model, my GV setting is:
GVHMMLIST /home/rex/TTS/grad/TTS2/AST_realign_20speakers_5tone/gv/tiedlist
GVMODELMMF
/home/rex/TTS/grad/TTS2/AST_realign_20speakers_5tone/gv/clustered_all.mmf
When I use HMGenS to generate the voice for the speaker dependent(SD)
model, my GV setting is still:
GVHMMLIST /home/rex/TTS/grad/TTS2/AST_realign_20speakers_5tone/gv/tiedlist
GVMODELMMF
/home/rex/TTS/grad/TTS2/AST_realign_20speakers_5tone/gv/clustered_all.mmf
According to my logs, it seemed that I didn't train GV model for the
target speaker. So I think the GV model I was using was the one of SI
model.
How should I train a speaker dependent GV model? I thought the GV
model used in the demo script was re-trained using adaptation data.
2012/6/25 Keiichiro Oura <uratec@xxxxxxxxxxxxxxx>:
> Hi,
>
> I guess you use speaker-independent GV model.
> If there are many adaptation utterances, could you try to calculate
> target speaker-dependent GV?
>
> Regards,
> Keiichiro Oura
>
>
> 2012/6/22 Kwan Lisa <lisakwan1102@xxxxxxxxx>:
>> Hi,
>>
>> I have question about the weight of GV for spectrum. When synthesis a
>> voice with hts-engine, there is a option controlling the weight of GV
>> for spectrum. I am using a kind of average voice model to adapt to the
>> target speaker, and I found when I tune the weight higher, the voice
>> is more similar to the target speaker. But there was a trade-off. The
>> higher the weight, the more explosive sound in the voice. I'm not sure
>> why it happened. How do I diminish the explosive noisy sound in the
>> voice?
>>
>> --
>> Lisa Kwan
>> lisakwan1102(at)gmail.com
>> Advanced Speech Technology Lab, ASTL
>>
>
--
Lisa Kwan
lisakwan1102(at)gmail.com
Advanced Speech Technology Lab, ASTL
- Follow-Ups
-
- [hts-users:03366] Re: Regarding GV for spectrum, Keiichiro Oura
- References
-
- [hts-users:03362] Regarding GV for spectrum, Kwan Lisa
- [hts-users:03363] Re: Regarding GV for spectrum, Keiichiro Oura