[hts-users:03444] Re: Issue with GV
- Subject: [hts-users:03444] Re: Issue with GV
- From: Keiichiro Oura <uratec@xxxxxxxxxxxxxxx>
- Date: Tue, 30 Oct 2012 23:21:01 +0900
- Cc: uratec <uratec@xxxxxxxxxxxx>
- Delivered-to: hts-users@xxxxxxxxxxxxxxx
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; bh=rRKwWXKJtEu9zhzGczh4bW+P3Xb2UZ9/lPJ1LpbQDr4=; b=qrUDkek16T9XSo5i13zaAkgollEmo2pVqz5Kzb8yMUm3+5CgBMXJRjCc0y/yv5pBiE qEenS0Y5QfzFenyB9HL12iGAVyzcNy4Uy0EwwIWThvkHh9V0NPphnIfEa1AG2KzBB5uK n1/au1cVleF1ulF6jjNAaX+uycAKl6drpguQoTyzNx8YRp0+qsujvTJdeZSqNdJ5Iqju 9Somt43ucirWaDPYBwr6h3TWsooNplbJsOwXdy7uRP3kNREOILCPK3aQDWfB7GUhfOO2 wrnpZvLKLXAmYiz4pE/tFlBYm7utT+7VIHVrq9SZTxx70zvUPaX1GLBSPCxJVXfz+G0+ DGzg==
Hi
How about speech analysis parameters such as -e option of 'mcep'
command in SPTK?
Regards,
Keiichiro Oura
2012/10/30 Veera Raghavendra <raghavendra@xxxxxxxxxx>:
> I have tried many parameter tuning. But, the spikes are still present in the
> synthesized file.
>
> The analysis shows that the spikes are happening when vowel phone is
> followed by nasal sound. Can I change any thing in the context dependent
> modeling to avoid these spikes?
>
> Thanks,
> Raghavendra.
>
>
>
> On Fri, Oct 19, 2012 at 8:01 PM, Keiichiro Oura <uratec@xxxxxxxxxxxxxxx>
> wrote:
>>
>> Hi,
>>
>> You can try to change many settings for GV in the HTS demo scripts.
>>
>> MAXGVITER maximum number of iterations of GV-based parameter
>> generation algorithm (default=50)
>> GVEPSILON convergence factor for GV iteration (default=0.0001)
>> MINEUCNORM minimum Euclid norm for GV iteration (default=0.01)
>> STEPINIT initial step size (default=1.0)
>> STEPINC step size acceleration factor (default=1.2)
>> STEPDEC step size deceleration factor (default=0.5)
>> HMMWEIGHT weight for HMM output prob. (default=1.0)
>> GVWEIGHT weight for GV output prob. (default=1.0)
>> OPTKIND optimization method (STEEPEST, NEWTON, or LBFGS)
>> (default=NEWTON)
>> NOSILGV turn on GV without silent and pause phoneme (0:off or
>> 1:on, default=1)
>> CDGV turn on context-dependent GV (0:off or 1:on, default=1)
>>
>> Regards,
>> Keiichiro Oura
>>
>>
>> 2012/10/19 Veera Raghavendra <raghavendra@xxxxxxxxxx>:
>> > Dear All,
>> >
>> > I found the difference in the quality with and without GV.
>> >
>> > If GV is turned-on, the voice quality is very clear but there are sudden
>> > spikes. These spikes destroys the synthesis quality.
>> >
>> > If GV is turned-off, there are no spikes but the content is not clear.
>> >
>> > Do I need to change any settings in HSMMAlign.
>> >
>> > Thanks,
>> > Raghavendra.
>>
>
- Follow-Ups
-
- [hts-users:03448] Re: Issue with GV, Veera Raghavendra
- References
-
- [hts-users:03433] Issue with GV, Veera Raghavendra
- [hts-users:03434] Re: Issue with GV, Keiichiro Oura
- [hts-users:03442] Re: Issue with GV, Veera Raghavendra