[hts-users:01670] Re: Stability problem using LSP

Subject: [hts-users:01670] Re: Stability problem using LSP

From: "Geoffrey Wilfart" <geoffrey.wilfart@xxxxxxxxx>

Date: Tue, 9 Sep 2008 15:02:51 +0200

Delivered-to: hts-users@xxxxxxxxxxxxxxx

Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:in-reply-to:mime-version:content-type:references; bh=FRzaLgFqiXoiGtOH3WKS5MkL1iCoBpaudLQi/W8wji8=; b=Ml5VNEFJzweSYfkMa0OV2B3/p2mzlvCJE9s2ESyF2Oetn5aH7T3Rmq64F40vLK7GVi zmFhTlB45GCFmwu5GhSyjc5wVvcz9WwVeFYCQx/8/SW4mWSZu9GlY2bBzi/Z/3hP7mY8 4sqOsDkwyIQxCIUJzha4aNGIKVf/AEj/4O7rE=

Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version :content-type:references; b=U83MlQHfkD9iHxz41VnA8PHNjtJzR0vaYZmOFgo3Bt/JWV/Dq9xh4CkBWt3OVKb5Ui FYkUcrXUrzMkTg23EuOReCicQJMhG8iB8XcTHwVI0MI7/LQBuxBGgbRRZ6PwQRjfwxZr 8qxwX/Q2l5edLi1xHGngWUxugemjxMMy8hwUo=

Hi all,

Thank you for all your answers.

I am not using STRAIGHT, not was using global variance on that experiment. I have checked that I don't have any state duration being zero.

Thanks to your help, though, I was able to find out what my problem is: it turns out that some LSP coefficients do get over Pi.

I think I can find some ad-hoc solution at the vocoding stage.

Thanks again for your help.

Geoffrey

2008/9/9 Heiga ZEN (Byung Ha CHUN) <heiga.zen@xxxxxxxxxxxxxxxxx>

Hi,

Geoffrey Wilfart wrote:

Thank you for your answer. I have already used both, and never got any issue when using mel-cepstrum parameters.

I was willing to use generalized LPC-LSP parameters on mel scale, as they're reported to give the best MOS in:

H. Zen, T. Toda, K. Tokuda, "The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006", IEICE Trans. on Information and Systems, 2006.

In that paper, we used STRAIGHT to extract spectra and then extracted 39-order LSPs for each frame. Are you using STRAIGHT? And if GV is used, synthesis filters obtained from generated LSPs sometimes get unstable (mcep achieved the best MOS in that experiment when GV was used).

Best regards,

Heiga ZEN (Byung Ha CHUN)

--
Heiga ZEN (Byung Ha CHUN)
Speech Technology Group
Cambridge Research Lab
Toshiba Research Europe
phone: +44 1223 436975

______________________________________________________________________
This email has been scanned by the MessageLabs Email Security System.
For more information please visit http://www.messagelabs.com/email ______________________________________________________________________