[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:01472] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP

Subject: [hts-users:01472] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP
From: "ext-xuchen.2.yao@xxxxxxxxx" <ext-xuchen.2.yao@xxxxxxxxx>
Date: Wed, 25 Jun 2008 15:17:29 +0800
Delivered-to: hts-users@xxxxxxxxxxxxxxx
Thread-index: AcjWeljJ+L4v+wjkQRmBlP9ZgmPoTgAGKFPQAAAbZBA=
Thread-topic: [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP

Sorry, I sent to wrong mail receipt;-(

Please just ignore it.

Xuchen 

-----Original Message-----
From: Yao Xuchen.2 (EXT-Fesco/Beijing) 
Sent: Wednesday, June 25, 2008 3:15 PM
To: 'hts-users@xxxxxxxxxxxxxxx'
Subject: RE: [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP

ZEN给出了为什么hts_engine 无法产生正常语音的原因。

对于第二个问题，看样子我们还需讨论一下frequency warping，还有到底选用什么参数，LSP或者如ZEN所说的mel-LSP?

-----Original Message-----
From: ext Heiga ZEN (Byung Ha CHUN) [mailto:zen@xxxxxxxxxxxxxxx] 
Sent: Wednesday, June 25, 2008 12:17 PM
To: hts-users@xxxxxxxxxxxxxxx
Subject: [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP

Hi,

Xuchen Yao wrote (2008/06/25 12:17):

> I noticed that many speeches generated by hts_engine are truncated while 
> that generated by SPTK are normal. Here's my running environment:
>
> configs:
> 
> Config 1: 12-th order LSP,     linear gain
> 
>                  speeches generated by SPTK are OK, while wave files 
> generated by hts_engine are truncated (silence in the wave files).

Oops, this is because hts_engine API doesn't support linear gain.
I discussed it with Keiichiro Oura, maintainer of hts_engine API.
We will support linear gain in hts_engine API 1.0.
 
> Config 2: 24-th order Mel-cepstrum
> 
>                  speeches generated by SPTK and hts_engine are both OK.
> 
> 2. Speeches in Config 2 sound better than those in Config 1. Could 
> someone provide any experience that on what condition the speech 
> generated from LSP features achieve the best quality?

From our internal experiments we found that frequency warping strongly 
affected the final speech quality.
So how about trying to use mel-LSP rather than LSP?

Regards,

Heiga ZEN (Byung Ha CHUN)

-- 
------------------------------------------------
 Heiga ZEN     (in Japanese pronunciation)
 Byung Ha CHUN (in Korean pronunciation)

 Department of Computer Science and Engineering
 Nagoya Institute of Technology
 Gokiso-cho, Showa-ku, Nagoya 466-8555 Japan

 http://www.sp.nitech.ac.jp/~zen
------------------------------------------------

References
: [hts-users:01462] hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP, Xuchen Yao; [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP, Heiga ZEN (Byung Ha CHUN)

Prev by Subject: [hts-users:01471] Re: Trying new language on HTS
Next by Subject: [hts-users:01473] Re: Trying new language on HTS
Previous by thread: [hts-users:01470] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP
Next by thread: [hts-users:01483] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP