[hts-users:01472] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP
- Subject: [hts-users:01472] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP
- From: "ext-xuchen.2.yao@xxxxxxxxx" <ext-xuchen.2.yao@xxxxxxxxx>
- Date: Wed, 25 Jun 2008 15:17:29 +0800
- Delivered-to: hts-users@xxxxxxxxxxxxxxx
- Thread-topic: [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP
Sorry, I sent this to the wrong recipient ;-(
Please just ignore it.
Xuchen
-----Original Message-----
From: Yao Xuchen.2 (EXT-Fesco/Beijing)
Sent: Wednesday, June 25, 2008 3:15 PM
To: 'hts-users@xxxxxxxxxxxxxxx'
Subject: RE: [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP
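ZEN has explained why hts_engine cannot generate normal speech.
As for the second question, it looks like we still need to discuss frequency warping, and which parameters to actually use: LSP, or mel-LSP as ZEN suggested?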
-----Original Message-----
From: ext Heiga ZEN (Byung Ha CHUN) [mailto:zen@xxxxxxxxxxxxxxx]
Sent: Wednesday, June 25, 2008 12:17 PM
To: hts-users@xxxxxxxxxxxxxxx
Subject: [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP
Hi,
Xuchen Yao wrote (2008/06/25 12:17):
> I noticed that many utterances generated by hts_engine are truncated, while
> those generated by SPTK are normal. Here's my running environment:
>
> configs:
>
> Config 1: 12-th order LSP, linear gain
>
> speeches generated by SPTK are OK, while wave files
> generated by hts_engine are truncated (silence in the wave files).
Oops, this is because hts_engine API doesn't support linear gain.
I discussed it with Keiichiro Oura, maintainer of hts_engine API.
We will support linear gain in hts_engine API 1.0.
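Until linear gain is supported, one conceivable workaround is to store the gain in the log domain before training, assuming the engine expects log gain for LSP. The following is only a minimal sketch of that conversion, not something confirmed in this thread: the helper name is hypothetical, and the frame layout [gain, lsp_1, ..., lsp_M] with the gain as the first element is an assumption that should be checked against your own feature extraction.

```python
import math


def linear_to_log_gain(frame):
    """Return a copy of one LSP frame with its gain converted to log gain.

    Assumption: frame[0] is the linear gain and frame[1:] are the LSP
    coefficients. This layout is hypothetical; check your own data.
    """
    gain = frame[0]
    if gain <= 0.0:
        # The logarithm is undefined for non-positive gain values.
        raise ValueError("linear gain must be positive")
    return [math.log(gain)] + list(frame[1:])


if __name__ == "__main__":
    # Hypothetical frame: linear gain followed by three LSP coefficients.
    print(linear_to_log_gain([2.5, 0.12, 0.31, 0.47]))
```

Only the first element changes; the LSP coefficients are passed through untouched.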
> Config 2: 24-th order Mel-cepstrum
>
> speeches generated by SPTK and hts_engine are both OK.
>
> 2. Speech in Config 2 sounds better than speech in Config 1. Could
> someone share their experience of the conditions under which speech
> generated from LSP features achieves the best quality?
From our internal experiments we found that frequency warping strongly
affected the final speech quality.
So how about trying to use mel-LSP rather than LSP?
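To illustrate the frequency warping mentioned above: mel-LSP is generally obtained by analysing speech on a frequency axis warped by a first-order all-pass function with parameter alpha, whereas plain LSP corresponds to no warping (alpha = 0). The sketch below only evaluates that warping curve. The function name is hypothetical, and alpha = 0.42, a commonly cited approximation of the mel scale at 16 kHz sampling, is an assumption rather than a setting confirmed in this thread.

```python
import math


def warp_frequency(omega, alpha):
    """Warped angular frequency (radians) for a first-order all-pass warp.

    omega: original angular frequency in [0, pi].
    alpha: warping parameter with |alpha| < 1; alpha = 0 means no warping.
    """
    return omega + 2.0 * math.atan2(alpha * math.sin(omega),
                                    1.0 - alpha * math.cos(omega))


if __name__ == "__main__":
    alpha = 0.42  # assumed value, often used for 16 kHz speech
    for k in range(5):
        omega = k * math.pi / 4.0
        print(f"omega = {omega:.3f} rad -> warped = {warp_frequency(omega, alpha):.3f} rad")
```

With a positive alpha the low-frequency region is stretched and the high-frequency region compressed, so more of the model's resolution goes to the lower part of the spectrum. The endpoints 0 and pi map to themselves.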
Regards,
Heiga ZEN (Byung Ha CHUN)
--
------------------------------------------------
Heiga ZEN (in Japanese pronunciation)
Byung Ha CHUN (in Korean pronunciation)
Department of Computer Science and Engineering
Nagoya Institute of Technology
Gokiso-cho, Showa-ku, Nagoya 466-8555 Japan
http://www.sp.nitech.ac.jp/~zen
------------------------------------------------
- References
- [hts-users:01462] hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP, Xuchen Yao
- [hts-users:01463] Re: hts_engine-0.99 does not generate normal speech in HTS-demo using 12-th order LSP, Heiga ZEN (Byung Ha CHUN)