
[hts-users:00942] Re: Questions about certain concepts


Hi,

marc sobhy wrote (2007/11/18 20:14):

The thesis used mel-cepstral coefficients (mcp). Why didn't he use another speech parameterization such as LPC or MFCC?

LPC coefficients have a stability problem: if they are quantized too coarsely, the synthesis filter can become unstable.
A similar problem may occur when they are modeled by HMMs.
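
As a rough illustration of the stability issue (not taken from the thesis), the sketch below builds a stable second-order all-pole filter, quantizes its coefficients coarsely, and checks the pole radii. The sign convention A(z) = 1 + a1 z^-1 + a2 z^-2, the pole location, and the 3-bit quantizer over [-2, 2] are all assumptions chosen for the example.

```python
import numpy as np

def max_pole_radius(a):
    """Largest pole magnitude of the all-pole filter 1/A(z),
    where A(z) = 1 + a[1] z^-1 + ... + a[p] z^-p and a[0] == 1."""
    return float(np.max(np.abs(np.roots(a))))

def quantize(x, levels):
    """Map each value in x to the nearest entry of levels."""
    x = np.asarray(x, dtype=float)
    idx = np.argmin(np.abs(x[:, None] - levels[None, :]), axis=1)
    return levels[idx]

# Stable example filter: a conjugate pole pair at radius 0.98 (assumed values).
r, theta = 0.98, 0.2
a = np.array([1.0, -2.0 * r * np.cos(theta), r * r])

# Hypothetical 3-bit uniform quantizer covering [-2, 2].
levels = np.linspace(-2.0, 2.0, 8)
a_q = np.concatenate(([1.0], quantize(a[1:], levels)))

print(max_pole_radius(a))    # below 1: the original filter is stable
print(max_pole_radius(a_q))  # above 1: the quantized filter is unstable
```

The filter is stable only while every pole lies inside the unit circle, so a small change in the coefficients can be enough to push a pole that was close to the circle outside it.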

Please see http://hts.sp.nitech.ac.jp/hts-users/spool/2006/msg00097.html for why he didn't use MFCCs.

When he generated the cepstral coefficients, why didn't he use the original F0 instead of regenerating it along with the spectrum?

In their early papers, the original F0s were used.
However, to realize text-to-speech synthesis, F0s must also be generated, because original F0s are not available for unseen sentences.

Why did he take the log of F0 when extracting it from the speech database?

It worked better, and it is generally thought that human perception of frequency is logarithmic in nature.
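
To make the logarithmic point concrete, here is a minimal sketch (an illustration only, not the extraction code used in the thesis). It converts F0 values in Hz to log F0 and shows that two one-octave jumps, which differ in size on the linear scale, become equal steps in the log domain. The function name `to_log_f0`, the sample values, and the use of NaN for unvoiced frames (F0 = 0) are assumptions made for this example.

```python
import numpy as np

def to_log_f0(f0_hz):
    """Convert F0 in Hz to natural-log F0.
    Frames with F0 <= 0 are treated as unvoiced and marked NaN (assumed convention)."""
    f0_hz = np.asarray(f0_hz, dtype=float)
    log_f0 = np.full(f0_hz.shape, np.nan)
    voiced = f0_hz > 0
    log_f0[voiced] = np.log(f0_hz[voiced])
    return log_f0

# Three voiced frames, each one octave above the previous, then one unvoiced frame.
f0 = np.array([100.0, 200.0, 400.0, 0.0])
log_f0 = to_log_f0(f0)

linear_steps = np.diff(f0[:3])    # unequal steps in Hz
log_steps = np.diff(log_f0[:3])   # equal steps of ln(2) in the log domain

print(linear_steps)
print(log_steps)
```

Each octave is heard as the same interval, and in the log domain it is also represented by the same numerical difference.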

Regards,

Heiga ZEN (Byung Ha CHUN)

--
------------------------------------------------
Heiga ZEN     (in Japanese pronunciation)
Byung Ha CHUN (in Korean pronunciation)

Department of Computer Science and Engineering
Nagoya Institute of Technology
Gokiso-cho, Showa-ku, Nagoya 466-8555 Japan

http://www.sp.nitech.ac.jp/~zen
------------------------------------------------
