[hts-users:00322] Re: some questions about hts_engine and HTS_demo
- Subject: [hts-users:00322] Re: some questions about hts_engine and HTS_demo
- From: "lei liu" <wenshiliu@xxxxxxxxx>
- Date: Thu, 25 May 2006 16:52:01 +0800
- Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=muw0Hsx+phTPrj8Ag48jYtKyck9NmzDDzqHV+CD3Y+xluMG5G+V4ts9cgB9i9uTsADgeaMmUsQzQK5lxYGBmEHdH9hZb9jrIQLbQhhgLyZut/NgqGJGCbf8YWEbyQqIJR2NE4HlB29Hyz0wf5wIdMk07V5/zjR5jaAkv05Z1IO8=
Hi
thank you
I will check my pdf file.
In the mcp.pdf,
it is defined 75 dimension for the mean vector of leaf node,
as this
"4byte float, 75th dim. of mean vector at last leaf node"
why?
Best regards
2006/5/25, Heiga ZEN (Byung Ha CHUN) <zen@xxxxxxxxxxxxxxxx>:
Hi
lei liu wrote:
> I have used
> swab +f mcp.pdf | dmp +i | less (to review header parts)
> swab +f mcp.pdf | dmp +f | less (to review distributions)
>
> to see the mcp.pdf file in the hts-demo.
>
> "mcp.pdf" in the hts-demo is as this
>
> header
> 0 12313123
> 1 12121231
These values are weird so maybe your mcp.pdf is corrupted.
If you are using Big Endian arch machine such as SPARC or PowerPC, do not swap byte order.
> ..................
>
> pdf
> 0 0.123131
> ................
>
> The first column, I think it is serial number.
> what's the second column's mean.
Statistics.
> another question
> I aslo used dmp +f to open the ". mcep " and ".pit" files that are
> used to generate speech.
> Here is the ".pit",
>
> 0 123.123123
> 1 324.234244
> .........................
>
> but I found there are different numbers of value in
> the two file.
>
> maybe there are 14781 values int the mcep file but 777 int the pit file
>
> if " MCEPORDER = 18 " , does it mean there are 36 mels (static and
> dynamic ) and 2 f0s (static and dynamic).
> so f0's numbers * 18 = mel's numbers
No.
The zero-th mel-cepstral coefficients (c0) are also included.
And generated mcep and pit do not include dynamic features.
> finally if I want to modify some values in the .pit file to modify
> speech's characteristic,
> what tool can I use to do that ?
You can shift f0 values using hts_engine but we don't have any tools to modify part of pitch values.
Regards,
Heiga Zen (Byung Ha Chun)
--
------------------------------------------------
Heiga ZEN (in Japanese pronunciation)
Byung Ha CHUN (in Korean pronunciation)
Department of Computer Science and Engineering
Nagoya Institute of Technology
Gokiso-cho, Showa-ku, Nagoya 466-8555 Japan
http://kt-lab.ics.nitech.ac.jp/~zen
------------------------------------------------
- Follow-Ups
-
- [hts-users:00323] Re: some questions about hts_engine and HTS_demo, Heiga ZEN (Byung Ha CHUN)
- References
-
- [hts-users:00318] some questions about hts_engine and HTS_demo, lei liu
- [hts-users:00319] Re: some questions about hts_engine and HTS_demo, Heiga ZEN (Byung Ha CHUN)
- [hts-users:00320] Re: some questions about hts_engine and HTS_demo, lei liu
- [hts-users:00321] Re: some questions about hts_engine and HTS_demo, Heiga ZEN (Byung Ha CHUN)