[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:00461] HTS-2.0RC3


Dear members of hts-users mailing list,


The HTS working group would like to release the 3rd release candidate of HTS-2.0.
They can be downloaded from

	Patch code:
	http://kt-lab.ics.nitech.ac.jp/~zen/HTS-2.0RC3_for_HTK-3.4-alpha.patch.bz2
	(190670 bytes, MD5 checksum: aa7b6f56fc1dc0f51bfaf46dca9934d7)
	
	Speaker dependent demo:
	http://kt-lab.ics.nitech.ac.jp/~zen/HTS-demo_CMU-ARCTIC-SLT.tar.bz2
	(83386649 bytes, MD5 checksum d5f709b89aea5d4b026d45d6dc4ac6a4)

	Speaker adaptation/adaptive training demo:
	http://kt-lab.ics.nitech.ac.jp/~zen/HTS-demo_CMU-ARCTIC-ADAPT.tar.bz2
	(515973742 bytes, MD5 checksum f795fd33c8f65bf4d3b673fb2f00bfeb)


Differences between RC2 and RC3 are as follows:
- Speaker adaptation/adaptive training of MSD and multi-stream HMMs
- Speech parameter generation with speaker adapted, full covariance, or non left-to-right HMMs.
- Interface of HMGenS is moved from HHEd style to HERest style
- Miscellaneous fixes
- Demonstration for speaker adaptation/adaptive training


This version is released as a patch code to HTK-3.4-alpha (http://htk.eng.cam.ac.uk/).
You can download it from the following addresses:

	HTK 3.4 (alpha):
	http://htk.eng.cam.ac.uk/ftp/software/HTK-3.4-alpha.tar.gz

	HDecode:
See extensions page on HTK website - http://htk.eng.cam.ac.uk/extensions/index.shtml

After downloading HTK-3.4-alpha, HDecode and HTS-2.0RC2 patch code, please expand them (tar -jxvf) and apply the patch code on htk-3.4 directory as
follows:

	htk-3.4% patch -p1 -d . < HTS-2.0RC3_for_HTK-3.4-alpha.patch

Then please run configure and make.


Festival Speech Synthesis System (http://www.cstr.ed.ac.uk/projects/festival/download.html) and SPTK
(http://kt-lab.ics.nitech.ac.jp/~tokuda/SPTK/index.html) are required to run the demonstration.
Please download, configure, make, and install them.


Many people have asked us about F0 extraction, so from RC2 we decided to include a F0 extraction program.
Fortunately, ESPS get_f0 has been included in the open-source speech toolkit *Snack* (http://www.speech.kth.se/snack/).
In the release we include a Tcl/Tk script to call get_f0 function from Snack.
Therefore, you also have to install Tcl/Tk and Snack on your environment to run the demonstration.
ActiveState (http://www.activestate.com/) provides a Tcl/Tk distribution named ActiveTcl for many platforms (http://www.activestate.com/Products/ActiveTcl/?mp=1).
This distribution includes Snack and it is easy to install and use.
I recommend you to use this for running the demonstration (Actually I'm using it).

After expanding HTS-demo_CMU-ARCTIC-{SLT,ADAPT}.tar.bz2, please run configure.
If your PATH variable does not include SPTK, Festival, tclsh (with Snack) and HTS binary directory, please run configure as follows:

$ PATH=/usr/local/SPTK/bin:/usr/local/HTS-2.0RC3_for_HTK-3.4/bin:/usr/local/festival/examples:/usr/local/Activetcl/bin:$PATH ./configure

Then run make.
After composing training data, training script will be run in background.


This would be the final release candidate.
We are planning to release the final version at the end of this month.
Any bug reports and comments are highly appreciated, please mail to the hts-users mailing list.
I'm looking forward to hearing from you :-)


Best regards,

Heiga ZEN (Byung Ha CHUN)

--
------------------------------------------------
 Heiga ZEN     (in Japanese pronunciation)
 Byung-Ha CHUN (in Korean pronunciation)

 Department of Computer Science and Engineering
 Nagoya Institute of Technology
 Japan

 web: http://kt-lab.ics.nitech.ac.jp/~zen
------------------------------------------------


Follow-Ups
[hts-users:00462] Re: [hts-wg:00223] HTS-2.0RC3, Heiga ZEN (Byung Ha CHUN)
[hts-users:00476] Re: HTS-2.0RC3, Heiga ZEN (Byung Ha CHUN)