[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:00208] HTS-2.0RC1


Dear members of hts-users mailing list,

HTS working group would like to release the first release candidate of HTS-2.0 and a simple demonstration using CMU
ARCTIC database.
You can download them from

HTS-2.0RC1_for_HTK-3.3.patch.gz: http://kt-lab.ics.nitech.ac.jp/~zen/HTS-2.0RC1_for_HTK-3.3.patch.gz
(190813 bytes, MD5 checksum: 01c58b0b15ddf182cd00c8a0dab7ac80)

HTS-demo_CMU-ARCTIC-SLT.tar.gz:  http://kt-lab.ics.nitech.ac.jp/~zen/HTS-demo_CMU-ARCTIC-SLT.tar.gz
(97926828 bytes, MD5 checksum e1abe51aa34ad1ffcc496c939c3ecb4e)


This version has the following new features:

 - Based on HTK-3.3
 - Compilation without SPTK
 - Many fixed bugs
 - HCompV calculates Variance floor in double
 - HRest can generate state duration density (-g option)
 - Phoneme boundaries can be given to HERest (-e option).
   We may specify a part of phone boundaries, e.g, pause positions.
 - Reduced-memory implementation of decision-tree based context clustering of HHEd
   (-r option).
 - Each decision tree can have a name with regular expression
   (-p option).
     ex. TB 000 {(*-a+*, *-i+*, *-u+*, *-e+*, *-o+*).state[2]}
   As a result, deferent two trees can be constructed for
   consonants and vowels, respectively.
 - Flexible model structures can be handled in HMGenS (in the previous version,
   the first stream is assumed to be for mcep, and the others are
   assumed to be for logf0).
 - EM-based parameter generation algorithm (-c option) i.e.,
   multi-mixture models can be used.
     -c 0: Cholesky decomposition based parameter generation
     -c 1: EM (with fixed phone boundaries)
     -c 2: EM
 - miscellaneous changes.


This version is released as a patch code to HTK-3.3 (http://htk.eng.cam.ac.uk/).
After downloading HTK-3.3 and this patch code, please apply HTS-2.0RC1_for_HTK-3.3.patch on htk-3.3 directory as
follows:

 htk-3.3% patch -p1 -d . < HTS-2.0RC1_for_HTK-3.3.patch

Then run configure and make, please.


Festival Speech Synthesis System (http://www.cstr.ed.ac.uk/projects/festival/download.html) and SPTK
(http://kt-lab.ics.nitech.ac.jp/~tokuda/SPTK/index.html) are required to run demonstration.
Please download, configure, make, and install them.
After expanding HTS-demo_CMU-ARCTIC-SLT.tar.gz, please run configure.
If your PATH variable does not include SPTK, Festival and HTS binary directory, please run configure as follows:

$ PATH=/usr/local/SPTK/bin:/usr/local/HTS-2.0RC1_for_HTK-3.3/bin.linux:/usr/local/festival/examples:$PATH ./configure

Then run make, please.
After composing training data, training script will run in background.


This is release *candidate*, it may contain a number of bugs or problems.
We are planning to release the final version in this March or April.
Any bug reports or comments are highly appreciated, please send an e-mail to hts-users mailing list.
I'm looking forward to hearing from you :-)

Best regards,

Heiga Zen (Byung Ha Chun)

-- 
 ------------------------------------------------
  Heiga ZEN     (in Japanese pronunciation)
  Byung-Ha CHUN (in Korean pronunciation)

  Department of Computer Science and Engineering
  Nagoya Institute of Technology
  Japan

  web: http://kt-lab.ics.nitech.ac.jp/~zen
 ------------------------------------------------




Follow-Ups
[hts-users:00209] HTS-2.0RC1: patch problem, Nicholas Volk