History source of Home(No. 10) - HMM/DNN-based speech synthesis system (HTS)

* HMM-based speech synthesis system [#qb249ac2]
//#contents
** Welcome! [#k4f3be02]
> The HMM-Based Speech Synthesis System (HTS) has been being developed by the HTS working group and others (see "Who we are" and "Acknowledgments" in "README" file). The basic core system of HTS was implemented as a modified version of [[HTK:http://htk.eng.cam.ac.uk/]] together with [[SPTK:http://kt-lab.ics.nitech.ac.jp/~tokuda/SPTK/]], and is released as HMM-Based Speech Synthesis System (HTS) in a form of patch code to [[HTK:http://htk.eng.cam.ac.uk/]]. The patch code is released under a free license, without commercial restrictions. However, it should be noted that once you apply the patch to the [[HTK:http://htk.eng.cam.ac.uk/]] source code, you must obey the [[license of HTK:http://htk.eng.cam.ac.uk/docs/license.shtml]].

> HTS version 1.1.1 comes with a small run-time synthesis engine (less than 1 MB including acoustic models), which can run without the HTK library. The current version does not include any text analyzer but the [[Festival Speech Synthesis System:http://www.festvox.org/festival/]] can be used as a text analyzer. This distribution includes a demo script using [[CMU ARCTIC US English awb:http://www.festvox.org/cmu_arctic/dbs_awb.html]], which generates "voices" for Festival.

> Two HTS voices for Festival trained by using [[CSTR US KED Timit:http://www.festvox.org/dbs/dbs_kdt.html]] and [[CMU Communicator KAL limited domain:http://www.festvox.org/dbs/dbs_com.html]], respectively, and four HTS voices for Festival trained by using [[CMU ARCTIC database:http://www.festvox.org/cmu_arctic/]] are also released with HTS version 1.1.1. They are based on the small synthesis engine. Each of HTS voices can be used as a "voice" of Festival Speech Synthesis System without any other HTS tools.

> For training Japanese voices, a demo script using the NIT database for speech synthesis "NIT JP ATR503 m001" is also prepared. Japanese voices trained by the demo script can be used on [[GalateaTalk:http://hil.t.u-tokyo.ac.jp/~galatea/]], which is a speech synthesis module of an open-source toolkit for anthropomorphic spoken dialogue agents developed in [[Galatea project:http://hil.t.u-tokyo.ac.jp/~galatea/]], without any other HTS tools. An HTS voice for [[GalateaTalk:http://hil.t.u-tokyo.ac.jp/~galatea/]] trained by the demo script is also released with HTS version 1.1.1.

** What's new and changed [#z8164883]
>''July 1, 2006:''
>> [[HTS version 2.0 RC2:http://kt-lab.ics.nitech.ac.jp/hts-users/spool/2006/msg00175.html]] was released to members of [[hts-users mailing list>Mailing List]].~
>''March 3, 2006:''
>> [[HTS version 2.0 RC1:http://kt-lab.ics.nitech.ac.jp/hts-users/spool/2006/msg00021.html]] was released to members of [[hts-users mailing list>Mailing List]]. ~
>''February 15, 2006:''
>> HTS version 2.0 RC0 was released to the internal working group. ~
>''December 26, 2003:''
>> HTS version 1.1.1 was released. The new features were~
- Based on HTK-3.2.1
- Demo script for ARCTIC database
- Demo script for an original database (Japanese)
- Variance flooring in demo script
- Postfiltering in hts-engine
- Many fixed bugs 

>''Oct. 14, 2003:''
>> New HTS voices trained by ARCTIC databases were released. 

>''June 11, 2003:''
>>  HTS version 1.1b was released.

>''May 9, 2003:''
>> HTS version 1.1 was released. The new features were~
- A small synthesis engine (to be called from Festival).
- HMM file format converter for the engine.
- Many fixed bugs (Thanks for reporting them).
- Accompanied by HTS voices for Festival. 

>''January 21, 2003:''
>> Minor revision was made to HTS version 1.0. 

>''December 25, 2002:''
>> HTS version 1.0 was released.
HMM/DNN-based Speech Synthesis System (HTS) - History source of Home (No. 10)