[hts-users:03639] Re: hts questions

Subject: [hts-users:03639] Re: hts questions

Date: Tue, 26 Feb 2013 22:22:26 +0800

Delivered-to: hts-users@xxxxxxxxxxxxxxx

Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:content-type; bh=rb/wXWcMi6PooEJpBpbjnq0Cykl5grnVE4AOZagKi5k=; b=g8tn5j6ssyO4a3widzIOOSaVyTCwXpUEaSRr8QjpPETrQgtzYTuhsTDUaJYM/CmHoN T78u0bb7nU8UvXW9lV7aQIcCdIDh8cQUKY1/Gdi/uQ8KMIfa2+ApwmbB/7HJqAdaAzag Q6YC/P6bRTX9xhExCxVsUg/Hio7+wcM4lyK5XrQqAaEYTzb2/18X92gA+jj1VyhVcVLC 1dwzDbHKxwiaw5+JTzi6a9uhstIViBWJZdyiKJhfeuoIyMTWmHSga/UgtiHwg6/LZO1J SG9wWC0GcoU65D8V0vNfarAbSA6eEEjDnfNPM6WsY3yexFshOnmx7daKRHGayImjgjus iY0A==

Hi, Georg,

Yes, the likelihood of the model generating the training data is maximized when selecting the best question from many questions. Minimum Description Length (MDL) is the optimization criterion deciding when to stop the top-down splitting procedure. Two useful articles are:

1. Odell, J. J. (1995). The Use of Context in Large Vocabulary Speech Recognition. PhD thesis, Cambridge University.

2. Shinoda, K. and T. Watanabe (2000). "MDL-based context-dependent subword modeling for speech recognition." Acoustical Science and Technology 21(2): 79-86.

Yang Wang

2013/2/26 Georg Schlunz <gschlunz@xxxxxxxxxx>

Hi

According to which optimum criteria in the data are the HTS questions selected during decision tree building? Is it some kind of information content or likelihood?

Thanks

Georg

--
This message is subject to the CSIR's copyright terms and conditions, e-mail legal notice, and implemented Open Document Format (ODF) standard.
The full disclaimer details can be found at http://www.csir.co.za/disclaimer.html.

This message has been scanned for viruses and dangerous content by MailScanner,
and is believed to be clean.

Please consider the environment before printing this email.