[hts-users:03346] Re: About the prototype definition of the MSD-HMM
- Subject: [hts-users:03346] Re: About the prototype definition of the MSD-HMM
- From: Kwan Lisa <lisakwan1102@xxxxxxxxx>
- Date: Sun, 10 Jun 2012 11:47:51 +0800
- Delivered-to: hts-users@xxxxxxxxxxxxxxx
Hi,
I now understand the difference between putting the 3 features in one stream
and putting them in separate streams.
Putting the 3 features in one stream gives a single 3-dimensional multivariate
normal distribution, whereas putting them in separate streams gives three
1-dimensional normal distributions.
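As a rough illustration of the two layouts, the sketch below evaluates one 3-dimensional Gaussian and three 1-dimensional Gaussians on the same frame. It assumes diagonal covariances, which is what the <VARIANCE> entries in the quoted model indicate. Under that assumption the two log densities coincide, so the practical difference would lie elsewhere, for example in each separate stream carrying its own voiced/unvoiced mixture weights. Only the first mean and variance are taken from the quoted stream 2; every other number and all function names are made up for the example.

```python
import math

def log_normal_1d(x, mean, var):
    """Log density of a 1-dimensional normal distribution."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)

def log_normal_diag(xs, means, variances):
    """Log density of an n-dimensional normal with diagonal covariance."""
    n = len(xs)
    log_det = sum(math.log(v) for v in variances)
    maha = sum((x - m) ** 2 / v for x, m, v in zip(xs, means, variances))
    return -0.5 * (n * math.log(2.0 * math.pi) + log_det + maha)

# One voiced frame: [static, delta, delta-delta] (hypothetical values).
obs = [4.9, 0.01, -0.002]
# First entry of each list comes from the quoted stream 2; the rest are invented.
means = [4.907650, 0.0, 0.0]
variances = [2.217153e-02, 1.0e-03, 2.0e-03]

joint = log_normal_diag(obs, means, variances)
separate = sum(log_normal_1d(x, m, v) for x, m, v in zip(obs, means, variances))

print("one 3-dimensional stream :", joint)
print("three 1-dimensional streams:", separate)
```

The voiced/unvoiced weights are deliberately left out here, so this compares only the Gaussian parts of the two structures.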
2012/6/6 Kwan Lisa <lisakwan1102@xxxxxxxxx>:
> Hi,
>
> Thanks. I will try to do this.
>
> 2012/6/6 那兴宇 <nxy-yzqs@xxxxxxx>:
>> Hi,
>> You can try assigning all the F0 features to one stream.
>> I have tried this and got a slightly lower "RMSE of F0", and I remember there was
>> a paper using this structure (maybe by Heiga?).
>>
>>
>> --
>> Xingyu Na (那兴宇)
>> Beijing Institute of Technology
>> naxy(at)bit.edu.cn
>> asr.naxingyu(at)gmail.com
>> naxingyu at {facebook, twitter, linkedin}
>>
>>
>> At 2012-06-06 02:45:20,"Kwan Lisa" <lisakwan1102@xxxxxxxxx> wrote:
>>>Hi,
>>>
>>>I don't understand why the distributions of the frequency and its first- and
>>>second-order dynamics are placed in streams 2, 3, and 4 respectively, while
>>>the distributions of the spectrum and its first- and second-order dynamics are
>>>all placed in stream 1. What if I place all of the frequency features in stream
>>>2 and treat them as 3-dimensional data, like the spectral data?
>>>
>>>2012/6/5 那兴宇 <nxy-yzqs@xxxxxxx>:
>>>> Hi,
>>>>
>>>> Yes, you are right.
>>>> Streams 3 and 4 are the distributions of the first- and second-order dynamics, respectively.
>>>>
>>>> At 2012-06-05 02:43:21,"Kwan Lisa" <lisakwan1102@xxxxxxxxx> wrote:
>>>>>Hi,
>>>>>
>>>>>I have a question about the definition of the prototype of the MSD-HMM.
>>>>>My monophone.mmf in the model directory looks like this:
>>>>>~h "mo"
>>>>><BEGINHMM>
>>>>><NUMSTATES> 7
>>>>><STATE> 2
>>>>>
>>>>><STREAM> 1
>>>>><MEAN> 120
>>>>>...
>>>>><VARIANCE> 120
>>>>>...
>>>>><GCONST> -7.524412e+02
>>>>>
>>>>><STREAM> 2
>>>>><NUMMIXES> 2
>>>>><MIXTURE> 1 5.459577e-01
>>>>><MEAN> 1
>>>>> 4.907650e+00
>>>>><VARIANCE> 1
>>>>> 2.217153e-02
>>>>><GCONST> -1.971069e+00
>>>>><MIXTURE> 2 4.540337e-01
>>>>><MEAN> 0
>>>>><VARIANCE> 0
>>>>><GCONST> 0.000000e+00
>>>>>
>>>>><STREAM> 3
>>>>>...
>>>>>
>>>>><STREAM> 4
>>>>>...
>>>>>
>>>>>I want to use the monophone model to calculate the KL divergence
>>>>>between the frequency distributions. However, there is both voiced and
>>>>>unvoiced speech. I think streams 2, 3, and 4 are the frequency streams, and in
>>>>>each stream mixture 1 holds the weight and distribution of the voiced
>>>>>speech while mixture 2 holds the weight of the unvoiced
>>>>>speech. Is my understanding correct?
>>>>>
>>>>>--
>>>>>Lisa Kwan
>>>>>lisakwan1102(at)gmail.com
>>>>>Advanced Speech Technology Lab, ASTL
>>>>>
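The numbers in the quoted stream 2 can be sanity-checked against this reading. The sketch below assumes the usual HTK definition GCONST = log((2*pi)^n |Sigma|) and treats mixture 2 (0-dimensional, no mean or variance) as the unvoiced space. The function names and the convention of passing None for an unvoiced frame are choices made for this example, not part of HTS.

```python
import math

# Values copied from stream 2 of the quoted monophone.mmf.
W_VOICED = 5.459577e-01    # <MIXTURE> 1 weight (1-dimensional Gaussian)
W_UNVOICED = 4.540337e-01  # <MIXTURE> 2 weight (0-dimensional space)
MEAN = 4.907650e+00
VAR = 2.217153e-02
GCONST = -1.971069e+00

def gconst_1d(var):
    """log((2*pi)^n * |Sigma|) for n = 1."""
    return math.log(2.0 * math.pi * var)

def stream_likelihood(x):
    """Output probability of this stream; x is None for an unvoiced frame."""
    if x is None:
        # Unvoiced: only the weight of the 0-dimensional space contributes.
        return W_UNVOICED
    # Voiced: weight times the 1-dimensional Gaussian density.
    log_pdf = -0.5 * (GCONST + (x - MEAN) ** 2 / VAR)
    return W_VOICED * math.exp(log_pdf)

recomputed = gconst_1d(VAR)
weight_sum = W_VOICED + W_UNVOICED

print("GCONST in file:", GCONST, " recomputed:", recomputed)
print("sum of mixture weights:", weight_sum)
print("voiced frame at the mean:", stream_likelihood(MEAN))
print("unvoiced frame:", stream_likelihood(None))
```

If the recomputed constant matches the file and the two weights sum to roughly 1, that is consistent with mixture 1 being the voiced Gaussian and mixture 2 being the unvoiced weight.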
--
Lisa Kwan
lisakwan1102(at)gmail.com
Advanced Speech Technology Lab, ASTL
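For the KL-divergence question raised at the start of the thread, one possible approach is sketched below. It is an assumption, not something taken from HTS. Because the voiced space (1-dimensional) and the unvoiced space (0-dimensional) are disjoint, the divergence between two such streams splits into a discrete KL term over the two weights plus the Gaussian KL term weighted by the voiced probability of the first model. Model p uses the stream 2 values quoted in the thread. Model q and all function names are hypothetical.

```python
import math

def kl_gauss_1d(mean_p, var_p, mean_q, var_q):
    """KL(N_p || N_q) for 1-dimensional normal distributions."""
    return 0.5 * (math.log(var_q / var_p)
                  + (var_p + (mean_p - mean_q) ** 2) / var_q
                  - 1.0)

def kl_msd_stream(p, q):
    """KL divergence between two MSD streams with one voiced Gaussian each.

    p and q are dicts with keys w_voiced, w_unvoiced, mean, var.
    Weights are renormalised because printed weights may not sum to exactly 1.
    """
    def norm(m):
        total = m["w_voiced"] + m["w_unvoiced"]
        return m["w_voiced"] / total, m["w_unvoiced"] / total

    pv, pu = norm(p)
    qv, qu = norm(q)
    # Discrete part: voiced/unvoiced decision.
    kl_weights = pv * math.log(pv / qv) + pu * math.log(pu / qu)
    # Continuous part: only counts when p generates a voiced frame.
    kl_voiced = pv * kl_gauss_1d(p["mean"], p["var"], q["mean"], q["var"])
    return kl_weights + kl_voiced

# Stream 2 of the quoted model.
model_p = {"w_voiced": 5.459577e-01, "w_unvoiced": 4.540337e-01,
           "mean": 4.907650e+00, "var": 2.217153e-02}
# Hypothetical second model, invented for the example.
model_q = {"w_voiced": 0.7, "w_unvoiced": 0.3, "mean": 5.0, "var": 0.03}

kl_pq = kl_msd_stream(model_p, model_q)
kl_pp = kl_msd_stream(model_p, model_p)
print("KL(p || q) =", kl_pq)
print("KL(p || p) =", kl_pp)
```

Note that this divergence is not symmetric, and that applying it to streams 3 and 4 would need their own weights, means, and variances, which are elided in the quoted model.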