
[hts-users:03910] Re: Clustering all phones


Thank you so much!

> Date: Mon, 11 Nov 2013 11:28:36 +0900
> From: uratec@xxxxxxxxxxxxxxx
> Subject: [hts-users:03906] Re: Clustering all phones
> To: hts-users@xxxxxxxxxxxxxxx
> CC: uratec@xxxxxxxxxxxx
>
> Hi,
>
> As you said, in speech recognition the "current phoneme identity" context
> is usually treated as a special context.
> In the HTS-demo scripts, however, the "current phoneme identity" context is
> treated as just one of the many context factors considered during clustering.
> As a result, similar phonemes can share some leaves.
>
> Regards,
> Keiichiro Oura
>
>
>
> 2013/11/11 Ibrahim Sobh <im_sobh@xxxxxxxxxxx>:
> > Hi,
> >
> > Regarding clustering:
> >
> > Why do we use:
> > TB 0.00 mgc_s2_ {*.state[2].stream[1-1]} -->> state 2 from ALL phones
> >
> > and not:
> > TB 0.00 mgc_s2_ {phone.state[2].stream[1-1]} -->> state 2 from a certain
> > phone
> >
> > The reason could be that we have many context factors. However, this will
> > result in states from totally different phones being clustered together! So
> > how does this really work?
> >
> > Note: in ASR (HTK) we usually use "phone1.state[2], phone2.state[2] ....."
> > for all phones.
> >
> > Regards
> > Sobh
>
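
To make the two setups discussed above concrete, here is a minimal Python
sketch that prints the HHEd commands for each. It is only an illustration
under stated assumptions: the helper names, the per-phone tree names
(mgc_s2_a_ and so on), the "C-<phone>" question names, and the phone list are
hypothetical, and the "*-<phone>+*" pattern assumes full-context labels in
which the current phoneme sits between "-" and "+". The pooled command is the
one quoted in the thread; the per-phone item list follows the usual HTK
identList style and has not been checked against a particular HTS release.

```python
def pooled_tb(threshold, stream, state, stream_range):
    """Build one TB command that pools the given state of ALL models.

    With this form the current phoneme is not fixed in advance; it can only
    influence the tree through questions such as those built below.
    """
    return (f'TB {threshold:.2f} {stream}_s{state}_ '
            f'{{*.state[{state}].stream[{stream_range}]}}')


def per_phone_tb(threshold, stream, state, stream_range, phones):
    """Build one TB command per phone (the ASR-style setup from the question).

    Tree names and the "*-<phone>+*" pattern are illustrative assumptions.
    """
    return [
        f'TB {threshold:.2f} {stream}_s{state}_{p}_ '
        f'{{("*-{p}+*").state[{state}].stream[{stream_range}]}}'
        for p in phones
    ]


def current_phone_question(phone):
    """Build a QS command asking "is the current phoneme <phone>?".

    In the pooled setup this is just one question among many, so the tree may
    or may not split on it; that is why similar phonemes can share leaves.
    """
    return f'QS "C-{phone}" {{*-{phone}+*}}'


if __name__ == "__main__":
    phones = ["a", "i"]  # hypothetical phone set
    print(pooled_tb(0.0, "mgc", 2, "1-1"))
    for line in per_phone_tb(0.0, "mgc", 2, "1-1", phones):
        print(line)
    for p in phones:
        print(current_phone_question(p))
```

With the pooled form there is one tree per state and stream, and the data
decide whether a current-phoneme question is worth asking at each node. With
the per-phone form there is one tree per phone, so states of different phones
can never end up in the same leaf.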
