
[hts-users:03906] Re: Clustering all phones


Hi,

As you said, in speech recognition the "current phoneme identity" context
is usually treated as a special context: states are clustered separately
for each phone.
In the HTS-demo scripts, however, the "current phoneme identity" context is
just one of the many context factors that the tree questions can ask about.
As a result, similar phonemes can share some leaves.
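
To illustrate the point above, here is a minimal Python sketch of a tree
in which the current phoneme is only one question among others. Everything
in it is an assumption made for illustration: the simplified label layout
(left^prev-current+next=right), the two questions, the toy tree, and the
leaf names are hypothetical and are not taken from the HTS-demo question
files. Python's fnmatch wildcards stand in for the pattern matching used
in clustering questions.

```python
from fnmatch import fnmatchcase

# Hypothetical questions, written as wildcard patterns over full-context labels.
# Assumed (simplified) label layout: left^prev-current+next=right
QUESTIONS = {
    "C-Vowel": ["*-a+*", "*-i+*", "*-o+*"],  # is the CURRENT phone a vowel?
    "L-Silence": ["*^sil-*"],                # is the PREVIOUS phone silence?
}


def answers(label, patterns):
    """Return True if the label matches any pattern of a question."""
    return any(fnmatchcase(label, p) for p in patterns)


def leaf_for(label):
    """Walk a toy tree for state 2 of the mgc stream and return a leaf name."""
    if answers(label, QUESTIONS["C-Vowel"]):
        if answers(label, QUESTIONS["L-Silence"]):
            return "mgc_s2_1"
        return "mgc_s2_2"
    return "mgc_s2_3"


if __name__ == "__main__":
    for label in ["x^sil-a+k=i", "x^sil-o+k=i", "a^k-i+t=o", "i^t-k+a=x"]:
        print(label, "->", leaf_for(label))
```

In this toy tree, /a/ and /o/ after silence end up in the same leaf because
no question ever separates them, while /i/ after /k/ falls into a different
leaf. A real tree would only keep such phones together if the data did not
justify a further split on the current phoneme.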

Regards,
Keiichiro Oura



2013/11/11 Ibrahim Sobh <im_sobh@xxxxxxxxxxx>:
> Hi,
>
> Regarding clustering:
>
> Why do we use:
> TB 0.00 mgc_s2_  {*.state[2].stream[1-1]}   -->> state 2 from ALL phones
>
> and not:
> TB 0.00 mgc_s2_  {phone.state[2].stream[1-1]}   -->> state 2 from one particular phone
>
> The reason could be that we have many context factors. However, this will
> result in states from totally different phones being clustered together.
> So how does this really work?
>
> Note: in ASR (HTK) we usually use "phone1.state[2], phone2.state[2], ..."
> for all phones.
>
> Regards
> Sobh
