[hts-users:01797] F0 modelling
- Subject: [hts-users:01797] F0 modelling
- From: "Javi Palenzuela" <javi.pa.cam@xxxxxxxxxxxxxx>
- Date: Wed, 26 Nov 2008 10:28:35 +0000
Hi all,
After reading the papers and the mailing list and looking at the code,
I still don't fully understand how the voiced/unvoiced decision is
trained in practice using MSD.
I see that the decision is made using the mixture weight of the first
stream. In the data, LZERO is used to flag the unvoiced regions.
But how is the weight updated to reflect this?
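As a minimal sketch of the two points above (not HTS code): unvoiced
frames are flagged with LZERO in the log-F0 data, and the
voiced/unvoiced decision is read off a weight in the first stream. The
function names, the notion of a single "voiced weight", and the 0.5
threshold are assumptions for illustration only; LZERO is taken to be
HTK's value of -1.0E10. The sketch deliberately does not show how the
weight is re-estimated, since that is the open question.

```python
# Illustrative only: hypothetical helpers, not part of HTS or HTK.

LZERO = -1.0e10  # HTK's "log zero" constant, used to flag unvoiced frames


def is_unvoiced(lf0_value, lzero=LZERO):
    """A frame is treated as unvoiced when its log-F0 is at or below LZERO."""
    return lf0_value <= lzero


def split_frames(lf0):
    """Return the voiced log-F0 values and the number of unvoiced frames."""
    voiced = [x for x in lf0 if not is_unvoiced(x)]
    n_unvoiced = len(lf0) - len(voiced)
    return voiced, n_unvoiced


def decide_voiced(voiced_weight, threshold=0.5):
    """Assumed decision rule: voiced if the voiced-space weight exceeds
    the threshold (0.5 is an assumption, not a value from the source)."""
    return voiced_weight > threshold
```

For example, split_frames([5.0, LZERO, 5.2]) separates the two voiced
values from the one flagged frame.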
There's a message about "SpaceOrder" in the mailing list, but I'd
appreciate a few more details.
Thank you
- Follow-Ups
- [hts-users:01798] Re: F0 modelling, Heiga Zen (Byung Ha CHUN)