[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:04020] Re: Building Sinsy voice

Subject: [hts-users:04020] Re: Building Sinsy voice
From: Keiichiro Oura <uratec@xxxxxxxxxxxxxxx>
Date: Mon, 3 Mar 2014 00:17:28 +0900
Cc: Keiichiro Oura <uratec@xxxxxxxxxxxx>
Delivered-to: hts-users@xxxxxxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:cc:content-type; bh=lQx9x7WgYLaDYw1eqrbiwbd3E/dsZRfX0XFW9juFE+I=; b=mP63yvv0ywenxqieU6yqagX2Y19Ua+93HPdizt6pbA1s3a16qzYMHONOorKWdZyrv1 4TJofIgjLVcjF3/p/dig6KevREfLBO2BB0ie4DCOOj67qtfVSipMQl0XEQlTFCGqs5au eAyKtyVi5x6WTvKYUGzmM1+nfyjqdCOULrfD2ZHTUmU38TwTQGnrkFytwUxDjTDcFZST SqrJDgZE+MJ3OReY5kj6a4UfONp9Aih1+Ams+zEvTPBBK8X7+aTYHggP0X/5JZk4gLqZ +0V1GawmXco7lkaRZZSaPKgR3vXDlqDJ/oqL5E5pW53nzixsWEpLmi/UAT432xAxcgsa TvZQ==

Hi,

> I was wondering; is building the  "HTS-demo_NIT-SONG070-F001" training demo
> is supposed to result in same voice as the
> "hts_voice_nitech_jp_song070_f001-0.90" downloadable pre-build binary for
> Sinsy?

No.
One of the main differences is the size of training data.
Only 31 songs (32min., *public domain*) are included in the demo scripts.
On the other hand, the HTS voice of http://sinsy.sourceforge.net is
trained by using 70 songs (72min.).

Regards,
Keiichiro Oura


2014-03-02 23:22 GMT+09:00 Merlijn Blaauw <merlijn.blaauw@xxxxxxx>:
> Hello,
>
> I was wondering; is building the  "HTS-demo_NIT-SONG070-F001" training demo
> is supposed to result in same voice as the
> "hts_voice_nitech_jp_song070_f001-0.90" downloadable pre-build binary for
> Sinsy?
>
> I tried to build the demo, but the file size of the resulting .htsvoice file
> and synthesis results are very different from the pre-build voice.
> In particular pitch (and breath sounds) seems to be modeled very poorly;
> timbre seems more or less ok (although it is kind of hard to tell).
> The "gen" phrases synthesized as part of the training script also do not
> sound very good.
>
> I'm using the following software: HTS 2.3alpha, HTS-demo_NIT-SONG070-F001
> from HTS 2.3alpha (slightly modified to fix raw2wav sample rate issue), SPTK
> 3.7, sinsy 0.90, hts_engine 1.08 .
>
> Thank you very much.
> Merlijn
>

Follow-Ups
: [hts-users:04021] Re: Building Sinsy voice, Suman Senapati; [hts-users:04022] Re: Building Sinsy voice, Merlijn Blaauw

References
: [hts-users:04019] Building Sinsy voice, Merlijn Blaauw

Prev by Subject: [hts-users:04019] Building Sinsy voice
Next by Subject: [hts-users:04021] Re: Building Sinsy voice
Previous by thread: [hts-users:04019] Building Sinsy voice
Next by thread: [hts-users:04021] Re: Building Sinsy voice