[hts-users:04258] Re: generating hts voice (.htsvoice) for hts

[hts-users:04258] Re: generating hts voice (.htsvoice) for hts_engine in STRAIGHT demo

Subject: [hts-users:04258] Re: generating hts voice (.htsvoice) for hts_engine in STRAIGHT demo

From: Xingyu Na <asr.naxingyu@gmail.com>

Date: Tue, 21 Apr 2015 09:52:39 +0800

Delivered-to: hts-users@sp.nitech.ac.jp

Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type; bh=1SeEXdq4YtkuDABshDiWNayJ8oGi1XeTdvg3ygKtxIM=; b=OAiflQ1QTGakPpju4HIWQ1niD3WVLqCt+CmW8YN+XSTW6hAqab+yGy6vVQr5sUz4eX WhUvluvLCt4hceJhbKFslK4pTczagmP+mkvWchN0cLSEFxWkTEtom7qeKsBoU2nOt8g0 HOeqwDlvY+MqwovpgY6i3T2uJn6QiQj9dGJwzhwSfHP1pj5KhcYB5PJ7H0CeBkLPS7A6 zllOAhIUnehy415feuIR1DJdqZqr3st08PTMbXafXXIwjJRSjpTcNMrxRPE4NDiD/3Pj 9ayPc2WiEi8TT0H2H7YGNIgvcQ8NwoOJdyuyk2Dk7If2bvn07/6u8n0k5tMrgQZ/Muef fNbQ==

In-reply-to: <CADsWUxaC4mxFamidhN3JFboUOO+_rJ=K7Mp1E9EX4Pu1wE8iCA@mail.gmail.com>

List-help: <mailto:hts-users-ctl@sp.nitech.ac.jp?body=help>

List-id: hts-users.sp.nitech.ac.jp

List-owner: <mailto:hts-users-admin@sp.nitech.ac.jp>

List-post: <mailto:hts-users@sp.nitech.ac.jp>

List-software: fml [fml 4.0 STABLE (20040215/4.0.4_BETA)]

List-unsubscribe: <mailto:hts-users-ctl@sp.nitech.ac.jp?body=unsubscribe>

References: <CADsWUxbGohPyqqbvFjLCz5d862dUc1FV7qMUaonf1NLfn1HaZw@mail.gmail.com> <m28udn8cer.fsf@jad.coli.uni-saarland.de> <CADsWUxaC4mxFamidhN3JFboUOO+_rJ=K7Mp1E9EX4Pu1wE8iCA@mail.gmail.com>

Reply-to: hts-users@sp.nitech.ac.jp

User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.6.0

Hi,

You are asking a question that most of the engineers in this community would like to know an answer to.
It is important to know about the roles of different tools that compose the whole HTS package.
The HTS toolkit itself, provides acoustic model training from feature and label, and feature generation from trained model and to-be-uttered label, aka, HTS provides feature-to-model (trainer), and model-to-feature (generator).
How the features are estimated (encoded), and how to reconstruct waveform from generated features (decoded), are the responsibilities of the vocoder.

As Sebastien said, hts_engine doesn't support STRAIGHT. hts_engine is a lite version the synthsiser, meaning that it combines generator and decoder. The 'lite' comes in two ways, that 1) the simplest feature generation algorithm is applied, and 2) a fast decoder is applied, i.e the MLSA based filter, which works with binary excitation signal. "doesn't support STRAIGHT" means that the STRAIGHT decoder is not part of hts_engine. But if you can find a way to decode STRAIGHT spectrum using MLSA based filter, you can "generate hts voice using HTS-demo-STRAIGHT".

Last time I checked (2 years ago...), the limitation on the number of streams is because hts_engine is suppose to work with only mgc, lf0, and lpf. There are lots of work attempting to "avoid vocoder noise". If you works on more then 3 streams using hts_engine, I suggest you use my single stream generator (http://github.com/naxingyu/StreamGenerator. you can also find it on the HTS entensions page). It generate a single stream of feature, so that you can use more streams and cooperate with your own decoder.

FYI, this topic has been discussed extensively in this mailing list. Checking out previous threads would normally help.

Best,
Xingyu

On 04/20/2015 07:34 PM, payman shaykhmehdi wrote:

Dear Sébastien,

So is there any way to use hts_engine and avoid vocoder noise?

Best Regards

Payman Shaykhmehdi

On Mon, Apr 20, 2015 at 12:22 PM, Sébastien Le Maguer <slemaguer@coli.uni-saarland.de> wrote:

Hello,

hts_engine doesn't support STRAIGHT.

Kind regards,
Sébastien

payman shaykhmehdi writes:

> Hi all,
>
> is there any way to generate hts voice using HTS-demo-STRAIGHT???
>
> in the /scripts/config.pm , converting mmfs to hts_engine format and
> synthesizing using hts_engine has been disabled, and when i enable this
> options, hts_engine says that the number of streams must be 2 or 3!!!
>
> what is the problem??

--
Dr. Sébastien Le Maguer
Postdoctorate researcher

Saarland University
Campus C7.4 - room 2.03
D-66123 Saarbrücken
Germany

phone : +49-681-302-70030
Mail: slemaguer@coli.uni-saarland.de
website : http://www.coli.uni-saarland.de/~slemaguer/