[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:04432] Re: HTS demo 'alice' labels


Thanks all for your help.

I will do my best to finalize the process!

On Wed, Dec 7, 2016 at 9:24 PM, Rasmus Dall <R.Dall@xxxxxxxxxxxx> wrote:
How do I generate *.lab* and/or *.utt* files similar to those in the HTS
Demos?  Do I have to use text2utt?

You need to produce them yourself. Festival's text2utt works for English but won't work for Chichewa presumably. I recommend producing .lab files as the .utt file format from festival is only used by HTS to create .lab files.

If so, then was the point going through the process with HTK (I am new to
NLP, and was advised to do this first step by my Professors - I was told
HTS takes input *from* HTK)?

Well that gives you time-aligned monophone labels which you can use for your full-context labelling. HTS is a patch on top of HTK so shares labelling format with HTK and many other things.

I am new to NLP

Well, I recommend you take a look at Jurafsky and Martins's "Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition" and afterwards Paul Taylor's "Text-to-Speech Synthesis" to get the fundamentals down :-)

Good luck with Chichewa!

- Rasmus

Quoting Jeremiah Chienda <jeremiahchienda@xxxxxxxxx> on Wed, 7 Dec 2016 03:48:03 +0900:

Hey guys.

I am trying to train a voice for my native Malawian language of Chichewa.
I've managed to achieve forced alignment (aligned.mlf) of a
monophone models using HTK.  My question is:

How do I generate *.lab* and/or *.utt* files similar to those in the HTS
Demos?  Do I have to use text2utt?
If so, then was the point going through the process with HTK (I am new to
NLP, and was advised to do this first step by my Professors - I was told
HTS takes input *from* HTK)?

How do I then use the generated (hopefully) synthesized WAVs (and .htsvoice
file) to make a TTS sample (either with Festival, Festlight or any other
system?)

Your help will be very much appreciated.

Attached is my aligned file

Regards
Jeremiah



On Wed, Dec 7, 2016 at 3:16 AM, Rasmus Dall <R.Dall@xxxxxxxxxxxx> wrote:

There may have been ever so slight changes since they were generated 10
years ago.

I'd recommend regenerating them yourself using the same front-end setup
(even if that is also festival) you use for your training data!

- Rasmus


Quoting "Heiga ZEN (Byung Ha CHUN)" <heigazen@xxxxxxxxxx> on Tue, 06 Dec
2016 17:12:22 +0000:

AFAIK there was not change at that time.

Heiga


On Tue, Dec 6, 2016 at 4:32 PM Erica Cooper <ecooper@xxxxxxxxxxxxxxx>
wrote:

Thanks, Heiga!  so the Festival HTS voice was just using the default US
English frontend from Festival, or do you know of any changes that were
made?

On Mon, Dec 5, 2016 at 2:29 PM, Heiga ZEN (Byung Ha CHUN) <
heigazen@xxxxxxxxxx> wrote:

I think I generated these labels about 10 years ago...

IIRC, they were directly generated from festival HTS voice.  At that
time,
festival HTS voices dumped labels to a tmp file then ran hts engine to
synthesize speech.  I think I copied these tmp label files.

I am not sure whether it still runs in this way.  It may be fully
integrated and no tmp file is created.

Heiga

2016年12月5日(月) 19:12 Erica Cooper <ecooper@xxxxxxxxxxxxxxx>:

Hi HTS-users,

I was wondering if anyone knew anything about how the 'alice' test labels
in the demo were created.  Searching through the list archives it appears
that Festival was used, however I have noticed on a few occasions that
with
some different voices trained on different types of data, .wav files
synthesized from the 'alice' labels just tend to sound better than
synthesis from our own test labels we've created.  We are using Festival
'text2utt' to get utts out of a txt.done.data-formatted file, and then
using the steps in the data/Makefile to convert from utt->lab.  Were the
'alice' labels perhaps created in some different way, or using some
different settings in Festival?

Thanks,
Erica

--
---------------------------------------
Heiga ZEN (in Japanese)
Byung Ha CHUN (in Korean)
<heigazen@xxxxxxxxxx>


--

---------------------------------------
Heiga ZEN (in Japanese)
Byung Ha CHUN (in Korean)
<heigazen@xxxxxxxxxx>




--
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.








--
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.




References
[hts-users:04425] HTS demo 'alice' labels, Erica Cooper
[hts-users:04426] Re: HTS demo 'alice' labels, Heiga ZEN (Byung Ha CHUN)
[hts-users:04427] Re: HTS demo 'alice' labels, Erica Cooper
[hts-users:04428] Re: HTS demo 'alice' labels, Heiga ZEN (Byung Ha CHUN)
[hts-users:04429] Re: HTS demo 'alice' labels, Rasmus Dall
[hts-users:04430] Re: HTS demo 'alice' labels, Jeremiah Chienda
[hts-users:04431] Re: HTS demo 'alice' labels, Rasmus Dall