[hts-users:04540] Re: Preparing data for HTS

From: Nickolay V.Shmyrev <nshmyrev@xxxxxxxxx>

Date: Sun, 16 Jul 2017 12:31:52 +0300

Cc: atlaskhan90@xxxxxxxxx

This is rarely mentioned, but you MUST first build a Festival unit selection voice for your language from your data. The reasons are simple:

1) A unit selection voice helps you debug the phonetic transcription and segmentation (very important for voice quality). With a unit selection voice you can trace pronunciation issues directly to the source units, so you can figure out what is wrong in your training data annotation. With an HTS voice you will never know what was wrong; the quality will just be a bit worse.

2) Once you have a unit selection voice, the feature dump is simple.

The Festival documentation on building unit selection voices is available at http://festvox.org/bsv; it is not very complex.
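To give a rough idea of why point 2 holds: once the unit selection build has produced phone segmentation, the mono labels are mostly a format conversion. Below is a minimal Python sketch, not part of Festival or HTS. It assumes the segment labels are in the Festvox-style text format (a header terminated by a `#` line, then `end_time colour phone` per line, times in seconds) and writes HTK-style `start end phone` lines with times in 100 ns units. The function name `xlabel_to_mono` and the sample phones are made up for illustration.

```python
def xlabel_to_mono(lines):
    """Convert Festvox-style label lines to HTK-style mono label lines.

    Assumed input: header lines, then a line containing only '#',
    then 'end_time_in_seconds colour phone' on each line.
    Output: 'start end phone' with times in 100 ns units.
    """
    out = []
    start = 0          # start time of the current segment, in 100 ns units
    in_body = False    # becomes True once the '#' header terminator is seen
    for line in lines:
        line = line.strip()
        if not in_body:
            if line == "#":
                in_body = True
            continue
        if not line:
            continue
        end_sec, _colour, phone = line.split()[:3]
        end = int(round(float(end_sec) * 1e7))  # seconds -> 100 ns units
        out.append(f"{start} {end} {phone}")
        start = end    # the next segment starts where this one ends
    return out


# Hypothetical three-segment utterance
example = [
    "separator ;",
    "nfields 1",
    "#",
    "0.2300 125 pau",
    "0.3150 125 k",
    "0.4800 125 a",
]
mono = xlabel_to_mono(example)
print("\n".join(mono))
```

The full-context labels need more than this (the linguistic features come from the Festival utterance structures), but the timing part comes from the same segmentation, which is why it pays to get it right in the unit selection voice first.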

16.07.2017, 10:15, "Atlas Khan" <atlaskhan90@xxxxxxxxx>:

Hi,

I am working on speech synthesis for a language that does not have any support in Festival. It has different phonemes and a different lexicon than English. I have recordings in raw format. As far as I know, I need the following types of data for speech synthesis with HTS:

questions
labels (full and mono)
utt

I want to ask how I can prepare questions and labels for a language that has a different lexicon and phonemes than English. If I need Festival to generate that data, then how can I do it for a language that Festival does not support?

Regards,

Atlas Khan