[hts-users:04509] Re: Speech Synthesis for New Language

Subject: [hts-users:04509] Re: Speech Synthesis for New Language

From: Quang Bui Tan <langmaninternet@xxxxxxxxx>

Date: Sat, 18 Mar 2017 22:26:21 +0700

Authentication-results: mailgw.mains.nitech.ac.jp; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=b1mC6Ehd

Delivered-to: hts-users@xxxxxxxxxxxxxxx

Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=FLzGhKvjlpMd128RJqK1cJ2ubfSSSaXTFfT1QdiVO9A=; b=b1mC6EhdX8280hLHAtAPZdLVWsvH5p3mjyNlcCdhOHkBjZH0+mIT6r4/oTDST2iV5S ygPuQt3JiujcfKLynDJDh5nUfjOqSjLf5PlBBNTldcr87/KlTCFMFAK/nW6n/xHJhWUo JFTqsS3YGb1d8vLH1F5f+vnIxJgaauKD4IZZ76m6B4Y50e1drJdyRdq2HZxJGCflcVcP +qGwHZX10jLuYr6LHsyQjoEK4VxXNu4fve4Jn9f3TzVnDzpn0xm9ZOMYRH+bAx9J60Ej O+dlu4+oSKNYJtizLtdQEsaXjoFNHTjVIQA3+Ib/D0yR8JeH6YV0ZR+IjuLXHPeg2O1e m4aw==

Prepare for you:

1 . Recording data

2. Language model :

- lexicon : word -> phonemes list , to create mono file

- question

- tool, that convert txt to full-context label

Note : raw file is data of wav file

wave file == header + raw

you can view raw file with https://sourceforge.net/projects/wavesurfer/ (in demo raw file, choice Sample = 48000Hz, SampleEncoding Lin16, chanel Mono

and you can hear that)

Final : questions + labels/full + labels/mono + raw

utt is optional

2017-03-18 13:49 GMT+07:00 Atlas Khan <atlaskhan90@xxxxxxxxx>:

Hi,

I want to do speech synthesis using HTS for Urdu Language. I have ran Demo Script for English (HTS-demo_CMU-ARCTIC-SLT.tar.bz2) and it is run also ning fine. I also have explored all files in data directory of above demo. There are following different type of files in data directory
questions: list of all context and properties format for tree-based context clustering.
labels: phone labeled file for xxx.raw with/without their context and properties
raw: I think they are recordings
txt: Text of corresponding recordings
utts: Utterance files
What I wanted to ask is which of these types of data I needed to prepare for Urdu language. I have speech corpus and their text labels. Kindly also tell me how to prepare this data.