Hi,
I want to do speech synthesis for the Urdu language using HTS. I have run the demo script for English (HTS-demo_CMU-ARCTIC-SLT.tar.bz2) and it runs fine. I have also explored all the files in the data directory of that demo. It contains the following types of files:
- questions: the list of context and property questions used for tree-based context clustering
- labels: phone-level label files for each xxx.raw, with and without context and properties
- raw: the recordings, I think
- txt: the text of the corresponding recordings
- utts: utterance files
What I want to ask is which of these types of data I need to prepare for the Urdu language. I have a speech corpus and the corresponding text labels. Kindly also tell me how to prepare this data.