Dear
I have run the HTS demo successfully on my own data (raw and label files, around 500 files). Everything worked fine. However, when the synthesized file is around 3 seconds long, I noticed that the first second sounds good but the last second sounds noisy (as if the speech were vibrating). When the synthesized file is around 1 second long, it sounds good. What do you recommend?

Best regards,
Sobh