Sorry for my little knowledge, but I would like to know, why were used 25 coefficients mel-ceptral, and 1(one) coefficient f0(pitch), for each frame in HTS-demo?
I am doing this question because I read an Raniery/Heiga´s article suggesting 13 coefficients mel-cepstral and 13 for f0.