Hi,

Currently HTS employs fundamental frequency (F0) as an excitation parameter. I wonder whether there has been a study or publication on also using higher-level frequencies (F1, F2, ...) to model voiced excitation more effectively.

Thanks in advance.