Hi All,
hts_engine_API has an option for time-scale modification: it increases or decreases the predicted duration of each phoneme and then synthesizes the speech. Would it not be possible to do pitch scaling in a similar way, by altering the logf0 values? Is my thinking algorithmically wrong?
Could you please suggest how to do pitch scaling using hts_engine_API?
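
For concreteness, the idea can be sketched as below. This is only an illustration of the arithmetic, not hts_engine_API code: adding a constant to log F0 multiplies F0 by a fixed ratio, so a shift of n semitones corresponds to an offset of n * ln(2) / 12. The function name shift_log_f0 is hypothetical, and the sentinel used for unvoiced frames (-1.0e10) is an assumption about how the logf0 stream marks them.

```python
import math

# Assumed marker for unvoiced frames in the logf0 sequence (an assumption,
# adjust to whatever value the actual data uses).
UNVOICED = -1.0e10


def shift_log_f0(lf0, semitones):
    """Return a copy of lf0 with voiced frames shifted by `semitones`.

    Adding a constant in the log domain scales F0 by a constant factor:
    exp(lf0 + offset) = exp(lf0) * 2 ** (semitones / 12).
    """
    offset = semitones * math.log(2.0) / 12.0
    # Leave unvoiced frames untouched; shift only voiced ones.
    return [v if v <= UNVOICED else v + offset for v in lf0]


if __name__ == "__main__":
    frames = [math.log(100.0), UNVOICED, math.log(120.0)]
    shifted = shift_log_f0(frames, 12.0)  # one octave up
    for before, after in zip(frames, shifted):
        if after <= UNVOICED:
            print("unvoiced")
        else:
            print(round(math.exp(before), 3), "->", round(math.exp(after), 3))
```

Durations are not touched here, so under these assumptions only the pitch contour would change, not the timing.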