[hts-users:04593] Re: Description of DNN Predicted output


Hi,

The output of the DNN consists of the following features:

35: MGC
35: 1st-order derivatives of MGC
35: 2nd-order derivatives of MGC
1: voiced/unvoiced symbol
1: log f0
1: 1st-order derivative of log f0
1: 2nd-order derivative of log f0

The position of log f0 is 107 (counting from 1): it follows the
105 MGC values (3 x 35) and the voiced/unvoiced symbol.
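
For reference, the layout listed above can be sketched in Python as
follows. This is only an illustration, not code from the HTS demo: it
assumes the features are stored in the order listed, and the names
(LAYOUT, build_offsets, split_frame) are made up for this example. The
offsets are 0-based, so position 107 corresponds to index 106.

```python
# Illustrative sketch only: assumes the 109 outputs are stored in the
# order given in this message. All names here are hypothetical.
MGC_DIM = 35

LAYOUT = [
    ("mgc", MGC_DIM),         # static MGC
    ("mgc_delta", MGC_DIM),   # 1st-order derivatives of MGC
    ("mgc_delta2", MGC_DIM),  # 2nd-order derivatives of MGC
    ("vuv", 1),               # voiced/unvoiced symbol
    ("lf0", 1),               # log f0
    ("lf0_delta", 1),         # 1st-order derivative of log f0
    ("lf0_delta2", 1),        # 2nd-order derivative of log f0
]


def build_offsets(layout):
    """Return ({name: (start, end)}, total_dim); 0-based, end-exclusive."""
    offsets = {}
    start = 0
    for name, dim in layout:
        offsets[name] = (start, start + dim)
        start += dim
    return offsets, start


OFFSETS, TOTAL_DIM = build_offsets(LAYOUT)


def split_frame(frame):
    """Split one DNN output frame into its named feature streams."""
    if len(frame) != TOTAL_DIM:
        raise ValueError(f"expected {TOTAL_DIM} values, got {len(frame)}")
    return {name: frame[s:e] for name, (s, e) in OFFSETS.items()}


# Dummy frame whose values equal their own 1-based position.
frame = [float(i) for i in range(1, TOTAL_DIM + 1)]
parts = split_frame(frame)

print(TOTAL_DIM)               # 109
print(OFFSETS["lf0"][0] + 1)   # 107 (1-based position of log f0)
print(parts["lf0"])            # [107.0]
```

With a real frame of predicted values, parts["lf0"][0] would be the
log f0 value and parts["vuv"][0] the voiced/unvoiced symbol.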


2018-01-31 17:23 GMT+09:00 Aayush Kumar Tyagi <aayush16081@xxxxxxxxxxx>:
> Hi,
>
> I am using DNN-based speech synthesis (USEDNN=1).
> I guess the output of DNN is a combination of spectral and excitation
> parameters.
> The number of output units of DNN is 109.
> Can someone help me understand what these 109 points correspond to? Like how
> many of these are MGC points and which one is log f0?
> I am particularly interested in the position of the fundamental frequency in
> the predicted output.
>
> Please correct me if I am completely wrong.
>
> Thanks a lot
> Aayush Tyagi
>



-- 
Nagoya Institute of Technology
Tokuda and Nankaku Laboratory
Takenori Yoshimura
takenori@xxxxxxxxxxxxxxx

References
[hts-users:04592] Description of DNN Predicted output, Aayush Kumar Tyagi