I saw a few posts on the mailing-list about data normalization, I would like to know what are the best practices I’m starting with 16kHz samples that I upsampled to 48kHz using SoX, which seems to renormalize the gain and cause clipping later Right now I’m not using
after this I still had many files skipped during feature building because
so I added a gain normalization which helps with this
but I still get a lot of clipping warning during the synthesis phase after training Are there suggestions on a proper way to normalize data, both which tools to use (sox or wav2raw) and which options or target gain? Should we also update the check rule to
( <= and not <) to allow for all normal short values, or is there a reason it was using |