[hts-users:03812] Re: HTS Multi pulse excitation

Subject: [hts-users:03812] Re: HTS Multi pulse excitation

Date: Fri, 5 Jul 2013 00:53:35 +0800

Delivered-to: hts-users@xxxxxxxxxxxxxxx

Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:references:from:content-type:x-mailer:in-reply-to :message-id:date:to:content-transfer-encoding:mime-version; bh=yszukmkRRIkcoc/ddUT+TguPZOcicE89v1ydn6s6DG8=; b=EAgj3nrwB/7y8vuCkzIRDCyGf5iFGXUmqgseKGLTSKhic18B0G1QjCRd6G9mrWx7lL 2pLopZpdzio0jVVPhrHIEFtTiTRDydmPudXa44uHdn9PoUAdUF/yOBPYhNExy0ThFgVj +6dSoIpGU9i9I2FIBsSBb2GP5k+xplLM4mqVxWu9i524dZ4bTf4utjB65nptqFH5KFe6 U9cbvMIzT9Gi3tLvVLbqFNmd0iyNDBQd6O/X0caFkRszzEbhQCit9yljh8zMYno+W69v GjXqsXMZxdaXto+wUynXnnFzhX+Bme6ILjNtBPUQSuLOQjE5KcT9BsLerTS6JNcCihko JBmA==

I worked on a mandarin TTS system using HTS several years ago, it was based on MBE codec, multiband- excitation.

The produced voices sounded a little smear, because the lsp order was too low(10?).

Finding accurate f0 f1 f2... is very difficult, I do believe MBE is a better choice.

在 2013-7-4，21:43，Fatih Kıralioğlu <fatih.kiralioglu@xxxxxxxxxxxxx> 写道：

Hi,
Currently HTS employs fundamental frequency (F0) as an excitation parameter.
I wonder if has there been a study or publication on also using higher level frequencies (F1, F2, ...) in order to model voiced excitation more effectively.
Thanks in advance.

<image001.jpg>

Fatih Kıralioğlu

İTÜ Ayazağa Kampüsü Koru Yolu Arı-2 Teknokent Binası A Blok No:A4-4 34469 Maslak-İstanbul

<image002.jpg>

E-posta : fatih.kiralioglu@xxxxxxxxxxxxx Tel : +90 (212) 286 25 45 / 164 Faks : +90 (212) 286 25 47