[Subject Prev][Subject Next][Thread Prev][Thread Next][Date Index][Thread Index]

[hts-users:00055] FT: Festival 2.0 (well nearly)


I am forwarding an HTS-related message sent to festival-talk.


Subject: FT: Festival 2.0 (well nearly)
From: Rob Clark <robert@xxxxxxxxxxxxx>
To: festival-announce@xxxxxxxxxxxxx, festival-talk@xxxxxxxxxxxxx
Date: Wed, 14 Jul 2004 15:43:04 +0100 (BST)

message from Rob Clark <robert@xxxxxxxxxxxxx> to festival-talk
= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =

In preperation for the the forthcoming release of Festival 2.0.
Edinburgh and CMU are proud to annouce:

        The Festival Speech Synthesis System version 1.95-beta
         and Edinburgh Speech Tools Library version 1.2.95
                            14th Jul 2004


Festival offers a general framework for building speech synthesis
systems as well as including examples of various modules.  As a whole
it offers full text to speech through a number APIs: from shell level,
though a Scheme command interpreter, as a C++ library, from Java, and
an Emacs interface.  Festival is multi-lingual (currently English
(British and American), and Spanish) though English is the most
advanced.  Other groups release new languages for the system.
And full tools and documentation for build new voices are
available through Carnegie Mellon's FestVox project (http://festvox.org)

The system is written in C++ and uses the Edinburgh Speech Tools
Library for low level architecture and has a Scheme (SIOD) based
command interpreter for control.  Documentation is given in the FSF
texinfo format which can generate, a printed manual, info files and
HTML.
                                                                                
Festival is free software.  Festival and the speech tools are
distributed under an X11-type licence allowing unrestricted commercial
and non-commercial use alike.
                                                                                
This distribution includes:
   * Full English (British and American English) text to speech
   * Full C++ source for modules, SIOD interpreter, and Scheme library
   * Lexicon based on CMULEX and OALD (OALD is restricted to 
non-commercial
     use only)
   * Edinburgh Speech Tools, low level C++ library
   * 4 US English voices (HTS ARCTIC), 2 male, 1 female and one "Scottish"
   * 2 US English voices (Multisyn ARCTIC)
   * British English Male (for residual LPC resynthesis 8k and 16k 
versions)
   * 2 American English Male (for residual LPC resynthesis 8k and 16k 
versions)
   * 4 other voices using MBROLA based diphone synthesis (1 British Male,
     2 American Males and 1 American Female).
   * Castilian Spanish Male (for residual LPC resynthesis 11k version)
   * British English Male (for spike excited LPC resynthesis 10k version)
   * Full documentation (html, postscript and GNU info format)

Note there are some licence restrictions on the voices themselves.
                                                                                
Festival version 1.95 sources, voices, and Linux binaries are
available from
                                                                                
In Europe:
    http://www.cstr.ed.ac.uk/projects/festival
In North America:
    http://festvox.org/festival

Requirements
                                                                                
To run Festival you need:
   * A Unix machine, Festival has compiled and run on Suns (SunOS and
     Solaris), FreeBSD, Linux, SGIs and DEC Alphas but should be portable
     to any standard Unix machine.
   * A C++ compiler: we have used GCC  version 2.7.2, 2.95.x
     2.96.x, 3.x, and egcs.  Other C++ compilers are
     probably possible with perhaps some minor chanages
   * GNU Make any recent version
   * Audio hardware, /dev/audio (8 bit and 16 bit on Suns, Linux
     and FreeBSD) and NCD's NAS network transparent audio system
     are supported directly but Festival supports the execution of
     any Unix command that can play audio files.
                                                                                
There is also support for building the system under Windows NT and
95/98/2000/XP.  We have successfully ran the system complied with Cygnus'
GNU win32 package and Microsoft's Visual C++, instructions are
included
                                                                                
New in 1.95 version
   * Integrated Nagoya Institute of Technologies HTS Engine
   * Four new US English voices (HTS versions of CMU_ARCTIC voices)
   * Multisyn - A general purpose customisable unit selection engine.
   * 2 US English voices (Multisyn ARCTIC)
   * Support for the new versions of C++ that have been released
   * Bug fixes to reduce "chipmunk" audio output with some audio drivers
   * clunits unit selection improvements
   * lots of small bugs fixeds
   * Apple OS X support
                                                                                
Rob Clark (Edinburgh) and Alan W Black (CMU)
14th Jul 2004


-- 
 /------------------------------------------------------------\
|   __  __ ___  __    | Rob Clark <robert@xxxxxxxxxxxxx>       |
|  /_/ /_/  /  /      | Centre for Speech Technology Research, |
| / \./ /._/. /__lark | University of Edinburgh.               |
 \------------------------------------------------------------/
                                   Information is not knowledge.
                                                -- Frank Zappa

= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
			       Sent Via festival-talk@xxxxxxxxxxxxx
		        To unsubscribe mail majordomo@xxxxxxxxxxxxx