eSpeak: Formant Synthesis Text-to-Speech Software

(Most has been adapted from local file /usr/share/doc/espeak/docs/index.html)

eSpeak is a compact open source software speech synthesizer for English and other languages, for Linux and Windows.  http://espeak.sourceforge.net

eSpeak uses a formant synthesis method. This allows many languages to be provided in a small size. The speech is clear, and can be used at high speeds, but is not as natural or smooth as larger synthesizers which are based on human speech recordings.

eSpeak is available as:


The latest development version is at: espeak.sf.net/test/latest.html.


I regularly use eSpeak to listen to blogs and news sites. I prefer the sound through a domestic stereo system rather than small computer speakers, which can sound rather harsh.

History. Originally known as speak and originally written for Acorn/RISC_OS computers starting in 1995. This version is an enhancement and re-write, including a relaxation of the original memory and processing power constraints, and with support for additional languages.

Features

Languages

The eSpeak speech synthesizer supports several languages, however in many cases these are initial drafts and need more work to improve them. Assistance from native speakers is welcome for these, or other new languages. Please contact me if you want to help.

eSpeak does text to speech synthesis for the following languages, some better than others. Afrikaans, Albanian, Armenian, Cantonese, Catalan, Croatian, Czech, Danish, Dutch, English, Esperanto, Finnish, French, German, Greek, Hindi, Hungarian, Icelandic, Indonesian, Italian, Kurdish, Latvian, Lojban, Macedonian, Mandarin, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Spanish, Swahili, Swedish, Tamil, Turkish, Vietnamese, Welsh.

espeakedit

espeakedit is a GUI program used to prepare and compile phoneme data. It is now available for download. Documentation is currently sparse, but if you want to use it to add or improve language support, let me know.

Installing eSpeak

I first tried espeak, with mixed results, before trying Festival, reported to have a steep learning curve. I installed the following linux packages:

Documentation is found in /usr/share/doc/espeak/docs/, just open /usr/share/doc/espeak/docs/index.html.

All in all a meagre 3.2 MB of additional hard-disk space.

Running espeak

I play an SSML file:

espeak -m -f spec-example-1.ssml

[...]

You may raise the pitch for capitalized runs (switch -kN) and you may also save to a WAV file (sample.wav):

espeak  -m -s 144 -g 1 -k20  -f template.ssml -w /dev/shm/sample.wav

(Castillian) Spanish Speech Synthesis

I test Spanish speech synthesis:

espeak -s 140 -p 60 -ves "Hola. Me llamo Francisco Fernández-Victorio Hernández."

I play a longish file, too:

espeak -s 170 -p 60 -ves -f /dev/shm/WHATEVER.txt
espeak -v europe/es -m -s 130 -g 3  -f FILE.html