ESpeak-ng: speech synthesizer with more than one hundred languages and accents

https://github.com/espeak-ng/espeak-ng

eSpeak NG Text-to-Speech


The eSpeak NG is a compact open source software text-to-speech synthesizer for Linux, Windows, Android and other operating systems. It supports more than 100 languages and accents. It is based on the eSpeak engine created by Jonathan Duddington.

eSpeak NG uses a "formant synthesis" method. This allows many languages to be provided in a small size. The speech is clear, and can be used at high speeds, but is not as natural or smooth as larger synthesizers which are based on human speech recordings. It also supports Klatt formant synthesis, and the ability to use MBROLA as backend speech synthesizer.

eSpeak NG is available as:

  • A command line program (Linux and Windows) to speak text from a file or from stdin.
  • A shared library version for use by other programs. (On Windows this is a DLL).
  • A SAPI5 version for Windows, so it can be used with screen-readers and other programs that support the Windows SAPI5 interface.
  • eSpeak NG has been ported to other platforms, including Solaris and Mac OSX.

Features

  • Includes different Voices, whose characteristics can be altered.
  • Can produce speech output as a WAV file.
  • SSML (Speech Synthesis Markup Language) is supported (not complete), and also HTML.
  • Compact size. The program and its data, including many languages, totals about few Mbytes.
  • Can be used as a front-end to MBROLA diphone voices. eSpeak NG converts text to phonemes with pitch and length information.
  • Can translate text into phoneme codes, so it could be adapted as a front end for another speech synthesis engine.
  • Potential for other languages. Several are included in varying stages of progress. Help from native speakers for these or other languages is welcome.
  • Written in C.

See the ChangeLog for a description of the changes in the various releases and with the eSpeak NG project.

The following platforms are supported:

Platform Minimum Version Status
Linux CI
BSD
Android 4.0
Windows Windows 8
Mac

Documentation

  1. User guide explains how to set up and use eSpeak NG from command line or as a library.
  2. Building guide provides info how to compile and build eSpeak NG from the source.
  3. Index provides full list of more detailed information for contributors and developers.
  4. Look at contribution guide to start your contribution.
  5. Look at eSpeak NG roadmap to participate in development of eSpeak NG.

eSpeak Compatibility

The espeak-ng binaries use the same command-line options as espeak, with several additions to provide new functionality from espeak-ng such as specifying the output audio device name to use. The build creates symlinks of espeak to espeak-ng, and speak to speak-ng.

The espeak speak_lib.h include file is located in espeak-ng/speak_lib.h with an optional symlink in espeak/speak_lib.h. This file contains the espeak 1.48.15 API, with a change to the ESPEAK_API macro to fix building on Windows and some minor changes to the documentation comments. This C API is API and ABI compatible with espeak.

The espeak-data data has been moved to espeak-ng-data to avoid conflicts with espeak. There have been various changes to the voice, dictionary and phoneme files that make them incompatible with espeak.

The espeak-ng project does not include the espeakedit program. It has moved the logic to build the dictionary, phoneme and intonation binary files into the libespeak-ng.so file that is accessible from the espeak-ng command line and C API.

History

The program was originally known as speak and originally written for Acorn/RISC_OS computers starting in 1995 by Jonathan Duddington. This was enhanced and re-written in 2007 as eSpeak, including a relaxation of the original memory and processing power constraints, and with support for additional languages.

In 2010, Reece H. Dunn started maintaining a version of eSpeak on GitHub that was designed to make it easier to build eSpeak on POSIX systems, porting the build system to autotools in 2012. In late 2015, this project was officially forked to a new eSpeak NG project. The new eSpeak NG project is a significant departure from the eSpeak project, with the intention of cleaning up the existing codebase, adding new features, and adding to and improving the supported languages.

The historical branch contains the available older releases of the original eSpeak that are not contained in the subversion repository.

1.24.02 is the first version of eSpeak to appear in the subversion repository, but releases from 1.05 to 1.24 are available at http://sourceforge.net/projects/espeak/files/espeak/.

These early releases have been checked into the historical branch, with the 1.24.02 release as the last entry. This makes it possible to use the replace functionality of git to see the earlier history:

git replace 8d59235f 63c1c019

NOTE: The source releases contain the big_endian, espeak-edit, praat-mod, riskos, windows_dll and windows_sapi folders. These do not appear in the source repository until later releases, so have been excluded from the historical commits to align them better with the 1.24.02 source commit.

License Information

eSpeak NG Text-to-Speech is released under the GPL version 3 or later license.

The getopt.c compatibility implementation for getopt support on Windows is taken from the NetBSD getopt_long implementation, which is licensed under a 2-clause BSD license.

Android is a trademark of Google LLC.

Acknowledgements

The catalan extension was funded by [Departament de la Vicepresidència i de Polítiques Digitals i Territori de la Generalitat de Catalunya](https://politiquesdigitals.gencat.cat/ca/inici/index.html#googtrans(ca|en) within the framework of Projecte AINA.

{
"by": "nateb2022",
"descendants": 107,
"id": 40231630,
"kids": [
40232384,
40233104,
40232263,
40235318,
40233054,
40234196,
40232309,
40234788,
40234245,
40232527,
40234739,
40234722,
40233249,
40235194,
40232320,
40236704,
40233103,
40235950,
40236035,
40236412,
40232626,
40238113,
40239381,
40232672
],
"score": 256,
"time": 1714612014,
"title": "ESpeak-ng: speech synthesizer with more than one hundred languages and accents",
"type": "story",
"url": "https://github.com/espeak-ng/espeak-ng"
}
{
"author": "espeak-ng",
"date": null,
"description": "eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents. - espeak-ng/espeak-ng",
"image": "https://opengraph.githubassets.com/e6cb2c0e46cd5cf27cd77cb6c4708b32e6cb8dfa4425a88eb412dad60b95d651/espeak-ng/espeak-ng",
"logo": "https://logo.clearbit.com/github.com",
"publisher": "GitHub",
"title": "GitHub - espeak-ng/espeak-ng: eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.",
"url": "https://github.com/espeak-ng/espeak-ng"
}
{
"url": "https://github.com/espeak-ng/espeak-ng",
"title": "GitHub - espeak-ng/espeak-ng: eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.",
"description": "eSpeak NG Text-to-Speech Features Supported languages Documentation eSpeak Compatibility History License Information The eSpeak NG is a compact open source software text-to-speech synthesizer for Linux,...",
"links": [
"https://github.com/espeak-ng/espeak-ng"
],
"image": "https://opengraph.githubassets.com/e6cb2c0e46cd5cf27cd77cb6c4708b32e6cb8dfa4425a88eb412dad60b95d651/espeak-ng/espeak-ng",
"content": "<div><article><p></p><h2>eSpeak NG Text-to-Speech</h2><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#espeak-ng-text-to-speech\"></a><p></p>\n<ul>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#features\">Features</a></li>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/languages.md\">Supported languages</a></li>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#documentation\">Documentation</a></li>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#espeak-compatibility\">eSpeak Compatibility</a></li>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#history\">History</a></li>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#license-information\">License Information</a></li>\n</ul>\n<hr />\n<p>The eSpeak NG is a compact open source software text-to-speech synthesizer for\nLinux, Windows, Android and other operating systems. It supports\n<a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/languages.md\">more than 100 languages and accents</a>. It is based on the eSpeak engine\ncreated by Jonathan Duddington.</p>\n<p>eSpeak NG uses a \"formant synthesis\" method. This allows many languages to be\nprovided in a small size. The speech is clear, and can be used at high speeds,\nbut is not as natural or smooth as larger synthesizers which are based on human\nspeech recordings. It also supports Klatt formant synthesis, and the ability\nto use MBROLA as backend speech synthesizer.</p>\n<p>eSpeak NG is available as:</p>\n<ul>\n<li>A <a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/src/espeak-ng.1.ronn\">command line</a> program (Linux and Windows) to speak text from a file or\nfrom stdin.</li>\n<li>A <a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/integration.md\">shared library</a> version for use by other programs. (On Windows this is\na DLL).</li>\n<li>A SAPI5 version for Windows, so it can be used with screen-readers and\nother programs that support the Windows SAPI5 interface.</li>\n<li>eSpeak NG has been ported to other platforms, including Solaris and Mac\nOSX.</li>\n</ul>\n<p></p><h2>Features</h2><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#features\"></a><p></p>\n<ul>\n<li>Includes different Voices, whose characteristics can be altered.</li>\n<li>Can produce speech output as a WAV file.</li>\n<li>SSML (Speech Synthesis Markup Language) is supported (not complete),\nand also HTML.</li>\n<li>Compact size. The program and its data, including many languages,\ntotals about few Mbytes.</li>\n<li>Can be used as a front-end to <a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/mbrola.md\">MBROLA diphone voices</a>.\neSpeak NG converts text to phonemes with pitch and length information.</li>\n<li>Can translate text into phoneme codes, so it could be adapted as a\nfront end for another speech synthesis engine.</li>\n<li>Potential for other languages. Several are included in varying stages\nof progress. Help from native speakers for these or other languages is\nwelcome.</li>\n<li>Written in C.</li>\n</ul>\n<p>See the <a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/ChangeLog.md\">ChangeLog</a> for a description of the changes in the\nvarious releases and with the eSpeak NG project.</p>\n<p>The following platforms are supported:</p>\n<table>\n<thead>\n<tr>\n<th>Platform</th>\n<th>Minimum Version</th>\n<th>Status</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>Linux</td>\n<td></td>\n<td><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/actions/workflows/ci.yml/badge.svg\"><img src=\"https://github.com/espeak-ng/espeak-ng/actions/workflows/ci.yml/badge.svg\" alt=\"CI\" /></a></td>\n</tr>\n<tr>\n<td>BSD</td>\n<td></td>\n<td></td>\n</tr>\n<tr>\n<td>Android</td>\n<td>4.0</td>\n<td></td>\n</tr>\n<tr>\n<td>Windows</td>\n<td>Windows 8</td>\n<td></td>\n</tr>\n<tr>\n<td>Mac</td>\n<td></td>\n<td></td>\n</tr>\n</tbody>\n</table>\n<p></p><h2>Documentation</h2><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#documentation\"></a><p></p>\n<ol>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/guide.md\">User guide</a> explains how to set up and use eSpeak NG from command line or as a library.</li>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/building.md\">Building guide</a> provides info how to compile and build eSpeak NG from the source.</li>\n<li><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/index.md\">Index</a> provides full list of more detailed information for contributors and developers.</li>\n<li>Look at <a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/docs/contributing.md\">contribution guide</a> to start your contribution.</li>\n<li>Look at <a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/wiki/eSpeak-NG-roadmap\">eSpeak NG roadmap</a> to participate in development of eSpeak NG.</li>\n</ol>\n<p></p><h2>eSpeak Compatibility</h2><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#espeak-compatibility\"></a><p></p>\n<p>The <em>espeak-ng</em> binaries use the same command-line options as <em>espeak</em>, with\nseveral additions to provide new functionality from <em>espeak-ng</em> such as specifying\nthe output audio device name to use. The build creates symlinks of <code>espeak</code> to\n<code>espeak-ng</code>, and <code>speak</code> to <code>speak-ng</code>.</p>\n<p>The espeak <code>speak_lib.h</code> include file is located in <code>espeak-ng/speak_lib.h</code> with\nan optional symlink in <code>espeak/speak_lib.h</code>. This file contains the espeak 1.48.15\nAPI, with a change to the <code>ESPEAK_API</code> macro to fix building on Windows\nand some minor changes to the documentation comments. This C API is API and ABI\ncompatible with espeak.</p>\n<p>The <code>espeak-data</code> data has been moved to <code>espeak-ng-data</code> to avoid conflicts with\nespeak. There have been various changes to the voice, dictionary and phoneme files\nthat make them incompatible with espeak.</p>\n<p>The <em>espeak-ng</em> project does not include the <em>espeakedit</em> program. It has moved\nthe logic to build the dictionary, phoneme and intonation binary files into the\n<code>libespeak-ng.so</code> file that is accessible from the <code>espeak-ng</code> command line and\nC API.</p>\n<p></p><h2>History</h2><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#history\"></a><p></p>\n<p>The program was originally known as <strong>speak</strong> and originally written\nfor Acorn/RISC_OS computers starting in 1995 by Jonathan Duddington. This was\nenhanced and re-written in 2007 as <strong>eSpeak</strong>, including a relaxation of the\noriginal memory and processing power constraints, and with support for additional\nlanguages.</p>\n<p>In 2010, Reece H. Dunn started maintaining a version of eSpeak on GitHub that\nwas designed to make it easier to build eSpeak on POSIX systems, porting the\nbuild system to autotools in 2012. In late 2015, this project was officially\nforked to a new <strong>eSpeak NG</strong> project. The new eSpeak NG project is a significant\ndeparture from the eSpeak project, with the intention of cleaning up the\nexisting codebase, adding new features, and adding to and improving the\nsupported languages.</p>\n<p>The <em>historical</em> branch contains the available older releases of the original\neSpeak that are not contained in the subversion repository.</p>\n<p>1.24.02 is the first version of eSpeak to appear in the subversion\nrepository, but releases from 1.05 to 1.24 are available at\n<a target=\"_blank\" href=\"http://sourceforge.net/projects/espeak/files/espeak/\">http://sourceforge.net/projects/espeak/files/espeak/</a>.</p>\n<p>These early releases have been checked into the historical branch,\nwith the 1.24.02 release as the last entry. This makes it possible\nto use the replace functionality of git to see the earlier history:</p>\n<div><pre><code>git replace 8d59235f 63c1c019\n</code></pre></div>\n<p><strong>NOTE:</strong> The source releases contain the <code>big_endian</code>, <code>espeak-edit</code>,\n<code>praat-mod</code>, <code>riskos</code>, <code>windows_dll</code> and <code>windows_sapi</code> folders. These\ndo not appear in the source repository until later releases, so have\nbeen excluded from the historical commits to align them better with\nthe 1.24.02 source commit.</p>\n<p></p><h2>License Information</h2><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#license-information\"></a><p></p>\n<p>eSpeak NG Text-to-Speech is released under the <a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/COPYING\">GPL version 3</a> or\nlater license.</p>\n<p>The <code>getopt.c</code> compatibility implementation for getopt support on Windows is\ntaken from the NetBSD <code>getopt_long</code> implementation, which is licensed under a\n<a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng/blob/master/COPYING.BSD2\">2-clause BSD</a> license.</p>\n<p>Android is a trademark of Google LLC.</p>\n<p></p><h2>Acknowledgements</h2><a target=\"_blank\" href=\"https://github.com/espeak-ng/espeak-ng#acknowledgements\"></a><p></p>\n<p>The catalan extension was funded by [Departament de la Vicepresidència i de Polítiques Digitals i Territori de la Generalitat de Catalunya](<a target=\"_blank\" href=\"https://politiquesdigitals.gencat.cat/ca/inici/index.html#googtrans(ca%7Cen)\">https://politiquesdigitals.gencat.cat/ca/inici/index.html#googtrans(ca|en)</a>\nwithin the framework of\n<a target=\"_blank\" href=\"https://politiquesdigitals.gencat.cat/ca/economia/catalonia-ai/aina\">Projecte AINA</a>.</p>\n</article></div>",
"author": "",
"favicon": "https://github.githubassets.com/favicons/favicon.svg",
"source": "github.com",
"published": "",
"ttr": 177,
"type": "object"
}