Skip to main content

Supported Voices and Languages

Below is a list of supported Voice languages and codes from different providers.

Google

Google is one of the leading tech companies in the world and has developed a suite of cloud services tailored for various applications. One of these services is text-to-speech conversion. Google's text-to-speech service provides three types of voices: WaveNet, Nueral2, and Standard.

Supported Voices

You can find a link to the supported voices here

Google Neural2 Voices

Neural2 voices are the latest generation of Google's text-to-speech voices. They are powered by advanced neural network technology and are designed to produce speech that is more natural and closer to human-like pronunciation and intonation. Neural2 voices are based on the same technology used to create Custom Voices. More information on Neural2 voices can be found here.

When using a Neural2 voice, you must specify the voice to use the Neural2 in the voice parameter.

Neural2 Format: "gcloud.<language-code>-Neural2-<variant>"

Neural2 Example: "gcloud.en-US-Neural2-A"

Google WaveNet Voices

These are advanced voices powered by deep learning technology. They offer a more natural and lifelike speech output. However, they come at a premium price compared to the standard voices. More information on WaveNet voices can be found here.

When using a WaveNet voice, you must specify the voice to use the Wavenet in the voice parameter.

WaveNet Format: "gcloud.<language-code>-Wavenet-<variant>"

WaveNet Example: "gcloud.en-GB-Wavenet-A"

Google Standard Voices

These are the traditional text-to-speech voices provided by Google. Suitable for a wide range of applications, they are budget-friendly and serve as a reliable option for many developers and businesses. Less natural sounding than WaveNet and Neural2 voices. More information on Standard voices can be found here.

When using a Standard voice, you must specify the voice to use the Standard in the voice parameter

Standard Format: "gcloud.<language-code>-Standard-<variant>"

Standard Example: "gcloud.en-US-Standard-A"

Supported Languages

You can find a link to the supported languages here

SSML

SSML may be entered as a string wrapped in <speak> tags.

You can find more information on Google Cloud SSML here

Amazon

Amazon, a global tech giant, offers a text-to-speech service through its cloud platform, AWS, known as Polly. Polly is designed to turn text into lifelike speech, allowing applications to speak out loud and developers to create applications that talk.

Supported Voices

You can find the list of supported Neural voices here

You can find the list of supported Standard voices here

Polly Neural Voices

Powered by advanced neural network technology, these voices are designed to produce speech that is more natural and closer to human-like pronunciation and intonation. Due to the sophistication of the technology, they come at a premium price.

When using a Neural voice, you must specify the voice to use the Neural in the voice parameter.

Neural Format: "polly.<language-code>-<voice-name>-Neural"

Neural Example: "polly.en-GB-Emma-Neural"

Polly Standard Voices

These are the traditional text-to-speech voices provided by Polly. Suitable for a wide range of applications, they are budget-friendly and serve as a reliable option for many developers and businesses. Less natural sounding than Polly Neural voices.

Standard Format: "polly.<language-code>-<voice-name>"

Standard Example: "polly.en-GB-Emma"

Supported Languages

You can find a link to the supported languages here

SSML

SSML may be entered as a string wrapped in <speak> tags.

You can find more information on Amazon's Polly SSML here

ElevenLabs

Supported Voices

Currently, we support the voices from the Multilingual v2 model from ElevenLabs.


ElevenLabs ModelSupported Voice
Multilingual v2rachel, clyde, domi, dave, fin, antoni, thomas, charlie, emily, elli, callum, patrick, harry, liam, dorothy, josh, arnold, charlotte, matilda, matthew, james, joseph, jeremy, michael, ethan, gigi, freya, grace, daniel, serena, adam, nicole, jessie, ryan, sam, glinda, giovanni, mimi

ElevenLabs Format: "elevenlabs.<voice-name>"

ElevenLabs Example: "elevenlabs.rachel"

Supported Languages

We currently support the Multilingual v2 model from ElevenLabs.

The Multilingual model is capable of supporting multiple different languages.

Below is the list of supported languages in ElevenLabs.

ElevenLabs ModelSupported Languages
Multilingual v2US - [English - USA]
GB - [English - UK]
AU - [English - Australia]
CA - [English - Canada]
JP - [Japanese]
CN - [Chinese]
DE - [German]
IN - [Hindi]
FR - [French - France]
CA - [French - Canada]
KR - [Korean]
BR - [Portuguese - Brazil]
PT - [Portuguese - Portugal]
IT - [Italian]
ES - [Spanish - Spain]
MX - [Spanish - Mexico]
ID - [Indonesian]
NL - [Dutch]
TR - [Turkish]
PH - [Filipino]
PL - [Polish]
SE - [Swedish]
BG - [Bulgarian]
RO - [Romanian]
SA - [Arabic - Saudi Arabia]
AE - [Arabic - UAE]
CZ - [Czech]
GR - [Greek]
FI - [Finnish]
HR - [Croatian]
MY - [Malay]
SK - [Slovak]
DK - [Danish]
IN - [Tamil]
UA - [Ukrainian]

SSML

ElevenLabs does not currently support the usage of SSML, therefore, it cannot be used when utilizing an ElevenLabs Voice model.