Skip to main content

Supported Voices and Languages

SignalWire's cloud platform integrates with leading third-party text-to-speech (TTS) providers. This guide describes supported engines, voices, and languages. Refer to each provider's documentation for up-to-date model details and service information.

Compare providers and models

SignalWire's TTS providers offer a wide range of voice engines optimized for various applications. Select a provider, model, and voice according to the following considerations:

Cost: When cost-efficiency is the top priority, select a Standard-tier voice from Google Cloud or Amazon Polly. Review our pricing information to learn more.

Language support: Amazon Polly, ElevenLabs, Google Cloud, and OpenAI offer a wide range of supported languages. In addition, all ElevenLabs and OpenAI voices are fully multilingual.

Model quality and realism: All four supported providers offer high-quality engines: Google Cloud's WaveNet and Neural2, Amazon Polly Neural, ElevenLabs' Multilingual v2, and Deepgram's Aura are all optimized for voice quality.

SSML support: Google Cloud and Amazon Polly support SSML (Speech Synthesis Markup Language) as a string wrapped in <speak> tags. Consult Google Cloud's SSML docs for details. Refer to the Amazon Polly docs for more information on using SSML and supported SSML tags.

Use voice identifier strings

Compose voice identifier strings using the <engine>.<voice id> format.

First, select your engine using the gcloud, polly, elevenlabs, or deepgram identifier. Append a period (.), followed by the specific voice ID from the TTS provider.

Case insensitivity

Voice identifier strings are case insensitive. For example, gcloud.en-US-Neural2-A, gcloud.en-us-neural2-a, and GCLOUD.EN-US-NEURAL2-A are equivalent.

For detailed instructions for each provider, consult the voice ID references linked in the Usage column of the below table.

TTS providerEngine codeSample voice ID stringUsage
Amazon Pollypollypolly.Joanna-NeuralReference
Cartesiacartesiacartesia.a167e0f3-df7e-4d52-a9c3-f949145efdabReference
Deepgramdeepgramdeepgram.aura-asteria-enReference
ElevenLabselevenlabselevenlabs.thomasReference
Google Cloudgcloudgcloud.en-US-Casual-KReference
OpenAIopenaiopenai.alloyReference


TTS providers

Amazon Polly

Amazon Web Services' Polly TTS engine includes several models to accommodate different use cases. SignalWire supports the Standard, Neural, and Generative models:

  • Standard is a traditional, cost-effective, and reliable TTS model. It is less natural-sounding but more budget-friendly than Polly Neural. Example voice identifier string: polly.Emma
  • Neural is an advanced model designed to produce speech that is more natural and closer to human-like pronunciation and intonation. Example voice identifier string: polly.Emma-Neural

Set language for Amazon Polly voices

Most Amazon Polly voices support a single language. Select a language by choosing from the list of supported voices.

All Amazon Polly voices support accented bilingual pronunciation through the use of the SSML lang tag.

Amazon Polly also offers some fully bilingual voices designed to fluently speak two languages.

Amazon Polly voice IDs

Polly voices are identified by the voice name (like Amy, Matthew, Mia, Zhiyu, etc ) only, except when the voice exists in multiple models. In that case, append a code after a dash - to specify variations of the model, like neural or generative. If no model code is specified, the Standard model will be used.

Example stringModel used
polly.AmyStandard
polly.Amy-NeuralNeural
polly.Amy-GenerativeGenerative
Use Amazon Polly voices on the SignalWire platform

Use the languages SWML method to set one or more voices for an AI agent.

version: 1.0.0
sections:
main:
- ai:
prompt:
text: Have an open-ended conversation about flowers.
languages:
- name: English
code: en-US
voice: polly.Ruth-Neural

Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.

version: 1.0.0
sections:
main:
- set:
say_voice: "polly.Ruth-Neural"
- play: "say:Greetings. This is the Ruth voice from Amazon Polly's Neural text-to-speech model."

Cartesia

Cartesia offers a wide selection of fully multilingual voices with very low latency.

Consult Cartesia's Text-to-Speech documentation for more information and audio samples for available voices. Create a Cartesia Account to browse and test voices in the Cartesia Playground.

Cartesia voice IDs

Copy the voice ID from the below table:

Voice nameVoice ID
German Conversational Woman
3f4ade23-6eb4-4279-ab05-6a144947c4d5
Nonfiction Man
79f8b5fb-2cc8-479a-80df-29f7a7cf1a3e
Friendly Sidekick
e00d0e4c-a5c8-443f-a8a3-473eb9a62355
French Conversational Lady
a249eaff-1e96-4d2c-b23b-12efa4f66f41
French Narrator Lady
8832a0b5-47b2-4751-bb22-6a8e2149303d
German Reporter Woman
119e03e4-0705-43c9-b3ac-a658ce2b6639
Indian Lady
3b554273-4299-48b9-9aaf-eefd438e3941
British Reading Lady
71a7ad14-091c-4e8e-a314-022ece01c121
British Narration Lady
4d2fd738-3b3d-4368-957a-bb4805275bd9
Japanese Children Book
44863732-e415-4084-8ba1-deabe34ce3d2
Japanese Woman Conversational
2b568345-1d48-4047-b25f-7baccf842eb0
Japanese Male Conversational
e8a863c6-22c7-4671-86ca-91cacffc038d
Reading Lady
15a9cd88-84b0-4a8b-95f2-5d583b54c72e
Newsman
d46abd1d-2d02-43e8-819f-51fb652c1c61
Child
2ee87190-8f84-4925-97da-e52547f9462c
Meditation Lady
cd17ff2d-5ea4-4695-be8f-42193949b946
Maria
5345cf08-6f37-424d-a5d9-8ae1101b9377
1920's Radioman
41534e16-2966-4c6b-9670-111411def906
Newslady
bf991597-6c13-47e4-8411-91ec2de5c466
Calm Lady
00a77add-48d5-4ef6-8157-71e5437b282d
Helpful Woman
156fb8d2-335b-4950-9cb3-a2d33befec77
Mexican Woman
5c5ad5e7-1020-476b-8b91-fdcbe9cc313c
California Girl
b7d50908-b17c-442d-ad8d-810c63997ed9
Korean Narrator Woman
663afeec-d082-4ab5-827e-2e41bf73a25b
Russian Calm Lady
779673f3-895f-4935-b6b5-b031dc78b319
Russian Narrator Man 1
2b3bb17d-26b9-421f-b8ca-1dd92332279f
Russian Narrator Man 2
da05e96d-ca10-4220-9042-d8acef654fa9
Russian Narrator Woman
642014de-c0e3-4133-adc0-36b5309c23e6
Hinglish Speaking Lady
95d51f79-c397-46f9-b49a-23763d3eaa2d
Italian Narrator Woman
0e21713a-5e9a-428a-bed4-90d410b87f13
Polish Narrator Woman
575a5d29-1fdc-4d4e-9afa-5a9a71759864
Chinese Female Conversational
e90c6678-f0d3-4767-9883-5d0ecf5894a8
Pilot over Intercom
36b42fcb-60c5-4bec-b077-cb1a00a92ec6
Chinese Commercial Man
eda5bbff-1ff1-4886-8ef1-4e69a77640a0
French Narrator Man
5c3c89e5-535f-43ef-b14d-f8ffe148c1f0
Spanish Narrator Man
a67e0421-22e0-4d5b-b586-bd4a64aee41d
Reading Man
f146dcec-e481-45be-8ad2-96e1e40e7f32
New York Man
34575e71-908f-4ab6-ab54-b08c95d6597d
Friendly French Man
ab7c61f5-3daa-47dd-a23b-4ac0aac5f5c3
Barbershop Man
a0e99841-438c-4a64-b679-ae501e7d6091
Indian Man
638efaaa-4d0c-442e-b701-3fae16aad012
Australian Customer Support Man
41f3c367-e0a8-4a85-89e0-c27bae9c9b6d
Friendly Australian Man
421b3369-f63f-4b03-8980-37a44df1d4e8
Wise Man
b043dea0-a007-4bbe-a708-769dc0d0c569
Friendly Reading Man
69267136-1bdc-412f-ad78-0caad210fb40
Customer Support Man
a167e0f3-df7e-4d52-a9c3-f949145efdab
Dutch Confident Man
9e8db62d-056f-47f3-b3b6-1b05767f9176
Dutch Man
4aa74047-d005-4463-ba2e-a0d9b261fb87
Hindi Reporter Man
bdab08ad-4137-4548-b9db-6142854c7525
Italian Calm Man
408daed0-c597-4c27-aae8-fa0497d644bf
Italian Narrator Man
029c3c7a-b6d9-44f0-814b-200d849830ff
Swedish Narrator Man
38a146c3-69d7-40ad-aada-76d5a2621758
Polish Confident Man
3d335974-4c4a-400a-84dc-ebf4b73aada6
Spanish-speaking Storyteller Man
846fa30b-6e1a-49b9-b7df-6be47092a09a
Kentucky Woman
4f8651b0-bbbd-46ac-8b37-5168c5923303
Chinese Commercial Woman
0b904166-a29f-4d2e-bb20-41ca302f98e9
Middle Eastern Woman
daf747c6-6bc2-4083-bd59-aa94dce23f5d
Hindi Narrator Woman
c1abd502-9231-4558-a054-10ac950c356d
Sarah
694f9389-aac1-45b6-b726-9d9369183238
Sarah Curious
794f9389-aac1-45b6-b726-9d9369183238
Laidback Woman
21b81c14-f85b-436d-aff5-43f2e788ecf8
Reflective Woman
a3520a8f-226a-428d-9fcd-b0a4711a6829
Helpful French Lady
65b25c5d-ff07-4687-a04c-da2f43ef6fa9
Pleasant Brazilian Lady
700d1ee3-a641-4018-ba6e-899dcadc9e2b
Customer Support Lady
829ccd10-f8b3-43cd-b8a0-4aeaa81f3b30
British Lady
79a125e8-cd45-4c13-8a67-188112f4dd22
Wise Lady
c8605446-247c-4d39-acd4-8f4c28aa363c
Australian Narrator Lady
8985388c-1332-4ce7-8d55-789628aa3df4
Indian Customer Support Lady
ff1bb1a9-c582-4570-9670-5f46169d0fc8
Swedish Calm Lady
f852eb8d-a177-48cd-bf63-7e4dcab61a36
Spanish Narrator Lady
2deb3edf-b9d8-4d06-8db9-5742fb8a3cb2
Salesman
820a3788-2b37-4d21-847a-b65d8a68c99a
Yogaman
f114a467-c40a-4db8-964d-aaba89cd08fa
Movieman
c45bc5ec-dc68-4feb-8829-6e6b2748095d
Wizardman
87748186-23bb-4158-a1eb-332911b0b708
Australian Woman
043cfc81-d69f-4bee-ae1e-7862cb358650
Korean Calm Woman
29e5f8b4-b953-4160-848f-40fae182235b
Friendly German Man
fb9fcab6-aba5-49ec-8d7e-3f1100296dde
Announcer Man
5619d38c-cf51-4d8e-9575-48f61a280413
Wise Guide Man
42b39f37-515f-4eee-8546-73e841679c1d
Midwestern Man
565510e8-6b45-45de-8758-13588fbaec73
Kentucky Man
726d5ae5-055f-4c3d-8355-d9677de68937
Brazilian Young Man
5063f45b-d9e0-4095-b056-8f3ee055d411
Chinese Call Center Man
3a63e2d1-1c1e-425d-8e79-5100bc910e90
German Reporter Man
3f6e78a8-5283-42aa-b5e7-af82e8bb310c
Confident British Man
63ff761f-c1e8-414b-b969-d1833d1c870c
Southern Man
98a34ef2-2140-4c28-9c71-663dc4dd7022
Classy British Man
95856005-0332-41b0-935f-352e296aa0df
Polite Man
ee7ea9f8-c0c1-498c-9279-764d6b56d189
Mexican Man
15d0c2e2-8d29-44c3-be23-d585d5f154a1
Korean Narrator Man
57dba6ff-fe3b-479d-836e-06f5a61cb5de
Turkish Narrator Man
5a31e4fb-f823-4359-aa91-82c0ae9a991c
Turkish Calm Man
39f753ef-b0eb-41cd-aa53-2f3c284f948f
Hindi Calm Man
ac7ee4fa-25db-420d-bfff-f590d740aeb2
Hindi Narrator Man
7f423809-0011-4658-ba48-a411f5e516ba
Polish Narrator Man
4ef93bb3-682a-46e6-b881-8e157b6b4388
Polish Young Man
82a7fc13-2927-4e42-9b8a-bb1f9e506521
Alabama Male
40104aff-a015-4da1-9912-af950fbec99e
Australian Male
13524ffb-a918-499a-ae97-c98c7c4408c4
Anime Girl
1001d611-b1a8-46bd-a5ca-551b23505334
Japanese Man Book
97e7d7a9-dfaa-4758-a936-f5f844ac34cc
Sweet Lady
e3827ec5-697a-4b7c-9704-1a23041bbc51
Commercial Lady
c2ac25f9-ecc4-4f56-9095-651354df60c0
Teacher Lady
573e3144-a684-4e72-ac2b-9b2063a50b53
Princess
8f091740-3df1-4795-8bd9-dc62d88e5131
Commercial Man
7360f116-6306-4e9a-b487-1235f35a0f21
ASMR Lady
03496517-369a-4db1-8236-3d3ae459ddf7
Professional Woman
248be419-c632-4f23-adf1-5324ed7dbf1d
Tutorial Man
bd9120b6-7761-47a6-a446-77ca49132781
Calm French Woman
a8a1eb38-5f15-4c1d-8722-7ac0f329727d
New York Woman
34bde396-9fde-4ebf-ad03-e3a1d1155205
Spanish-speaking Lady
846d6cb0-2301-48b6-9683-48f5618ea2f6
Midwestern Woman
11af83e2-23eb-452f-956e-7fee218ccb5c
Sportsman
ed81fd13-2016-4a49-8fe3-c0d2761695fc
Storyteller Lady
996a8b96-4804-46f0-8e05-3fd4ef1a87cd
Spanish-speaking Man
34dbb662-8e98-413c-a1ef-1a3407675fe7
Doctor Mischief
fb26447f-308b-471e-8b00-8e9f04284eb5
Spanish-speaking Reporter Man
2695b6b5-5543-4be1-96d9-3967fb5e7fec
Young Spanish-speaking Woman
db832ebd-3cb6-42e7-9d47-912b425adbaa
The Merchant
50d6beb4-80ea-4802-8387-6c948fe84208
Stern French Man
0418348a-0ca2-4e90-9986-800fb8b3bbc0
Madame Mischief
e13cae5c-ec59-4f71-b0a6-266df3c9bb8e
German Storyteller Man
db229dfe-f5de-4be4-91fd-7b077c158578
Female Nurse
5c42302c-194b-4d0c-ba1a-8cb485c84ab9
German Conversation Man
384b625b-da5d-49e8-a76d-a2855d4f31eb
Friendly Brazilian Man
6a16c1f4-462b-44de-998d-ccdaa4125a0a
German Woman
b9de4a89-2257-424b-94c2-db18ba68c81a
Southern Woman
f9836c6e-a0bd-460e-9d3c-f7299fa60f94
British Customer Support Lady
a01c369f-6d2d-4185-bc20-b32c225eab70
Chinese Woman Narrator
d4d4b115-57a0-48ea-9a1a-9898966c2966


Prepend cartesia. and the string is ready for use. For example: cartesia.a167e0f3-df7e-4d52-a9c3-f949145efdab

Use Cartesia voices on the SignalWire platform

Use the languages SWML method to set one or more voices for an AI agent.

version: 1.0.0
sections:
main:
- ai:
prompt:
text: Have an open-ended conversation about flowers.
languages:
- name: English
code: en-US
voice: cartesia.a167e0f3-df7e-4d52-a9c3-f949145efdab

Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.

version: 1.0.0
sections:
main:
- set:
say_voice: "cartesia.a167e0f3-df7e-4d52-a9c3-f949145efdab"
- play: "say:Greetings. This is the Customer Support Man voice from Cartesia's Sonic text-to-speech model."

Deepgram

Deepgram offers a range of English-speaking voices for its text-to-speech API, each designed to produce natural-sounding speech output in an array of different accents and speaking styles.

Deepgram's voices are promised to have human-like tones, rhythm, and emotion, lower than 250 ms latency, and are optimized for high-throughput applications.

Consult Deepgram's TTS models guide for more information and samples for supported voices.

Deepgram voice IDs

Copy the voice ID from the Values column of Deepgram's Voice Selection reference. Prepend deepgram. and the string is ready for use. For example: deepgram.aura-athena-en

Use Deepgram voices on the SignalWire platform

Use the languages SWML method to set one or more voices for an AI agent.

version: 1.0.0
sections:
main:
- ai:
prompt:
text: Have an open-ended conversation about flowers.
languages:
- name: English
code: en-US
voice: deepgram.aura-asteria-en

Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.

version: 1.0.0
sections:
main:
- set:
say_voice: "deepgram.aura-asteria-en"
- play: "say:Greetings. This is the Asteria voice from Deepgram's Aura text-to-speech model."

ElevenLabs

ElevenLabs voices offer expressive, human-like pronunciation and an extensive list of supported languages. SignalWire supports the following voices in the Multilingual v2 model:

VoicesLanguages
rachel, clyde, domi, dave, fin, antoni, thomas, charlie, emily, elli, callum, patrick, harry, liam, dorothy, josh, arnold, charlotte, matilda, matthew, james, joseph, jeremy, michael, ethan, gigi, freya, grace, daniel, serena, adam, nicole, jessie, ryan, sam, glinda, giovanni, mimi🇺🇸 English (USA), 🇬🇧 English (UK), 🇦🇺 English (Australia), 🇨🇦 English (Canada), 🇯🇵 Japanese, 🇨🇳 Chinese, 🇩🇪 German, 🇮🇳 Hindi, 🇫🇷 French (France), 🇨🇦 French (Canada), 🇰🇷 Korean, 🇧🇷 Portuguese (Brazil), 🇵🇹 Portuguese (Portugal), 🇮🇹 Italian, 🇪🇸 Spanish (Spain), 🇲🇽 Spanish (Mexico), 🇮🇩 Indonesian, 🇳🇱 Dutch, 🇹🇷 Turkish, 🇵🇭 Filipino, 🇵🇱 Polish, 🇸🇪 Swedish, 🇧🇬 Bulgarian, 🇷🇴 Romanian, 🇸🇦 Arabic (Saudi Arabia), 🇦🇪 Arabic (UAE), 🇨🇿 Czech, 🇬🇷 Greek, 🇫🇮 Finnish, 🇭🇷 Croatian, 🇲🇾 Malay, 🇸🇰 Slovak, 🇩🇰 Danish, 🇮🇳 Tamil, 🇺🇦 Ukrainian, 🇷🇺 Russian

Language selection with ElevenLabs voices

Multilingual v2 voices are designed to be interchangeably compatible with all supported languages. Rather than enforcing language selection with language code, this TTS model automatically uses the appropriate language of the input text.

Consult ElevenLabs' supported languages resource for an up-to-date list of supported languages.

ElevenLabs voice IDs

Copy the voice ID from our list of supported ElevenLabs voices. Prepend elevenlabs. and the string is ready for use. For example: elevenlabs.sam

Use ElevenLabs voices on the SignalWire platform

Use the languages SWML method to set one or more voices for an AI agent.

version: 1.0.0
sections:
main:
- ai:
prompt:
text: Have an open-ended conversation about flowers.
languages:
- name: English
code: en-US
voice: elevenlabs.rachel

Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.

version: 1.0.0
sections:
main:
- set:
say_voice: "elevenlabs.rachel"
- play: "say:Greetings. This is the Rachel voice, speaking in English, from ElevenLabs' Multilingual v2 text-to-speech model."

Google Cloud

Google Cloud offers a number of robust text-to-speech voice models. SignalWire supports all Google Cloud voices in both General Availability and Preview launch stages, except for the Studio model.

  • Standard is a basic, reliable, and budget-friendly text-to-speech model. The Standard model is less natural-sounding than WaveNet and Neural2, but more cost-effective.
  • WaveNet is powered by deep learning technology and offers more natural and lifelike speech output.
  • Neural2 is based on the same technology used to create Custom Voices and prioritizes natural and human-like pronunciation and intonation.
  • Polyglot voices have variants in multiple languages. For example, at time of writing, the polyglot-1 voice has variants for English (Australia), English (US), French, German, Spanish (Spain), and Spanish (US).

Set language for Google Cloud voices

Sample all available voices with Google's supported voices and languages reference. Copy the voice identifier string in whole from the Voice name column.

Unlike the other supported engines, Google Cloud voice identifier strings include both voice and language keys, following the pattern <language>-<model>-<variant>. For example:

  • English (UK) WaveNet female voice: en-GB-Wavenet-A
  • Spanish (Spain) Neural2 male voice: es-ES-Neural2-B
  • Mandarin Chinese Standard female voice: cmn-CN-Standard-D

Google Cloud voice IDs

Copy the voice ID in whole from the Voice name column of Google's table of supported voices. Google Cloud voice IDs encode language and model information, so no modification is needed to make these selections. Prepend gcloud. and the string is ready for use. For example: gcloud.en-GB-Wavenet-A

Use Google Cloud voices on the SignalWire platform

Use the languages SWML method to set one or more voices for an AI agent.

version: 1.0.0
sections:
main:
- ai:
prompt:
text: Have an open-ended conversation about flowers.
languages:
- name: English
code: en-US
voice: gcloud.en-US-Neural2-A

Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.

version: 1.0.0
sections:
main:
- set:
say_voice: "gcloud.en-US-Neural2-A"
- play: "say:Greetings. This is the 2-A US English voice from Google Cloud's Neural2 text-to-speech model."

OpenAI

OpenAI offers versatile multilingual voices balancing low latency and good quality. While voices are optimized for English, they perform well across all supported languages.

Consult OpenAI's Text-to-Speech documentation for more information and audio samples for available voices.

OpenAI voice IDs

Copy the voice ID from OpenAI's Voice Options reference.

Prepend openai. and the string is ready for use. For example: openai.alloy

Use OpenAI voices on the SignalWire platform

Use the languages SWML method to set one or more voices for an AI agent.

version: 1.0.0
sections:
main:
- ai:
prompt:
text: Have an open-ended conversation about flowers.
languages:
- name: English
code: en-US
voice: openai.alloy

Alternatively, use the say_voice parameter of the play SWML method to select a voice for basic TTS.

version: 1.0.0
sections:
main:
- set:
say_voice: "openai.alloy"
- play: "say:Greetings. This is the Alloy voice from OpenAI's text-to-speech model."

Pricing

Voices are priced according to model in three tiers. Consult our Voice API Pricing for up-to-date pricing information.

Standard

  • Google Cloud Standard
  • Amazon Polly Standard

Premium

  • Google Cloud Neural2, WaveNet, and Journey
  • Amazon Polly Neural and Generative
  • Deepgram Aura

ElevenLabs voices have their own tier.