Language And Speech
Learn how to configure languages, voices, custom pronunciations, and word recognition for your AI character in Convai Playground.
Introduction
The Language and Speech section allows you to define the spoken languages, select a voice, and improve pronunciation and recognition for your AI character. With support for multiple languages and voice providers, you can ensure that your character communicates naturally and effectively with your audience.

Main Features
1. Set Language
Choose the languages your character can speak and understand.
Supports multilingual select between 1 and 4 languages.
Default language: English.
Over 65+ languages are available.
Selecting a language will filter the available voices in the Voice section.

2. Voice Selection
The Voice field provides access to over 1,200 voices in total. When you select a language, the available voices are filtered accordingly, so the number of voices varies by language.
Supported voice providers:
Google Cloud Platform (GCP)
Microsoft Azure
OpenAI
ElevenLabs
Custom Voices can be added through ElevenLabs.
See: ElevenLabs Voice Integration Documentation for setup details.

3. Add Custom Pronunciation
Custom pronunciations help your character pronounce specific words correctly, especially unusual or brand-specific terms.
To add:
Spelled As – The word as it appears in text.
Pronounced As – How it should sound, written phonetically in plain English.
Example:
Spelled As: convai
Pronounced As: convey
Case-sensitive: Uppercase and lowercase entries can have different pronunciations.
Currently only supports English.

4. New Word Recognition
New Word Recognition improves your character’s ability to understand unique or challenging words in speech input.
To add:
Spelled As – The correct spelling of the word.
Pronounced As – The phonetic pronunciation using simple syllables.
Example:
Spelled As: Ankur
Pronounced As: Ahnkur
Currently only supports English.

Conclusion
The Language and Speech settings provide complete control over how your character communicates, from language selection and voice choice to fine-tuning pronunciation and recognition. These tools help ensure your AI character delivers clear, accurate, and engaging interactions for users.
Last updated
Was this helpful?