ElevenLabs
ElevenLabs provides highly realistic AI voices with advanced prosody and natural speech patterns, delivering lifelike audio synthesis with emotional nuance and conversational flow.
Sample configuration
The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.
Key parameters
paramsrequired
- key stringrequired
The API key used for authentication. Get your API key from the ElevenLabs Console.
- model_id stringrequired
Identifier of the model to be used. Popular options include
eleven_flash_v2_5for speed oreleven_multilingual_v2for quality. - voice_id stringrequired
The identifier for the selected voice for speech synthesis. Browse available voices in the Voice Library.
- sample_rate numbernullable
Default:
24000Audio sampling rate in Hz. Common values:
16000,22050,24000,44100. - speed numbernullable
Default:
1.0Speed up or slow down the speed of the generated speech. Range
0.7to1.2inclusive. - stability numbernullable
Controls voice stability. Higher values
(0.8-1.0)produce more consistent speech, lower values(0.0-0.5)add more variation. - similarity_boost numbernullable
Enhances similarity to the original voice. Range:
0.0-1.0. Higher values stick closer to the training voice. - style numbernullable
Controls speaking style and expressiveness. Higher values increase emotional range and variation.
- use_speaker_boost booleannullable
Improves voice quality and similarity when enabled. Recommended for most use cases.
For advanced configuration options, voice cloning, and detailed parameter descriptions, see the ElevenLabs TTS documentation.