ElevenLabs
ElevenLabs provides highly realistic AI voices with advanced prosody and natural speech patterns, delivering lifelike audio synthesis with emotional nuance and conversational flow.
Sample configuration
The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.
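As a minimal sketch, a tts configuration built from the parameters described under Key parameters might look like the following. The vendor field, the placeholder values, and the helper name build_tts_config are assumptions made for illustration and are not confirmed by this page; check the request schema of the Start a conversational AI agent API before relying on them.

```python
import json


def build_tts_config(api_key: str, voice_id: str) -> dict:
    """Return a tts block that uses only parameters documented on this page."""
    return {
        # Assumption: the field that selects the TTS provider is called
        # "vendor"; this page does not state its name.
        "vendor": "elevenlabs",
        "params": {
            "key": api_key,                   # required
            "model_id": "eleven_flash_v2_5",  # required; the speed-oriented model
            "voice_id": voice_id,             # required
            "sample_rate": 24000,             # documented default
            "speed": 1.0,                     # documented default
        },
    }


if __name__ == "__main__":
    # Placeholder values; substitute a real API key and voice ID.
    tts = build_tts_config("<your_elevenlabs_api_key>", "<your_voice_id>")
    print(json.dumps({"tts": tts}, indent=2))
```

Only key, model_id, and voice_id are required; the other two entries repeat the documented defaults and can be omitted.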
Key parameters
params (required)
- key (string, required)
  The API key used for authentication. Get your API key from the ElevenLabs Console.
- model_id (string, required)
  Identifier of the model to be used. Popular options include eleven_flash_v2_5 for speed or eleven_multilingual_v2 for quality.
- voice_id (string, required)
  The identifier for the selected voice for speech synthesis. Browse available voices in the Voice Library.
- sample_rate (number, nullable)
  Default: 24000
  Audio sampling rate in Hz. Common values: 16000, 22050, 24000, 44100.
- speed (number, nullable)
  Default: 1.0
  Adjusts the speed of the generated speech. Range: 0.7 to 1.2, inclusive.
- stability (number, nullable)
  Controls voice stability. Higher values (0.8-1.0) produce more consistent speech; lower values (0.0-0.5) add more variation.
- similarity_boost (number, nullable)
  Enhances similarity to the original voice. Range: 0.0-1.0. Higher values stick closer to the training voice.
- style (number, nullable)
  Controls speaking style and expressiveness. Higher values increase emotional range and variation.
- use_speaker_boost (boolean, nullable)
  Improves voice quality and similarity when enabled. Recommended for most use cases.
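
The required fields and numeric ranges listed above can be checked before a request is sent. The following is a hedged sketch, not part of any SDK: validate_tts_params is a hypothetical helper, and the 0.0 to 1.0 bound on stability is an assumption inferred from the example ranges in the list, since no explicit range is stated for it. No range is given for style, so it is not checked.

```python
from typing import List

REQUIRED = ("key", "model_id", "voice_id")

# Inclusive (min, max) bounds. The speed and similarity_boost bounds come
# from the parameter list; the stability bound is an assumption inferred
# from the example ranges given there.
RANGES = {
    "speed": (0.7, 1.2),
    "similarity_boost": (0.0, 1.0),
    "stability": (0.0, 1.0),
}


def validate_tts_params(params: dict) -> List[str]:
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    for name in REQUIRED:
        if not params.get(name):
            problems.append(f"missing required parameter: {name}")
    for name, (low, high) in RANGES.items():
        value = params.get(name)
        if value is None:  # nullable parameters may be omitted or null
            continue
        if not low <= value <= high:
            problems.append(f"{name}={value} is outside [{low}, {high}]")
    return problems


if __name__ == "__main__":
    candidate = {"key": "k", "model_id": "eleven_flash_v2_5", "speed": 1.5}
    for problem in validate_tts_params(candidate):
        print(problem)
```

Returning a list rather than raising on the first failure lets a caller report every problem in a configuration at once.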
For advanced configuration options, voice cloning, and detailed parameter descriptions, see the ElevenLabs TTS documentation.