Skip to main content

ElevenLabs

ElevenLabs provides highly realistic AI voices with advanced prosody and natural speech patterns, delivering lifelike audio synthesis with emotional nuance and conversational flow.

Sample configuration

The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.


_9
"tts": {
_9
"vendor": "elevenlabs",
_9
"params": {
_9
"key": "<your_elevenlabs_key>",
_9
"model_id": "eleven_flash_v2_5",
_9
"voice_id": "pNInz6obpgDQGcFmaJgB",
_9
"sample_rate": 24000
_9
}
_9
}

Key parameters

paramsrequired
  • key stringrequired

    The API key used for authentication. Get your API key from the ElevenLabs Console.

  • model_id stringrequired

    Identifier of the model to be used. Popular options include eleven_flash_v2_5 for speed or eleven_multilingual_v2 for quality.

  • voice_id stringrequired

    The identifier for the selected voice for speech synthesis. Browse available voices in the Voice Library.

  • sample_rate numbernullable

    Default: 24000

    Audio sampling rate in Hz. Common values: 16000, 22050, 24000, 44100.

  • speed numbernullable

    Default: 1.0

    Speed up or slow down the speed of the generated speech. Range 0.7 to 1.2 inclusive.

  • stability numbernullable

    Controls voice stability. Higher values (0.8-1.0) produce more consistent speech, lower values (0.0-0.5) add more variation.

  • similarity_boost numbernullable

    Enhances similarity to the original voice. Range: 0.0-1.0. Higher values stick closer to the training voice.

  • style numbernullable

    Controls speaking style and expressiveness. Higher values increase emotional range and variation.

  • use_speaker_boost booleannullable

    Improves voice quality and similarity when enabled. Recommended for most use cases.

For advanced configuration options, voice cloning, and detailed parameter descriptions, see the ElevenLabs TTS documentation.