
Cartesia (Beta)

Cartesia provides ultra-fast, low-latency text-to-speech with real-time streaming capabilities, optimized for interactive conversational AI applications.

Sample configuration

The following example shows a starting tts configuration that you can use when you start a conversational AI agent.


    "tts": {
      "vendor": "cartesia",
      "params": {
        "api_key": "<your_cartesia_key>",
        "model_id": "sonic-2",
        "voice": {
          "mode": "id",
          "id": "<voice_id>"
        }
      }
    }
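
As a minimal sketch, the tts object above can also be assembled in code before it is placed in the request body. The helper name build_cartesia_tts and the CARTESIA_API_KEY environment variable are assumptions made for illustration; only the field names and the sonic-2 model ID come from the sample configuration.

```python
import json
import os


def build_cartesia_tts(api_key: str, voice_id: str, model_id: str = "sonic-2") -> dict:
    """Return a "tts" object in the shape shown in the sample configuration."""
    return {
        "vendor": "cartesia",
        "params": {
            "api_key": api_key,
            "model_id": model_id,
            # "id" mode selects the voice by its identifier.
            "voice": {"mode": "id", "id": voice_id},
        },
    }


if __name__ == "__main__":
    # CARTESIA_API_KEY is a hypothetical variable name, not part of any API.
    key = os.environ.get("CARTESIA_API_KEY", "<your_cartesia_key>")
    tts = build_cartesia_tts(key, "<voice_id>")
    print(json.dumps({"tts": tts}, indent=2))
```

Reading the key from the environment rather than hard-coding it keeps the secret out of source control; how the resulting object is sent depends on the request you use to start the agent.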

Key parameters

params (required)
  • api_key (string, required)

    The API key used for authentication. Get your API key from the Cartesia Console.

  • model_id (string, required)

    Identifier of the model to be used.

  • voice (object, required)

    Voice configuration object.

    • mode (string, required)

      Voice selection mode. Use id to select by voice identifier.

    • id (string, required)

      The identifier of the selected voice for speech synthesis.
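
The required fields listed above can be checked locally before a request is sent. The following is a rough sketch, not an official validator: the function name validate_cartesia_params and the error messages are hypothetical, and it only checks that each required field is present with the expected basic type.

```python
from typing import List


def validate_cartesia_params(params: dict) -> List[str]:
    """Return a list of problems found in a Cartesia "params" object (empty if none)."""
    problems: List[str] = []

    # Required top-level string fields.
    for key in ("api_key", "model_id"):
        value = params.get(key)
        if not isinstance(value, str) or not value:
            problems.append(f"params.{key} must be a non-empty string")

    # "voice" must be an object that carries "mode" and "id".
    voice = params.get("voice")
    if not isinstance(voice, dict):
        problems.append("params.voice must be an object")
        return problems

    for key in ("mode", "id"):
        value = voice.get(key)
        if not isinstance(value, str) or not value:
            problems.append(f"params.voice.{key} must be a non-empty string")

    return problems
```

For example, calling validate_cartesia_params on a params object that contains only api_key reports the missing model_id and voice entries instead of leaving the failure to surface later in the request.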

For advanced configuration options, voice customization, and detailed parameter descriptions, see the Cartesia TTS documentation.