Hume AI (Beta)

Hume AI provides emotion-aware text-to-speech that generates natural, expressive speech. You can use curated voices from the Voice Library or create your own custom voices for personalized audio experiences.

Sample configuration

The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.

"tts": {
    "vendor": "humeai",
    "params": {
        "key": "<API_KEY>",
        "voice_id": "",
        "provider": "HUME_AI",
        "speed": 1,
        "trailing_silence": 0.35
    }
}

Caution

The parameters listed on this page are validated for use with Conversational AI Engine. Required parameters must be provided as documented. Any additional parameters are passed through directly to the underlying vendor without validation. For a full list of supported options, refer to the Hume AI TTS documentation.
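
As a rough illustration of the distinction described above, the following Python sketch separates the parameters documented on this page from any extras that would be passed through to the vendor unvalidated. The helper name split_params and the key example_extra_option are hypothetical and exist only for this illustration; they are not part of Conversational AI Engine or the Hume AI API.

```python
# Parameters documented on this page; anything else is passed through
# to the vendor without validation.
DOCUMENTED_PARAMS = {"key", "voice_id", "provider", "speed", "trailing_silence"}


def split_params(params):
    """Split a tts params dict into documented and pass-through parts.

    Hypothetical helper for illustration only.
    """
    documented = {k: v for k, v in params.items() if k in DOCUMENTED_PARAMS}
    passthrough = {k: v for k, v in params.items() if k not in DOCUMENTED_PARAMS}
    return documented, passthrough


documented, passthrough = split_params({
    "key": "<API_KEY>",
    "voice_id": "",
    "provider": "HUME_AI",
    "example_extra_option": True,  # hypothetical extra, not a real Hume AI option
})
print("validated:", sorted(documented))
print("passed through unvalidated:", sorted(passthrough))
```

Reviewing the pass-through set before sending a request is one way to catch misspelled parameter names, since those would not be rejected by validation.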

Key parameters

params required

key string required

The API key used for authentication with Hume AI's services. Get your API key from the Hume AI console.

voice_id string required

The identifier of the voice to use for speech synthesis. Choose from the voices available in your Hume AI dashboard.

provider string required

Default: CUSTOM_VOICE

Possible values: HUME_AI, CUSTOM_VOICE

The voice provider type.

"HUME_AI": Use a pre-built voice from Hume's curated Voice Library.
"CUSTOM_VOICE": Use your own custom-trained voice.

speed number nullable

Default: 1

Possible values: 0.25 to 3.0

Controls the playback speed of the generated speech. Higher values increase speech rate.

trailing_silence number nullable

Default: 0.35

Possible values: 0 to 5

Duration of silence (in seconds) to add at the end of each utterance. Useful for natural conversation pacing.
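
The ranges and allowed values listed above can be checked on the client before a request is sent. The following Python sketch builds the tts block under the constraints documented on this page. The function name build_hume_tts is hypothetical, and leaving out nullable fields when they are None is an assumption made for this sketch, not documented behavior.

```python
import json

# Allowed provider values, as listed on this page.
PROVIDERS = ("HUME_AI", "CUSTOM_VOICE")


def build_hume_tts(key, voice_id, provider="CUSTOM_VOICE",
                   speed=1, trailing_silence=0.35):
    """Return a tts config dict after checking the documented constraints.

    Hypothetical helper for illustration only.
    """
    if not key or not voice_id:
        raise ValueError("key and voice_id are required")
    if provider not in PROVIDERS:
        raise ValueError(f"provider must be one of {PROVIDERS}")
    if speed is not None and not 0.25 <= speed <= 3.0:
        raise ValueError("speed must be between 0.25 and 3.0")
    if trailing_silence is not None and not 0 <= trailing_silence <= 5:
        raise ValueError("trailing_silence must be between 0 and 5 seconds")

    params = {"key": key, "voice_id": voice_id, "provider": provider}
    # Assumption: nullable fields are simply left out when set to None.
    if speed is not None:
        params["speed"] = speed
    if trailing_silence is not None:
        params["trailing_silence"] = trailing_silence
    return {"vendor": "humeai", "params": params}


# <API_KEY> and <VOICE_ID> are placeholders; substitute your own values.
tts = build_hume_tts("<API_KEY>", "<VOICE_ID>", provider="HUME_AI")
print(json.dumps({"tts": tts}, indent=4))
```

Failing early with a clear message is a design choice for this sketch; it keeps out-of-range values from reaching the service at all.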
