Hume AI (Beta)
Hume AI provides emotion-aware text-to-speech technology that generates natural, expressive voices with emotional intelligence. The platform offers both curated voices from the Voice Library and support for custom voice creation, enabling personalized audio experiences.
Sample configuration
The following example shows a starting tts
parameter configuration you can use when you Start a conversational AI agent.
Key parameters
paramsrequired
- key stringrequired
The API key used for authentication with Hume AI's services. Get your API key from the Hume AI console.
- voice_id stringrequired
The identifier for the selected voice for speech synthesis. Choose from available voices in your Hume AI dashboard.
- provider stringrequired
Default:
CUSTOM_VOICE
Possible values:
HUME_AI
,CUSTOM_VOICE
The voice provider type.
"HUME_AI"
: Use a pre-built voice from Hume's curated Voice Library."CUSTOM_VOICE"
Use your own custom-trained voice.
- speed numbernullable
Default:
1
Possible values:
0.25 to 3.0
Controls the playback speed of the generated speech. Higher values increase speech rate.
- trailing_silence numbernullable
Default:
0.35
Possible values:
0 to 5
Duration of silence (in seconds) to add at the end of each utterance. Useful for natural conversation pacing.
For advanced configuration options, available voices, and detailed parameter descriptions, see the Hume AI TTS documentation.