Hume AI (Beta)
Hume AI provides emotion-aware text-to-speech technology that generates natural, expressive voices with emotional intelligence. The platform offers both curated voices from the Voice Library and support for custom voice creation, enabling personalized audio experiences.
Sample configuration
The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.
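As a minimal sketch, the configuration can be built as a Python dictionary and serialized to JSON. The parameter names and defaults are taken from the Key parameters section below. The "vendor" field, its value "humeai", the helper name build_hume_tts_config, and the placeholder credentials are assumptions made for illustration; confirm the exact request shape against the agent start API reference.

```python
import json


def build_hume_tts_config(api_key, voice_id, provider="CUSTOM_VOICE",
                          speed=1.0, trailing_silence=0.35):
    """Return a tts block using the parameters documented on this page.

    build_hume_tts_config is a hypothetical helper, not part of any SDK.
    """
    return {
        # Assumption: the vendor identifier is not stated on this page.
        "vendor": "humeai",
        "params": {
            "key": api_key,                        # required
            "voice_id": voice_id,                  # required
            "provider": provider,                  # "HUME_AI" or "CUSTOM_VOICE"
            "speed": speed,                        # 0.25 to 3.0, nullable
            "trailing_silence": trailing_silence,  # 0 to 5 seconds, nullable
        },
    }


# Placeholder values; replace with your own key and voice identifier.
tts = build_hume_tts_config("<your_hume_api_key>", "<your_voice_id>",
                            provider="HUME_AI")
print(json.dumps({"tts": tts}, indent=2))
```

The printed JSON is the fragment you would place in the request body; only the values under params are documented here.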
Key parameters
params (required)
- key (string, required)
The API key used for authentication with Hume AI's services. Get your API key from the Hume AI console.
- voice_id (string, required)
The identifier for the selected voice for speech synthesis. Choose from available voices in your Hume AI dashboard.
- provider (string, required)
Default: CUSTOM_VOICE
Possible values: HUME_AI, CUSTOM_VOICE
The voice provider type.
"HUME_AI": Use a pre-built voice from Hume's curated Voice Library.
"CUSTOM_VOICE": Use your own custom-trained voice.
- speed (number, nullable)
Default: 1
Possible values: 0.25 to 3.0
Controls the playback speed of the generated speech. Higher values increase speech rate.
- trailing_silence (number, nullable)
Default: 0.35
Possible values: 0 to 5
Duration of silence (in seconds) to add at the end of each utterance. Useful for natural conversation pacing.
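The constraints listed above can be checked on the client before a request is sent. The following is a sketch only: validate_hume_params is a hypothetical helper, and it assumes that the documented ranges are inclusive and that null (None) is accepted for the nullable fields. The service may apply its own validation rules.

```python
from numbers import Real

ALLOWED_PROVIDERS = {"HUME_AI", "CUSTOM_VOICE"}

# (name, minimum, maximum); bounds are assumed to be inclusive.
NUMERIC_RANGES = (
    ("speed", 0.25, 3.0),
    ("trailing_silence", 0.0, 5.0),
)


def validate_hume_params(params):
    """Return a list of problems found in a params dict (empty if none)."""
    errors = []

    # key, voice_id, and provider are documented as required strings.
    for name in ("key", "voice_id", "provider"):
        value = params.get(name)
        if not isinstance(value, str) or not value:
            errors.append(f"{name} is required and must be a non-empty string")

    provider = params.get("provider")
    if isinstance(provider, str) and provider and provider not in ALLOWED_PROVIDERS:
        errors.append("provider must be HUME_AI or CUSTOM_VOICE")

    # speed and trailing_silence are nullable numbers with documented ranges.
    for name, low, high in NUMERIC_RANGES:
        value = params.get(name)
        if value is None:
            continue
        if isinstance(value, bool) or not isinstance(value, Real):
            errors.append(f"{name} must be a number or null")
        elif not low <= value <= high:
            errors.append(f"{name} must be between {low} and {high}")

    return errors
```

For example, calling validate_hume_params with a speed of 5 returns a single message reporting that speed is out of range, while a dict containing only valid key, voice_id, and provider values returns an empty list.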
For advanced configuration options, available voices, and detailed parameter descriptions, see the Hume AI TTS documentation.