Microsoft Azure
Microsoft Azure offers neural voices in multiple languages with options for different speaking styles and emotions, providing enterprise-grade text-to-speech capabilities with high-quality audio output.
Sample configuration
The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.
The parameters listed on this page are validated for use with Conversational AI Engine. Required parameters must be provided as documented. Any additional parameters are passed through directly to the underlying vendor without validation. For advanced configuration options, voice galleries, and detailed parameter descriptions, see the Microsoft Azure TTS documentation.
Key parameters
paramsrequired
- key stringrequired
The API key used for authentication. Get your API key from the Azure Portal.
- region stringrequired
The Azure region where the speech service is hosted (For example,
eastus,westus2). - voice_name stringrequired
The identifier for the selected voice for speech synthesis. See available voices for options.
- speed numbernullable
Default:
1.0Speaking rate of the text. Values between
0.5and2.0times the original audio speed. - volume numbernullable
Default:
100Audio volume as a number between
0.0and100.0, where0.0is quietest and100.0is loudest. - sample_rate integernullable
Default:
24000Audio sampling rate in Hz. Common values:
16000,24000,48000.