Microsoft Azure
Microsoft Azure offers neural voices in multiple languages with options for different speaking styles and emotions, providing enterprise-grade text-to-speech capabilities with high-quality audio output.
Sample configuration
The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.
Key parameters
paramsrequired
- key stringrequired
The API key used for authentication. Get your API key from the Azure Portal.
- region stringrequired
The Azure region where the speech service is hosted (For example,
eastus,westus2). - voice_name stringrequired
The identifier for the selected voice for speech synthesis. See available voices for options.
- speed numbernullable
Default:
1.0Speaking rate of the text. Values between
0.5and2.0times the original audio speed. - volume numbernullable
Default:
100Audio volume as a number between
0.0and100.0, where0.0is quietest and100.0is loudest. - sample_rate integernullable
Default:
24000Audio sampling rate in Hz. Common values:
16000,24000,48000.
For advanced configuration options, voice galleries, and detailed parameter descriptions, see the Microsoft Azure TTS documentation.