Skip to main content

Microsoft Azure

Microsoft Azure provides enterprise-grade automatic speech recognition with support for multiple languages and robust noise handling capabilities, optimized for real-time conversational applications.

Sample configuration

The following example shows a starting asr parameter configuration you can use when you Start a conversational AI agent.


_9
"asr": {
_9
"vendor": "microsoft",
_9
"params": {
_9
"key": "<microsoft_key>",
_9
"region": "eastus",
_9
"language": "en-US",
_9
"phrase_list": ["agora", "conversational", "ai", "engine"]
_9
}
_9
}

caution

The parameters listed on this page are validated for use with Conversational AI Engine. Required parameters must be provided as documented. Any additional parameters are passed through directly to the underlying vendor without validation. For a full list of supported options, refer to the Microsoft Azure Speech-to-Text documentation.

Key parameters

paramsrequired
  • key stringrequired

    The API key used for authentication. Get your API key from the Azure Portal.

  • region stringrequired

    The Azure region where the speech service is hosted (For example, eastus, westus2).

  • language stringrequired

    The language code for speech recognition (For example, en-US, es-ES, fr-FR). See supported languages for language codes.

  • phrase_list array[string]nullable

    A list of words or phrases provided in advance to improve recognition accuracy.