Skip to main content

Microsoft Azure

Microsoft Azure provides enterprise-grade automatic speech recognition with support for multiple languages and robust noise handling capabilities, optimized for real-time conversational applications.

Sample configuration

The following example shows a starting asr parameter configuration you can use when you Start a conversational AI agent.


_9
"asr": {
_9
"vendor": "microsoft",
_9
"params": {
_9
"key": "<microsoft_key>",
_9
"region": "eastus",
_9
"language": "en-US",
_9
"phrase_list": ["agora", "conversational", "ai", "engine"]
_9
}
_9
}

Key parameters

paramsrequired
  • key stringrequired

    The API key used for authentication. Get your API key from the Azure Portal.

  • region stringrequired

    The Azure region where the speech service is hosted (For example, eastus, westus2).

  • language stringrequired

    The language code for speech recognition (For example, en-US, es-ES, fr-FR). See supported languages for language codes.

  • phrase_list array[string]nullable

    A list of words or phrases provided in advance to improve recognition accuracy.

For detailed parameter descriptions, see the Microsoft Azure Speech-to-Text documentation.