Skip to main content

Google (Beta)

Google provides advanced automatic speech recognition with high accuracy and support for multiple languages, designed for real-time conversational AI applications.

Sample configuration

The following example shows a starting asr parameter configuration you can use when you Start a conversational AI agent.


_10
"asr": {
_10
"vendor": "google",
_10
"params": {
_10
"project_id": "<GOOGLE_ASR_PROJECT_ID>",
_10
"location": "global",
_10
"adc_credentials_string": "<GOOGLE_APPLICATION_CREDENTIALS_STRING>",
_10
"language": "en-US",
_10
"model": "long"
_10
}
_10
}

caution

The parameters listed on this page are validated for use with Conversational AI Engine. To avoid unpredictable behavior, Agora strongly recommends using only the supported parameters. For a complete reference, consult the Google official documentation.

Key parameters

paramsrequired
  • project_id stringrequired

    The Google Cloud project ID where the Speech-to-Text API is enabled. Get your project ID from the Google Cloud Console.

  • location stringrequired

    The Google Cloud region where the speech service is hosted, for example, global, us-central1, or europe-west1.

  • adc_credentials_string stringrequired

    The Google Cloud service account credentials JSON string used for authentication. Get your credentials from the Google Cloud Console.

  • language stringrequired

    The language code for speech recognition, for example, en-US, es-ES, or fr-FR. See supported languages for available language codes.

  • model stringnullable

    The recognition model to use.