Skip to main content

Google (Beta)

Google provides fast, reliable text-to-speech (TTS) with customizable voices.

Sample configuration

The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.


_13
"tts": {
_13
"vendor": "google",
_13
"params": {
_13
"credentials": "<GOOGLE_APPLICATION_CREDENTIALS_STRING>",
_13
"VoiceSelectionParams": {
_13
"name": "en-US-Chirp3-HD-Charon"
_13
},
_13
"AudioConfig": {
_13
"speaking_rate": 1.0,
_13
"sample_rate_hertz": 24000
_13
}
_13
}
_13
}

caution

The parameters listed on this page are validated for use with Conversational AI Engine. To avoid unpredictable behavior, Agora strongly recommends using only the supported parameters. For a complete reference, consult the Google documentation.

Key parameters

paramsrequired
  • credentials stringrequired

    The Google Cloud service account credentials JSON string used for authentication. Get your credentials from the Google Cloud Console.

  • VoiceSelectionParams objectrequired
    Show propertiesHide properties
    • name stringrequired

      The name of the voice to use, for example, en-US-Chirp3-HD-Charon or en-US-Neural2-A. See supported voices for available voice names.

  • AudioConfig objectnullable
    Show propertiesHide properties
    • speaking_rate numbernullable

      The speed of speech. Valid range is 0.25 to 2.0, where 1.0 is the normal speed. Values less than 1.0 slow down the speech, while values greater than 1.0 speed it up.

    • sample_rate_hertz integernullable

      The sample rate in Hertz for the audio output, for example, 24000 or 16000. The default value depends on the selected voice.