Sarvam (Beta)
Sarvam provides fast, reliable text-to-speech (TTS) with support for Indian languages and customizable voices.
Sample configuration
The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.
The parameters listed on this page are validated for use with Conversational AI Engine. To avoid unpredictable behavior, Agora strongly recommends using only the supported parameters. For a complete reference, consult the Sarvam documentation.
Key parameters
paramsrequired
- api_subscription_key stringrequired
The API key used for authentication. Get your API key from the Sarvam dashboard.
- speaker stringrequired
The voice ID to use for speech generation.
- Female voices:
anushka,manisha,vidya,arya. - Male voices:
abhilash,karun,hitesh.
- Female voices:
- target_language_code stringrequired
The language code for speech synthesis. Supported values:
en-INEnglish (India)hi-IN: Hindibn-IN: Bengalita-IN: Tamilte-IN: Telugukn-IN: Kannadaml-IN: Malayalammr-IN: Marathigu-IN: Gujaratipa-IN: Punjabior-IN: Odia
- pitch numbernullable
Range:
[-0.75, 0.75]The pitch adjustment for the voice. Positive values make the voice sharper; negative values make it deeper.
- pace numbernullable
Range:
[0.3, 3.0]The speed of speech.
1.0is the normal speed. Values less than1.0slow down the speech; values greater than1.0speed it up. - loudness numbernullable
Range:
[0.1, 3.0]The volume level of the speech. Higher values increase the loudness.
- sample_rate numbernullable
Possible values:
8000,16000,22050,24000The audio sample rate in Hz.