Sarvam (Beta)
Sarvam provides fast, reliable text-to-speech (TTS) with support for Indian languages and customizable voices.
Sample configuration
The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.
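As a minimal sketch, the configuration can be assembled as a Python dictionary and serialized to JSON. The `params` keys are the ones documented on this page. The `vendor` field and its value `"sarvam"`, the helper name `build_sarvam_tts`, and the placeholder key are assumptions made for illustration; check them against the request schema you are using.

```python
import json


def build_sarvam_tts(api_key: str) -> dict:
    """Build a starting `tts` block (hypothetical helper, not part of any SDK)."""
    return {
        "vendor": "sarvam",  # assumed vendor identifier
        "params": {
            "api_subscription_key": api_key,  # required
            "speaker": "anushka",             # required, one of the listed voices
            "target_language_code": "en-IN",  # required
            "pitch": 0.0,                     # optional, within [-0.75, 0.75]
            "pace": 1.0,                      # optional, 1.0 is normal speed
            "loudness": 1.0,                  # optional, within [0.1, 3.0]
            "sample_rate": 24000,             # optional, one of the listed rates
        },
    }


# Placeholder key; replace with the key from your Sarvam dashboard.
tts_config = build_sarvam_tts("<your_sarvam_api_key>")
print(json.dumps({"tts": tts_config}, indent=2))
```

Only the three required parameters need to be present; the optional ones are included here to show where they would go.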
The parameters listed on this page are validated for use with Conversational AI Engine. Required parameters must be provided as documented. Any additional parameters are passed through directly to the underlying vendor without validation. For a full list of supported options, refer to the Sarvam documentation.
Key parameters
params required
- api_subscription_key string required
The API key used for authentication. Get your API key from the Sarvam dashboard.
- speaker string required
The voice ID to use for speech generation.
  - Female voices: anushka, manisha, vidya, arya.
  - Male voices: abhilash, karun, hitesh.
- target_language_code string required
The language code for speech synthesis. Supported values:
  - en-IN: English (India)
  - hi-IN: Hindi
  - bn-IN: Bengali
  - ta-IN: Tamil
  - te-IN: Telugu
  - kn-IN: Kannada
  - ml-IN: Malayalam
  - mr-IN: Marathi
  - gu-IN: Gujarati
  - pa-IN: Punjabi
  - or-IN: Odia
- pitch number nullable
Range: [-0.75, 0.75]
The pitch adjustment for the voice. Positive values make the voice sharper; negative values make it deeper.
- pace number nullable
Range: [0.3, 3.0]
The speed of speech. 1.0 is the normal speed. Values less than 1.0 slow down the speech; values greater than 1.0 speed it up.
- loudness number nullable
Range: [0.1, 3.0]
The volume level of the speech. Higher values increase the loudness.
- sample_rate number nullable
Possible values: 8000, 16000, 22050, 24000
The audio sample rate in Hz.
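The constraints listed above can be checked before a request is sent. The following is a sketch under stated assumptions: the function name `validate_sarvam_params` is hypothetical, the ranges are treated as inclusive, and the voice and language lists on this page are treated as exhaustive, which the vendor may not guarantee.

```python
from typing import Any

# Values copied from the parameter list on this page.
REQUIRED = ("api_subscription_key", "speaker", "target_language_code")
SPEAKERS = {"anushka", "manisha", "vidya", "arya", "abhilash", "karun", "hitesh"}
LANGUAGE_CODES = {
    "en-IN", "hi-IN", "bn-IN", "ta-IN", "te-IN", "kn-IN",
    "ml-IN", "mr-IN", "gu-IN", "pa-IN", "or-IN",
}
RANGES = {"pitch": (-0.75, 0.75), "pace": (0.3, 3.0), "loudness": (0.1, 3.0)}
SAMPLE_RATES = {8000, 16000, 22050, 24000}


def validate_sarvam_params(params: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means every check passed."""
    errors = []

    # Required parameters must be present and non-empty.
    for key in REQUIRED:
        if not params.get(key):
            errors.append(f"missing required parameter: {key}")

    # Assumption: the documented lists are the only accepted values.
    speaker = params.get("speaker")
    if speaker and speaker not in SPEAKERS:
        errors.append(f"unknown speaker: {speaker}")
    language = params.get("target_language_code")
    if language and language not in LANGUAGE_CODES:
        errors.append(f"unsupported target_language_code: {language}")

    # Nullable numeric parameters: None (or absent) is accepted as-is.
    for key, (low, high) in RANGES.items():
        value = params.get(key)
        if value is not None and not (low <= value <= high):
            errors.append(f"{key} must be within [{low}, {high}]")

    rate = params.get("sample_rate")
    if rate is not None and rate not in SAMPLE_RATES:
        errors.append(f"sample_rate must be one of {sorted(SAMPLE_RATES)}")

    return errors


# Usage: a pitch outside the documented range is reported as a problem.
problems = validate_sarvam_params({
    "api_subscription_key": "<your_sarvam_api_key>",
    "speaker": "karun",
    "target_language_code": "ta-IN",
    "pitch": 1.0,
})
print(problems)
```

Returning a list of messages instead of raising on the first failure lets a caller report every problem in one pass.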