Deepgram
Deepgram provides fast, accurate automatic speech recognition with advanced AI models optimized for real-time streaming and conversational applications across multiple languages.
Use a preset
To use Deepgram ASR with an Agora managed key, specify one of the following presets in the preset field when starting an agent:
- deepgram_nova_2
- deepgram_nova_3
When using a preset, you do not need to provide the API key, endpoint URL, or model. You can still use the asr field to configure additional settings such as language and keyterm. To provide your own API key, see Sample configuration.
Sample configuration
The following example shows a starting asr parameter configuration you can use when you Start a conversational AI agent.
- Use a preset model
- Use your own API key (BYOK)
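As a rough illustration of the two options above, the following Python sketch builds one request fragment for each case. The parameter names under params (url, key, model, language) and the preset names come from this page. The vendor field, the top-level placement of preset next to asr, the helper names, and the placeholder key are assumptions made for illustration; check the Start a conversational AI agent reference for the exact request shape.

```python
import json

# Preset names listed on this page.
DEEPGRAM_PRESETS = ("deepgram_nova_2", "deepgram_nova_3")

# Deepgram's public streaming endpoint; override it if your account uses another URL.
DEEPGRAM_STREAMING_URL = "wss://api.deepgram.com/v1/listen"


def preset_config(preset, language="en"):
    """Fragment for an Agora managed key: no key, url, or model is supplied."""
    if preset not in DEEPGRAM_PRESETS:
        raise ValueError(f"unknown Deepgram preset: {preset}")
    # Assumption: `preset` sits beside `asr` in the request properties.
    return {"preset": preset, "asr": {"params": {"language": language}}}


def byok_config(api_key, model="nova-3", language="en"):
    """Fragment for bring-your-own-key: key is required, the rest is optional."""
    return {
        "asr": {
            "vendor": "deepgram",  # Assumption: vendor selector field name.
            "params": {
                "url": DEEPGRAM_STREAMING_URL,
                "key": api_key,
                "model": model,
                "language": language,
            },
        }
    }


if __name__ == "__main__":
    print(json.dumps(preset_config("deepgram_nova_3"), indent=2))
    print(json.dumps(byok_config("<your_deepgram_api_key>"), indent=2))
```

Either fragment would be merged into the rest of the agent properties before the start request is sent.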
Key parameters
params required
- url string nullable
The WebSocket URL for Deepgram's streaming API.
- key string required
The API key used for authentication. Get your API key from the Deepgram Console.
- model string nullable
The speech recognition model to use.
- language string nullable
The language code for speech recognition (for example, en, es, fr). See supported languages for language codes.
- keyterm string nullable
Boost specialized terms and brands. Only compatible with the nova-3 model.
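The keyterm restriction can be checked on the client before a request is sent. The helper below is a hypothetical sketch for configurations that set model explicitly; it assumes that any model name beginning with nova-3 qualifies, which this page does not confirm. It treats a missing model as incompatible, so it is not suited to preset-based requests where the model is implied.

```python
def keyterm_allowed(params):
    """Return True if keyterm is unset, or if the model name starts with nova-3."""
    if not params.get("keyterm"):
        return True
    # model is nullable, so fall back to an empty string when it is absent.
    model = params.get("model") or ""
    return model.startswith("nova-3")
```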
The parameters listed on this page are validated for use with Conversational AI Engine. Required parameters must be provided as documented. Any additional parameters are passed through directly to the underlying vendor without validation. For advanced configuration options, model selection, and detailed parameter descriptions, see the Deepgram API documentation.
The following parameters are not passed through and instead use default values set by Agora:
- callback
- callback_method
- channels
- encoding
- multichannel
- sample_rate
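Because these parameters are replaced by Agora defaults, it can help to flag them in a configuration before sending it. The following sketch is one way to do that; the helper name and the example input are hypothetical, and smart_format stands in for any additional Deepgram parameter that would be passed through.

```python
# Parameters this page lists as set by Agora rather than passed through.
AGORA_MANAGED_PARAMS = frozenset(
    {"callback", "callback_method", "channels", "encoding", "multichannel", "sample_rate"}
)


def split_params(params):
    """Separate parameters that reach Deepgram from those Agora overrides."""
    passed = {k: v for k, v in params.items() if k not in AGORA_MANAGED_PARAMS}
    overridden = sorted(k for k in params if k in AGORA_MANAGED_PARAMS)
    return passed, overridden


if __name__ == "__main__":
    passed, overridden = split_params(
        {"language": "en", "smart_format": True, "sample_rate": 16000}
    )
    print("passed through:", passed)
    print("overridden by Agora defaults:", overridden)
```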