
Release notes

This document tracks important changes and improvements to Real-Time STT.

Releases

v7.2.1

Released on April 20, 2026

New Features

Included in this release:

  • Support for caption storage

    This version adds support for writing recorded subtitles to standard S3-compatible object storage. To connect to an S3-compatible storage service such as MinIO, or to some self-hosted object storage systems, configure parameters such as endpoint, type, and provider in captionConfig.extensionParams of the transcription request. See Record captions for details.

  • Agent list API

    This version adds a List Real-time STT agents API. Use it to retrieve real-time transcription and translation tasks that meet specified criteria by channel, time range, and task status.
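
As a rough illustration of the caption storage settings described above, the sketch below assembles a request-body fragment containing captionConfig.extensionParams. Only the names captionConfig, extensionParams, endpoint, type, and provider come from these notes. The helper name, the value formats, and the example values are assumptions, so check Record captions for the actual schema before relying on it.

```python
from typing import Any, Dict


def build_caption_config(endpoint: str, storage_type: str, provider: str) -> Dict[str, Any]:
    """Return a request-body fragment with S3-compatible storage settings.

    Hypothetical helper: the value formats are assumptions, not the
    documented schema.
    """
    fields = (("endpoint", endpoint), ("type", storage_type), ("provider", provider))
    for name, value in fields:
        if not value:
            raise ValueError(f"{name} must be a non-empty string")
    return {
        "captionConfig": {
            "extensionParams": {
                "endpoint": endpoint,
                "type": storage_type,
                "provider": provider,
            }
        }
    }


# Placeholder values for a self-hosted MinIO deployment (assumed, not documented).
fragment = build_caption_config("https://minio.example.internal:9000", "s3", "minio")
```

The fragment would be merged into the rest of the transcription request body. Other parameters that the storage service may need, such as credentials or a bucket name, are not listed in these notes and are left out here.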

v7.2

Released on March 30, 2026

New Features

Included in this release:

  • Keywords functionality

    The transcription API now supports configuring keywords to improve the recognition accuracy of specific words such as proper nouns and industry terms. You can configure up to 500 keywords per request. To specify keywords, pass the list in the keywords parameter of the request body when creating a transcription task or real-time transcription service.

  • Unique sentence_id

    Each subtitle in the transcription data now carries a unique identifier, sentence_id. When both original and translated subtitles are enabled, use this ID to match the original and translated subtitles for the same sentence so that they stay aligned.
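
The two features above could be used along the following lines. This is a minimal sketch, not the service's client code: the keywords field name, the 500-keyword limit, and sentence_id come from these notes, while the helper names and the subtitle shape (a dict with sentence_id and text keys) are assumptions made for illustration.

```python
from typing import Any, Dict, Iterable, List, Optional, Tuple

MAX_KEYWORDS = 500  # per-request limit stated in these release notes


def with_keywords(body: Dict[str, Any], keywords: Iterable[str]) -> Dict[str, Any]:
    """Return a copy of the request body with a `keywords` list attached."""
    items = list(keywords)
    if len(items) > MAX_KEYWORDS:
        raise ValueError(f"at most {MAX_KEYWORDS} keywords are allowed, got {len(items)}")
    return {**body, "keywords": items}


def pair_by_sentence_id(
    originals: List[Dict[str, str]],
    translations: List[Dict[str, str]],
) -> List[Tuple[str, Optional[str]]]:
    """Pair each original subtitle with its translation via sentence_id.

    Assumed shape: each subtitle is a dict with `sentence_id` and `text` keys.
    An original with no matching translation is paired with None.
    """
    translated = {t["sentence_id"]: t["text"] for t in translations}
    return [(o["text"], translated.get(o["sentence_id"])) for o in originals]


body = with_keywords({}, ["MinIO", "WebRTC"])
pairs = pair_by_sentence_id(
    [{"sentence_id": "a1", "text": "Hello"}, {"sentence_id": "a2", "text": "Goodbye"}],
    [{"sentence_id": "a1", "text": "Hola"}],
)
```

Checking the keyword count locally avoids sending a request that exceeds the stated limit. Looking translations up by sentence_id, rather than by arrival order, keeps the pairing correct even if a translation arrives late or not at all.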

v7.0

Released on May 1, 2025

Improvements

This release includes the following enhancements:

  • Supports updating the configuration of the STT agent.
  • Enhances the scalability of STT by supporting multiple endpoints.
  • Deprecates some APIs. Refer to the API reference for details.