Custom audio source

The default audio module in Video SDK covers the basic audio functions in your app. For advanced audio functions, Video SDK supports custom audio sources and custom audio rendering modules.

By default, Video SDK uses the built-in audio module of the device your app runs on. However, there are use cases where you want to integrate a custom audio source into your app, such as:

Your app has its own audio module.
You need to process the captured audio with a pre-processing library for audio enhancement.
You need flexible device resource allocation to avoid conflicts with other services.

This page shows you how to capture and render audio from custom sources.

Understand the tech

To set an external audio source, you configure the Agora Engine before joining a channel. To manage the capture and processing of audio frames, you use methods from outside the Video SDK that are specific to your custom source. Video SDK enables you to push processed audio data to the subscribers in a channel.

Custom audio capture

The following figure illustrates the process of custom audio capture.

[Figure: Audio data transmission]

You implement the capture module using methods from outside the SDK that are specific to your custom source.
You call pushExternalAudioFrame to send the captured audio frames to the SDK.

Custom audio rendering

The following figure illustrates the process of custom audio rendering.

[Figure: Audio data transmission]

You implement the rendering module using methods from outside the SDK that are specific to your custom renderer.
You call pullPlaybackAudioFrame to retrieve the audio data sent by remote users.

Prerequisites

Ensure that you have implemented the SDK quickstart in your project.

Implementation

This section shows you how to implement custom audio capture and render audio from a custom source.

Custom audio capture

Refer to the following call sequence diagram to implement custom audio capture in your app:

[Figure: Custom audio capture process]

Follow these steps to implement custom audio capture in your project:

After initializing RtcEngine, call createCustomAudioTrack to create a custom audio track and obtain the audio track ID.

Java

AudioTrackConfig config = new AudioTrackConfig();
config.enableLocalPlayback = false;
customAudioTrack = engine.createCustomAudioTrack(Constants.AudioTrackType.AUDIO_TRACK_MIXABLE, config);

Kotlin

val config = AudioTrackConfig().apply {
    enableLocalPlayback = false
}
customAudioTrack = engine.createCustomAudioTrack(Constants.AudioTrackType.AUDIO_TRACK_MIXABLE, config)

Call joinChannel to join the channel. In ChannelMediaOptions, set publishCustomAudioTrackId to the audio track ID obtained in step 1, and set publishCustomAudioTrack to true to publish the custom audio track.

Information

To use enableCustomAudioLocalPlayback for local playback of an external audio source, or to adjust the volume of a custom audio track with adjustCustomAudioPlayoutVolume, set enableAudioRecordingOrPlayout to true in ChannelMediaOptions.

Java

ChannelMediaOptions option = new ChannelMediaOptions();
option.clientRoleType = Constants.CLIENT_ROLE_BROADCASTER;
option.autoSubscribeAudio = true;
option.autoSubscribeVideo = true;
// In the custom audio capture use case, do not publish the audio captured by the microphone
option.publishMicrophoneTrack = false;
// Publish the custom audio track
option.publishCustomAudioTrack = true;
// Set the custom audio track ID
option.publishCustomAudioTrackId = customAudioTrack;
// Join the channel
int res = engine.joinChannel(accessToken, channelId, 0, option);

Kotlin

val option = ChannelMediaOptions().apply {
    clientRoleType = Constants.CLIENT_ROLE_BROADCASTER
    autoSubscribeAudio = true
    autoSubscribeVideo = true
    // In the custom audio capture use case, do not publish the audio captured by the microphone
    publishMicrophoneTrack = false
    // Publish the custom audio track
    publishCustomAudioTrack = true
    // Set the custom audio track ID
    publishCustomAudioTrackId = customAudioTrack
}
// Join the channel
val res = engine.joinChannel(accessToken, channelId, 0, option)

Agora provides the AudioFileReader.java sample to demonstrate how to read and publish PCM-format audio data from a local file. In a production environment, you create a custom audio capture module based on your business needs.

Call pushExternalAudioFrame to send the captured audio frame to the SDK through the custom audio track. Ensure that the trackId matches the audio track ID you obtained by calling createCustomAudioTrack. Set sampleRate, channels, and bytesPerSample to define the sampling rate, number of channels, and bytes per sample of the external audio frame.

Information

For audio and video synchronization, Agora recommends calling getCurrentMonotonicTimeInMs to get the system’s current monotonic time and setting the timestamp accordingly.

Java

audioPushingHelper = new AudioFileReader(requireContext(), (buffer, timestamp) -> {
    if (joined && engine != null && customAudioTrack != -1) {
        // Push external audio frames to the SDK
        int ret = engine.pushExternalAudioFrame(buffer, timestamp,
                AudioFileReader.SAMPLE_RATE,
                AudioFileReader.SAMPLE_NUM_OF_CHANNEL,
                Constants.BytesPerSample.TWO_BYTES_PER_SAMPLE,
                customAudioTrack);
        Log.i(TAG, "pushExternalAudioFrame times:" + (++pushTimes) + ", ret=" + ret);
    }
});

Kotlin

audioPushingHelper = AudioFileReader(requireContext()) { buffer, timestamp ->
    if (joined && engine != null && customAudioTrack != -1) {
        // Push external audio frames to the SDK
        val ret = engine.pushExternalAudioFrame(
            buffer, timestamp,
            AudioFileReader.SAMPLE_RATE,
            AudioFileReader.SAMPLE_NUM_OF_CHANNEL,
            Constants.BytesPerSample.TWO_BYTES_PER_SAMPLE,
            customAudioTrack
        )
        Log.i(TAG, "pushExternalAudioFrame times: ${++pushTimes}, ret=$ret")
    }
}
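A custom capture module has to hand the SDK frames whose size agrees with the sampleRate, channels, and bytesPerSample values it declares. The following is a minimal sketch of that arithmetic, not part of Video SDK: the class PcmFramer and its methods bytesPerFrame and split are hypothetical names, and zero-padding the final partial frame with silence is an assumed design choice.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper (not an SDK class): cuts raw PCM into fixed-duration frames. */
final class PcmFramer {
    private PcmFramer() {}

    /** Bytes in one frame = samples per frame * channels * bytes per sample. */
    static int bytesPerFrame(int sampleRate, int channels, int bytesPerSample, int frameMs) {
        if (sampleRate <= 0 || channels <= 0 || bytesPerSample <= 0 || frameMs <= 0) {
            throw new IllegalArgumentException("all arguments must be positive");
        }
        // Multiply before dividing so 44100 Hz is not truncated to 44 samples per millisecond.
        int samplesPerFrame = sampleRate * frameMs / 1000;
        return samplesPerFrame * channels * bytesPerSample;
    }

    /** Splits pcm into frames of frameBytes; a trailing partial frame is zero-padded (silence). */
    static List<byte[]> split(byte[] pcm, int frameBytes) {
        if (frameBytes <= 0) {
            throw new IllegalArgumentException("frameBytes must be positive");
        }
        List<byte[]> frames = new ArrayList<>();
        for (int offset = 0; offset < pcm.length; offset += frameBytes) {
            byte[] frame = new byte[frameBytes]; // new arrays are zero-filled
            int length = Math.min(frameBytes, pcm.length - offset);
            System.arraycopy(pcm, offset, frame, 0, length);
            frames.add(frame);
        }
        return frames;
    }
}
```

For example, a 10 ms frame of 48 kHz, mono, 16-bit audio is 960 bytes, and the same frame at 44.1 kHz is 882 bytes. Each frame returned by split could then be passed to pushExternalAudioFrame as in the snippets above.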

To stop publishing custom audio, call destroyCustomAudioTrack to destroy the custom audio track.

Java

// Destroy the custom audio track
engine.destroyCustomAudioTrack(customAudioTrack);

Kotlin

// Destroy the custom audio track
engine.destroyCustomAudioTrack(customAudioTrack)

Custom audio rendering

This section shows you how to implement custom audio rendering. Refer to the following call sequence diagram to implement custom audio rendering in your app:

[Figure: Custom audio rendering workflow]

To implement custom audio rendering, use the following methods:

Before calling joinChannel, use setExternalAudioSink to enable and configure custom audio rendering.

Java

rtcEngine.setExternalAudioSink(
    true,      // Enable custom audio rendering
    44100,     // Sampling rate (Hz). Set this value to 16000, 32000, 44100, or 48000
    1          // Number of channels for the custom audio source. Set this value to 1 or 2
);

Kotlin

rtcEngine.setExternalAudioSink(
    true,      // Enable custom audio rendering
    44100,     // Sampling rate (Hz). Set this value to 16000, 32000, 44100, or 48000
    1          // Number of channels for the custom audio source. Set this value to 1 or 2
)

After joining the channel, call pullPlaybackAudioFrame to get audio data sent by remote users. Use your own audio renderer to process the audio data and then play the rendered data.

Java

private class FileThread implements Runnable {
    @Override
    public void run() {
        while (mPull) {
            // Bytes in a 10 ms frame at 48 kHz, 2 bytes per sample, 1 channel.
            // The rate and channel count are assumed to match the values passed to setExternalAudioSink.
            int lengthInByte = 48000 / 1000 * 2 * 1 * 10;
            ByteBuffer frame = ByteBuffer.allocateDirect(lengthInByte);
            int ret = engine.pullPlaybackAudioFrame(frame, lengthInByte);
            byte[] data = new byte[frame.remaining()];
            frame.get(data, 0, data.length);
            // Write to a local file or render using a player
            FileIOUtils.writeFileFromBytesByChannel("/sdcard/agora/pull_48k.pcm", data, true, true);
            try {
                Thread.sleep(10);
            } catch (InterruptedException e) {
                e.printStackTrace();
            }
        }
    }
}

Kotlin

// Declared as an inner class so it can access engine and mPull of the enclosing class
private inner class FileThread : Runnable {
    override fun run() {
        while (mPull) {
            // Bytes in a 10 ms frame at 48 kHz, 2 bytes per sample, 1 channel.
            // The rate and channel count are assumed to match the values passed to setExternalAudioSink.
            val lengthInByte = 48000 / 1000 * 2 * 1 * 10
            val frame = ByteBuffer.allocateDirect(lengthInByte)
            val ret = engine.pullPlaybackAudioFrame(frame, lengthInByte)
            val data = ByteArray(frame.remaining())
            frame.get(data, 0, data.size)
            // Write to a local file or render using a player
            FileIOUtils.writeFileFromBytesByChannel("/sdcard/agora/pull_48k.pcm", data, true, true)
            try {
                Thread.sleep(10)
            } catch (e: InterruptedException) {
                e.printStackTrace()
            }
        }
    }
}
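The loop above sleeps a fixed 10 ms after each pull, so every iteration takes 10 ms plus the time spent pulling and writing, and the loop can gradually fall behind real time. One possible refinement is to sleep until a deadline computed from the start time instead. The sketch below only does the timing arithmetic. PullPacer and nextDelayNanos are hypothetical names, not SDK APIs, and whether your renderer needs this depends on how it buffers audio.

```java
/** Hypothetical helper (not an SDK class): deadline-based pacing for a pull loop. */
final class PullPacer {
    private final long startNanos;
    private final long frameNanos;
    private long framesPulled = 0;

    /**
     * @param startNanos monotonic time at which pulling started, for example System.nanoTime()
     * @param frameMs    duration of each pulled frame in milliseconds
     */
    PullPacer(long startNanos, int frameMs) {
        if (frameMs <= 0) {
            throw new IllegalArgumentException("frameMs must be positive");
        }
        this.startNanos = startNanos;
        this.frameNanos = frameMs * 1_000_000L;
    }

    /**
     * Records that one frame was pulled and returns how many nanoseconds to wait
     * before the next frame is due. Returns 0 when the loop is already late.
     */
    long nextDelayNanos(long nowNanos) {
        framesPulled++;
        long deadline = startNanos + framesPulled * frameNanos;
        return Math.max(0L, deadline - nowNanos);
    }
}
```

In the pull loop, you would create the pacer once before the loop and replace the fixed sleep with a wait of pacer.nextDelayNanos(System.nanoTime()) nanoseconds, for example through TimeUnit.NANOSECONDS.sleep.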

Using raw audio data callback

This section explains how to implement custom audio rendering using raw audio data callbacks instead of pulling audio frames.

To retrieve audio data for playback, implement collection and processing of raw audio data. Refer to Raw audio processing.

Follow these steps to call the raw audio data API in your project for custom audio rendering:

Retrieve audio data for playback using the onRecordAudioFrame, onPlaybackAudioFrame, onMixedAudioFrame, or onPlaybackAudioFrameBeforeMixing callback.
Independently render and play the audio data.
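The two steps above usually run on different threads: the callback delivers frames, and your renderer plays them. A minimal sketch of one way to hand frames between the two follows. PlaybackFrameQueue is a hypothetical class, not part of Video SDK. It assumes a single callback thread, assumes that the buffer passed to the callback may be reused after the callback returns (hence the copy), and chooses to drop the oldest frame when the renderer falls behind.

```java
import java.nio.ByteBuffer;
import java.util.concurrent.ArrayBlockingQueue;

/** Hypothetical helper (not an SDK class): bounded hand-off from audio callback to renderer. */
final class PlaybackFrameQueue {
    private final ArrayBlockingQueue<byte[]> queue;

    PlaybackFrameQueue(int capacityFrames) {
        this.queue = new ArrayBlockingQueue<>(capacityFrames);
    }

    /** Call from the audio callback. Copies the frame so the source buffer can be reused. */
    void offer(ByteBuffer frame) {
        byte[] copy = new byte[frame.remaining()];
        // duplicate() shares the content but leaves the caller's position untouched.
        frame.duplicate().get(copy);
        while (!queue.offer(copy)) {
            queue.poll(); // Queue is full: discard the oldest frame to bound latency.
        }
    }

    /** Call from the render thread. Returns the next frame, or null if none is queued. */
    byte[] poll() {
        return queue.poll();
    }
}
```

Dropping the oldest frame keeps playback latency bounded at the cost of an audible gap. A renderer that prefers completeness over latency could block the producer or grow the queue instead.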
