Audio and video technologies, have advanced rapidly over the last decade. Narrowband (8kHz sampling) audio was the most common bandwidth for audio. With the increased importance of audio conferencing for remote work environments, it is desirable to achieve the best audio quality possible. The transition from PSTNs to high-speed mobile wireless networks (4G, LTE, and 5G) has made tele-work possible. The evolution of the audio equipment in mobile devices and their networks allow for the utilization of wideband (16kHz sampling) audio to fullband (48kHz sampling) audio.
Despite low-cost embedded devices having more CPU speed than ever before, engineers are always looking to minimize power consumption while looking to add features that would make their product standout. One of the biggest resource consumers in the full-duplex audio communication realm is acoustic echo cancellation (AEC). AEC require long adaptive filters, and increasing the sampling rate of audio causes a double hit complexity. The complexity not only increases because you have to process more samples per second, but because the filter length has to increase to achieve the same tail length in time.
The table below shows number of multiplications per second (MPS) required for a time domain FIR and NLMS filter update. As it can be seen observed, to increase the sampling rate from 8kHz to 48kHz the number of MPS increases 36-fold.
|Sampling Rate||Tail Length||FIR Multiplies Per Second||NLMS Update Multiplies Per Second|
Time-domain multiplies for AEC
All hope is not lost. As discussed in Subband Adaptive Filtering, the computational resources can be significantly reduced by transforming from the time domain to the subband domain to perform the AEC filtering. The table below shows that the MPS for 128ms tail length for various sampling rates and subband decimation factors.
|Sampling Rate||# of Subbands||FIR Multiplies Per Second||NLMS Update Multiplies Per Second|
Subband domain multiplies for AEC
For 48kHz with 256 subbands, the MPS is reduced nearly 64 times over compared to the time domain implementation. There is an overhead to transform the data to and from the subband domain, but FFT’s can be heavily optimized and some DSPs have hardware FFT accelerators. In addition to the subband optimization, higher sampling rates can take advantage of the fact that higher frequencies are more easily absorb and therefore require shorter echo tails at higher frequencies. This characteristic of acoustics critically important for professional audio-conferencing applications where high-resolution audio (96kHz sampling) is used.
At VOCAL Technologies, our wideband, super-wideband, fullband, high-resolution AEC is efficiently implemented in the subband domain and optimized for a wide variety of embedded DSPs. Please contact us to learn more.