Call Us Today: +1-716-688-4675

The Combined Reduction of Echo and Noise

The combined reduction of echo and noise allows more efficient algorithms to be used in hands free communications devices. The two main acoustic signal processing challenges with hands-free devices are feedback of the loudspeaker signal reflected through the room to the microphone and the low signal-to-noise ratio of the near-end speaker. The first requires an echo control system to be in place to ensure the echoes are not of a disturbance to the far-end user. The second is due to the increased distance between speaker and microphone. A combined approach to address these challenges yields equivalent system performance compared to applying separate algorithms for each problem.

acoustic echo cancellation | VOCAL.comHands-Free systems must address room reflections of loudspeaker signal and low SNR of near-end speaker

It is more beneficial to the overall performance of the system if the echo canceller comes before the echo and noise reduction system. The main disadvantage of the echo canceller preceding the combined reduction system is that the adaptive filter of the acoustic echo cancellation (AEC) system has to process noisy signals, which puts a theoretical bound on the achievable attenuation of the echo canceller. In most scenarios, this disadvantage is outweighed by the fact that placing a noise reduction filter before the echo canceller adds variability to the echo path which significantly limits the ability of the adaptive filter to train to the echo path. An additional advantage to place the echo canceller before the combined reduction system is that the level of the echo (i.e. non-stationary noise sources) is greatly reduced.

In the combined echo and noise reduction system, the signal model is y(n) = s(n) + b(n) + d(n), where y(n) is the microphone signal after the echo canceller, s(n) is the desired near-end speech, b(n) is the additive noise signal, and d(n) is the residual echo signal at time instance n. The goal is to design an adaptive filter such that Ŝ(ω,n) = Hc(ω,n) Y(ω,n)). In Post Filtering for Residual Echo Control, it was shown that the attenuation factor for residual echo was

Hres(ω,n) = 1 - Ŝd(ω,n)
Sy(ω,n)
      (1)

where Ŝd(ω,n) is the estimate of the spectrum energy of the residual echo at frequency, ω at time n. Similarly,

Hb(ω,n) = 1 - Ŝd(ω,n)
Sy(ω,n)
        (2)

can be used as the attenuation factor for the ambient noise components, where Ŝd(ω,n) can be estimated using techniques described in Noise Reduction of Non-stationary Noise Sources. Therefore, (1) and (2) can be combined to produce

Hc(ω,n) = max{Hmin, Hb(ω,n) ⋅ Hres(ω,n)}      (3)

where Hmin is maximum allowable attenuation.The method above describes a single channel estimation approach to combined echo and noise control. Other approaches also can modify the combined filter to psychoacoustic designs of human ear and/or use multiple channels to take advantage of the spatial coherence of the noise sources to further improve the perceptual quality of the system.

Social Media

Contact Us

VOCAL Technologies, Ltd.
520 Lee Entrance, Suite 202
Amherst, New York 14228
Phone: +1-716-688-4675
Fax: +1-716-639-0713