Complete Communications Engineering

There are two major groups of codecs used for Packet Telephony (such as Voice-over-Internet-Protocol technologies).  The main differentiation between these two groups of codecs is the coding/decoding mechanism: in the first group there are codecs whose encoding/decoding principle does not depend on the type of encoded/decoded signals. In the second group there are codecs whose encoding/decoding mechanism is based on modeling production apparatus and process of human speech. The speech modeling codecs do not operate exceptionally well on non-speech signals (such as harmonic tones and music). Group I codecs do not have this limitations. However, they occupy greater bit-rate than Group II codecs.

An example of a codec belonging to Group I is G.711 PCM codec.  Codecs belonging to Group I are called waveform codecs. They are mostly lossy codecs (meaning that the encoding/decoding operation results in some loss of information). An exception is G.711 PCM codec which a pseudo-lossless codec (a two-pass through the cycle of encoding/decoding results in ceasing the information loss).

An example of a codec belonging to Group II is G.729A codec, a Linear Prediction-type of codec; specifically, it is a CS-ACELP (Conjugate-Structure Algebraic-Code-Excited Linear-Prediction codec), one of the ITU-T-standard low-bit-rate codecs. All codecs belonging to Group II are lossy.

In Table 1, there are codecs belonging to Group I (waveform codecs) with notes on basic characteristics (cf. Ref. [1]).

Table 1: Common waveform codecs used in VoIP

Name (Standard)

Description

Bit rate

[kbps]

Sampling

[kHz]

Frame size [ms]

Notes

G.711 (ITU-T)PCM648Sample-basedA-law & μ-law
G.711.1 (ITU-T)PCM80-968/16Sample-based
G.721 (ITU-T)ADPCM328Sample-basedNow G.726
G.722 (ITU-T)2-band ADPCM6416Sample-based
G.722.1 (ITU-T)MLT24/321620
G.722.2 (ITU-T)AMR-WB6.6 – 23.851620Main app: 3rd gen mobile
G.723 (ITU-T)ADPCM-extension24/408Sample-basedNow G.726
G.726  (ITU-T)ADPCM16/24/32/408Sample-basedReplaces G.721 and G.723
G.727  (ITU-T)ADPCMVariable8Sample-basedSimilar to G.726
DVI (IMA)DVI4 uses ADPCM32VariableSample-based
L16Linear PCM128VariableSample-basedWB (uncompressed)

In Table 2, there are codecs belonging to Group II (model-based codecs, i.e., related directly or indirectly to Linear Prediction models) with notes on basic characteristics.

Table 2: Common model-based (LP-related) codecs used in VoIP

Name (Standard)

Description

Bit rate

[kbps]

Sampling

[kHz]

Frame size [ms]

Notes

G.723.1 (ITU-T)MP-MLQ and ACELP5.6/6.3830A part of H.323 video conf
G.728 (ITU-T)LD-CELP1682.5Very low delay
G.729 (ITU-T)CS-ACELP8810Low delay (15 ms)
G.729.1 (ITU-T)CS-ACELP8-32, step 2810WB
GSM 06.10 (ETSI)RPE-LTP13822.5Mostly in GSM cellular; seldom in VoIP
LPC10 (USA Gov)LP2.4822.5Very seldom used in VoIP
SpeexCELP8,16,32NB & WB30 (NB), 34 (WB)Also includes ECAN and NR features
DoD CELP (DoD)CELP4.830
EVRC (3GPP2)RCELP9.6/4.8/1.2820Enhanced Variable Rate CODEC; for CDMA
DVI (IMA)IMA32VariableSample-based
SILK (Skype)LPC6-40Variable20Adaptive bit-rate codec; low delay
iLBCAAC813.330NB (there is a 15,2 kbps version)

Tables 1 and 2 include examples of codecs that are relatively frequently used in VoIP technology. The full list of waveform-codecs and model-based (for example, CELP) codecs is much longer. There are also many proprietary solutions for encoding and decoding speech signals following waveform coding as well as the model-based coding.

VOCAL’s VoIP Software modules are part of our VoIP Reference Design and provide secure, real-time unified communications for voice, video, radio and data over the Internet or any other IP network. Speech codecs are part of the software library modules. Contact us to discuss your speech application requirements with our engineering staff.

More Information

References

  1. 3CX
  2. SPEEX
  3. The RCELP speech-coding algorithm, Transactions on Emerging Telecommunication Technologies; autors: W. Bastiaan Kleijn1, Peter Kroon1 and Dror Nahumi2