SILK codec
SILK audio codec is used for real-time multi-channel audio VoIP applications

SILK codec is offered as a real-time implementation that can be configured to support multichannel wideband audio VoIP applications. Our SILK speech compression algorithm is available for licensing standalone, as part of a modular software library suite, or with a VoIP stack for different integration options with user applications. Custom designs are also available for our clients. Contact us to discuss your SILK voice codec application requirements.

download brochure

VOCAL’s SILK codec source code is optimized for execution on ANSI C and DSP architectures from TI, ADI, AMD, ARM, MIPS, Intel and other vendors. It can be integrated with our Acoustic Echo Canceller (AEC), G.168 line echo canceller, noise reduction algorithms, other telephony products, as well as with our VoIP stack.


SILK is designed by Skype as an internet wideband audio codec for use in VoIP. It operates at four different sampling rates: 8 kHz narrowband, 12 kHz mediumband, 16 kHz wideband, and 24 kHz superwideband. These allow for the capture of higher frequencies, which provide fuller sound, while also allowing interoperability with the Public Switched Telephone Network (PSTN). SILK has automatic detection of superwideband audio and can downsample to wideband if there is no benefit to the inclusion of the higher frequencies.

SILK codec has a variable bitrate (VBR) that can be set between 6 and 40 kbps, with higher bitrates providing better sound quality due to less quantization noise. SILK operates on 20 ms frames with a 5ms look ahead which causes a 25 ms algorithmic delay. The compressed audio is contained in packets of 20, 40, 60, 80 or 100 ms. Larger packets can provide a lower effective bitrate due to IP overhead per packet and certain parameters being conditionally encoded after the first frame per packet. Larger packets also cause longer delays and are more prone to errors due to lost packets. SILK has Low Bite Rate Redundancy (LBRR), also called Forward Error Correction (FEC), which makes it robust to packet loss.

SILK voice codec has three complexity settings: low, medium, and high. These control the order of the filters used for pitch estimation and noise shaping ans the number of stages used in the quantization of the noise shape parameters, the Long Term Prediction (LTP) coefficients, and the Normalized Line Spectral Frequencies (NLSF). This allows SILK to take advantage of today’s powerful processors while not abandoning older processors.



VOCAL’s speech coders are available for the following platforms. Please contact us for specific SILK codec supported platforms.

  • Texas Instruments – C6000 (TMS320C62x, TMS320C64x, TMS320C645x, TMS320C66x, TMS320C67x), DaVinci, OMAP, C5000 (TMS320C54x, TMS320C55x)
  • Analog Devices – Blackfin, TigerSHARC, SHARC
  • PowerPC
  • MIPS – MIPS32, MIPS64, MIPS4Kc
  • ARM – ARM7, ARM9, ARM9E, ARM10E, ARM11, StrongARM, ARM Cortex-A8, Cortex-M1
  • Intel / AMD x86, x64 (both 32 and 64 bit modes)


Operating Systems

VOCAL’s SILK voice codec is available for the following Operating Systems. Please contact us for other supported Operating Systems.

  • Google Android
  • Apple iOS / iPhone / iPad & MacOS
  • Unix,  Linux, μClinux, BSD
  • Microsoft Windows ACM / RTC / CE / Mobile
  • Symbian
  • eCOS / eCOSPro
  • Wind River VxWorks
  • Green Hills Integrity
  • Micrium μCOS