EVS Codec Reveals Superior Performance over AMR-WB under Error Conditions

By :

The EVS codec offers many audio quality enhancement features, including an error concealment mechanism to neutralize effects of transmission errors.

The new Enhanced Voice Services (EVS) codec was standardized by the 3GPP in December 2014 to offer improved performance.  Among its many audio quality enhancement features, it provides an error concealment mechanism to neutralize the effects of transmission errors.

EVS codec error resilience

A frequent issue for audio services is deterioration of quality due to radio channel characteristics and network error conditions, leading to the loss, corruption, or unacceptable latency of audio packets. To combat this issue, the EVS codec performs frame error concealment (FEC) on a limited number of frames in order to produce a neutral signal and replace what otherwise sounds strange or missing.  Previous AMR-WB audio codecs can manage approximately 3% of lost frames without critical artifacts, but there is noticeable distortion. In comparison, the EVS codec can perform FEC on more than 15% of lost frames without critical artifacts and still maintain relatively clear voice signals.

Packet loss is seen by users to be the most noticeable disturbance in voice communications and is likely to occur for a variety of reasons.  This is especially true with unmanaged networks, where Quality of Service (QoS) is not present, due to congestion and misrouted packets.  Managed networks have more capability than unmanaged networks, but they require a skilled administrator or engineer to achieve the best performance. Still, packet loss causes degradation in voice quality. Packet loss can be bursty in nature, with periods of several seconds during which the loss may be as high as 10% to 15%. The average packet loss rate for a call may be low; however, these periods of high loss rate can cause noticeable degradation in call quality.

The EVS channel aware mode offers significantly improved error concealment in voice communications as compared to AMR-WB and AMR-NB codecs over Voice over LTE (VoLTE). The error resilience is achieved using a form of in-band forward error correction.  Source-controlled coding techniques are used to identify candidate speech frames for bit rate reduction, leaving spare bits for transmission of partial copies of prior frames such that a constant bit rate is maintained. The self-contained partial copies are used to improve the error robustness, in case the original primary frame is lost or discarded due to late arrival. Subjective evaluation results from ITU-T P.800 Mean Opinion Score (MOS) tests are provided below, showing improved quality under channel impairments as well as negligible impact to clean channel performance. EVS channel aware mode is specifically created for reliable speech transmission under especially bad network conditions, such as high packet loss, jitter, etc. Channel aware is available in two configuration modes: EVS WB 13.2 and EVS S-WB 13.2.

Image of the Audio MOS Downlink screen

As the graph indicates, an audio quality comparison between EVS and AMR reveals that EVS delivers superior quality at similar bit rates. The robustness of EVS using channel aware mode (as shown with 13.2k) delivers superior performance through all packet loss scenarios, with excellent audio quality results.

More on audio quality measurement for voice

Perceptual Objective Listening Quality Assessment (POLQA) is based on ITU-T Recommendation P.863. POLQA provides voice quality analysis of HD Voice and Full HD Voice. It offers an advanced level of benchmarking accuracy, and adds significant new capabilities for WB, S-WB and EVS voice signals along with support for the most recent voice coding and VoIP transmission technologies.  POLQA is one of the best choices for evaluating, optimizing and measuring the voice quality of next-generation networks and delivers accurate, high-resolution analysis results. POLQA is measured as a MOS.  MOS is expressed as a single number in the range 1 to 5, where 1 is lowest perceived audio quality and 5 is the highest perceived audio quality measurement. The standard deviation of the mean distribution needs to be very small.

It is important to test the uplink and downlink audio MOS measurement of devices. When tested in parallel, the MOS needs to remain stable.




Eli Cohen
Eli Cohen

Eli Cohen is the Director of product management for IMS/RCS/VoLTE in Wireless Business Unit and the founder of the ProLab Testing Suite and Evident Monitoring Solution. For the past 16 years, he has created and managed innovative testing and measurement solutions for IMS, RCS , VoLTE, VoIP and 3G, including Audio and Video Quality measurement. In addition, he has been managing and leading research on Video Quality and Video Quality algorithms since its beginning. Before joining Spirent, Eli served in several product management positions in Industry leading companies including, Avaya, Radvision, MTS and SNCF. He holds a B.Sc in Computer Science from Jerusalem College of Technology.