David's Web Page



Introduction - or: What is "MPEG Audio Layer-3"?

Today, efficient coding techniques are a must for cost-effective processing of digital audio and video data by computers. Data reduction of moving pictures and sound is a key technology for any application with limited transmission or storage capacity. In the recent years, a lot of progress has been achieved. While there (still) exist several proprietary formats for audio and video coding, the ISO/IEC standardisation body has released an international standard ("MPEG") for powerful audio and video coding tools (see: Overview about the ISO-MPEG Standard - or: What is MPEG all about?).

Without data reduction, digital audio signals typically consist of 16 bit samples recorded at a sampling rate more than twice the actual audio bandwidth (e.g. 44.1 kHz for Compact Disks). So you end up with more than 1400 kbit to represent just one second of stereo music in CD quality. By using MPEG audio coding, you may shrink down the original sound data from a CD by a factor of 12, without losing sound quality. Factors of 24 and even more still maintain a sound quality that is significantly better than what you get by just reducing the sampling rate and the resolution of your samples. Basically, this is realized by "perceptual coding" techniques addressing the perception of sound waves by the human ear (see: Basics of Perceptual Audio Coding - or: What is the trick?).

Using MPEG audio, one may achieve a typical data reduction of

1:4

by Layer 1 (corresponds with 384 kbps for a stereo signal),

1:6...1:8

by Layer 2 (corresponds with 256..192 kbps for a stereo signal),

1:10...1:12

by Layer 3 (corresponds with 128..112 kbps for a stereo signal),

still maintaining the original CD sound quality.

By exploiting stereo effects and by limiting the audio bandwidth, the coding schemes may achieve an acceptable sound quality at even lower bitrates. Layer-3 is the most powerful member of the MPEG audio coding family. For a given sound quality level, it requires the lowest bitrate - or for a given bitrate, it achieves the highest sound quality (see: Advanced Features of Layer-3 - or: Why does Layer-3 perform so well?).

Some typical performance data of Layer-3 are:

sound quality

bandwidth

mode

bitrate

reduction ratio

"telephone sound"

2.5 kHz

mono

8 kbps (*)

96:1

"better than shortwave"

4.5 kHz

mono

16 kbps

48:1

"better than AM radio"

7.5 kHz

mono

32 kbps

24:1

"similar to FM radio"

11 kHz

stero

56...64 kbps

26...24:1

"near-CD"

15 kHz

stereo

96 kbps

16:1

"CD"

>15 kHz

stereo

112..128kbps

14..12:1

*: Fraunhofer uses a non-ISO extension of Layer-3 for enhanced performance ("MPEG 2.5")

All in all, Layer-3 is the key for numerous low-bitrate, high-quality sound applications (see: Applications - or: Layer-3, what is it good for?).

Real-time Layer-3 players

WinPlay3

"WinPlay3" allows the decoding simply by software on any Pentium PC in real time. A 80486 class CPU with a built-in floating-point-unit will also allow some limited operation. For the availability of supported modes, please refer to the following performance matrix:

Pentium

486DX4-133

486DX2-66

486DX-50

486DX-33

MPEG-1 stereo

ok

ok

-

-

-

MPEG-1 downmix*

ok

ok

ok

-

-

MPEG-1 mono

ok

ok

ok

ok

-

MPEG-2 stereo

ok

ok

ok

ok

-

MPEG-2 downmix

ok

ok

ok

ok

ok

MPEG-2 mono

ok

ok

ok

ok

ok

*downmix: the original stereo signal will be played back as a mono signal
"MPEG-1" = "MPEG-1 Layer-3", i.e. sample rates 32, 44.1 or 48 kHz
"MPEG-2" = "MPEG-2 Layer-3", i.e. sample rates 16, 22.05 or 24 kHz

On a Pentium-90, WinPlay3 consumes less than 30 % of the CPU power to decode Layer-3 stereo @ 44.1 kHz, or around 5 % of the CPU power to decode Layer-3 mono @ 16 kHz.
At least, a 8-bit stereo sound card is required. For full quality audio, a 16-bit card is recommended. The card´s MCI driver should support sampling frequencies from 8 kHz to 48 kHz.
A standard VGA graphics card is required.
As WinPlay3 buffers up to 4 seconds of sound data due to the limitations of the Microsoft Windows multitasking architecture, around 1 MByte free physical memory must be available.
WinPlay3 runs with the following operating systems: Microsoft Windows 3.1/3.11 (in extended 386 mode), Windows 95 und Windows NT (long file names not yet supported).
WinPlay3 supports file play back of *.mp3 files and direct play from an URL via HTTP. WinPlay3 can simply be integrated as an helper application in common browsers, for example Netscape or Mosaic.
WinPlay3 is available at http://www.iis.fhg.de/departs/amm/layer3/winplay3/.