Chapter 5 Summary: Review of MPEG-4 General Audio Coding Acronyms: Two types of coding: - General Audio Coding: general audio coding A Codecs source coding Perceptual coding LPC is described, whose main modules are the difference calculation, the quantizer and the predictor. the error between the prediction and the real signal is what's transmitted A sub-band codec uses a filter bank which splits different frequencies into bands which are then quantized using fewer bits since sub-bands have smaller steps than the original Perceptual coding makes use of a perceptual model. It attempts to partition frequency (and time) to match the capacity of the human hearing. The rate (loop?) used into such codecs are used to optimize rate versus quality. 2. MPEG-2 AAC Advanced Audio Coding Directly incorporated into MPEG4 General Audio Standard. It uses The block diagram from page 136 introduces a few elements Noise Shaper: work with the Filter Bank by modifying it's characteristics Rate-distortion control: adjust the scale factor to adjust the overall bit rate Analysis-synthesis filter bank Stationary signal: Transient signal:
Perceptual model Quantization and Noiseless Coding AAC is a variable-rate codec, so the rate distortion module adjusts step size and # of quantization level to achieve *constant* rate non-linear quantization (Huffman coded - 11 different codebooks, can be used for the same spectrum) MPEG4 Addition to AAC new tools within the codec: - LTP (Long Term Prediction) - TwinVQ (Transform Domain Weighted Interleaved Vector Quantization) MPEG4 Scalabel General Audio Coding This allows to encode independently of any bit rate requirement. Bit streams generated consist of several partial bitstream encoded separately.
Having a CELP core coder and ACC enhancement layer is the best configuration for speech quality. General Characteristics: - 2kbits/sec to 100kbits/sec per channel |