MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD 21218.

Slides:



Advertisements
Similar presentations
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
Advertisements

Chapter 7 End-to-End Data
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Department of Computer Engineering University of California at Santa Cruz Data Compression (3) Hai Tao.
JPEG.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Image (and Video) Coding and Processing Lecture: DCT Compression and JPEG Wade Trappe Again: Thanks to Min Wu for allowing me to borrow many of her slides.
CS :: Fall 2003 MPEG-1 Video (Part 1) Ketan Mayer-Patel.
JPEG Still Image Data Compression Standard
Hao Jiang Computer Science Department Sept. 27, 2007
CMPT 365 Multimedia Systems
T.Sharon-A.Frank 1 Multimedia Image Compression 2 T.Sharon-A.Frank Coding Techniques – Hybrid.
CS430 © 2006 Ray S. Babcock Lossy Compression Examples JPEG MPEG JPEG MPEG.
5. 1 JPEG “ JPEG ” is Joint Photographic Experts Group. compresses pictures which don't have sharp changes e.g. landscape pictures. May lose some of the.
1 Image and Video Compression: An Overview Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,
Why Compress? To reduce the volume of data to be transmitted (text, fax, images) To reduce the bandwidth required for transmission and to reduce storage.
Image Compression: JPEG Multimedia Systems (Module 4 Lesson 1)
Image and Video Compression
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Still Image Conpression JPEG & JPEG2000 Yu-Wei Chang /18.
Lossy Compression Based on spatial redundancy Measure of spatial redundancy: 2D covariance Cov X (i,j)=  2 e -  (i*i+j*j) Vertical correlation   
Introduction to JPEG Alireza Shafaei ( ) Fall 2005.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 8 – JPEG Compression (Part 3) Klara Nahrstedt Spring 2012.
JPEG Motivations: Motivations: 1.Uncompressed video and audio data are huge. In HDTV, the bit rate easily exceeds 1 Gbps. --> big problems for.
ECE472/572 - Lecture 12 Image Compression – Lossy Compression Techniques 11/10/11.
1 Image Compression. 2 GIF: Graphics Interchange Format Basic mode Dynamic mode A LZW method.
MPEG: (Moving Pictures Expert Group) A Video Compression Standard for Multimedia Applications Seo Yeong Geon Dept. of Computer Science in GNU.
1 Section 3. Image Compression Xudong Ni Group Member: Wei Yan,Li Yang,Xudong Ni Computer Science Florida International University.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Klara Nahrstedt Spring 2011
Concepts of Multimedia Processing and Transmission IT 481, Lecture 5 Dennis McCaughey, Ph.D. 19 February, 2007.
EE465: Introduction to Digital Image Processing1 Data Compression Techniques Text: WinZIP, WinRAR (Lempel-Ziv compression’1977) Image: JPEG (DCT-based),
JPEG. The JPEG Standard JPEG is an image compression standard which was accepted as an international standard in  Developed by the Joint Photographic.
Image Processing and Computer Vision: 91. Image and Video Coding Compressing data to a smaller volume without losing (too much) information.
Indiana University Purdue University Fort Wayne Hongli Luo
CIS679: Multimedia Basics r Multimedia data type r Basic compression techniques.
JPEG CIS 658 Fall 2005.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Chapter 9 Image Compression Standards 9.1 The JPEG Standard 9.2 The JPEG2000 Standard 9.3 The JPEG-LS Standard 9.5 Further Exploration Li & Drew1.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 10 – Compression Basics and JPEG Compression (Part 4) Klara Nahrstedt Spring 2014.
The JPEG Standard J. D. Huang Graduate Institute of Communication Engineering National Taiwan University, Taipei, Taiwan, ROC.
JPEG - JPEG2000 Isabelle Marque JPEGJPEG2000. JPEG Joint Photographic Experts Group Committe created in 1986 by: International Organization for Standardization.
JPEG (Joint Photographic Expert Group)
JPEG Image Compression Standard Introduction Lossless and Lossy Coding Schemes JPEG Standard Details Summary.
ELE 488 F06 ELE 488 Fall 2006 Image Processing and Transmission ( ) Image Compression Quantization independent samples uniform and optimum correlated.
JPEG.
CS654: Digital Image Analysis
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
Introduction to JPEG m Akram Ben Ahmed
JPEG. Introduction JPEG (Joint Photographic Experts Group) Basic Concept Data compression is performed in the frequency domain. Low frequency components.
MPEG CODING PROCESS. Contents  What is MPEG Encoding?  Why MPEG Encoding?  Types of frames in MPEG 1  Layer of MPEG1 Video  MPEG 1 Intra frame Encoding.
By Dr. Hadi AL Saadi Lossy Compression. Source coding is based on changing of the original image content. Also called semantic-based coding High compression.
IS502:M ULTIMEDIA D ESIGN FOR I NFORMATION S YSTEM M ULTIMEDIA OF D ATA C OMPRESSION Presenter Name: Mahmood A.Moneim Supervised By: Prof. Hesham A.Hefny.
Signal Prediction and Transformation Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD
JPEG Compression What is JPEG? Motivation
Chapter 9 Image Compression Standards
The Johns Hopkins University
JPEG Image Coding Standard
Discrete Cosine Transform
JPEG.
Image Compression Standards (JPEG)
CMPT 365 Multimedia Systems
JPEG Pasi Fränti
JPEG Still Image Data Compression Standard
The JPEG Standard.
Govt. Polytechnic Dhangar(Fatehabad)
Presentation transcript:

MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD 21218

MP3  MP3 = MPEG2 Layer III audio coding  Transform: cascade of 32- channel filter bank and 6- channel or 18- channel MDCT  Quantization: uniform scalar quantizer with a psycho-acoustic model  Entropy coding: run-length + Huffman

Transformation Stage in MP3 H (z) 0 1 x[n] channel 512-tap CMFB 6-channel 12-tap MLT/MDCT H (z) channel 36-tap MLT/MDCT transients steady-state

Masking  Masking discovered from psycho-acoustic experiments  Human auditory system is less sensitive around a strong tonal signal

Masking: Original Signal

Masking Threshold  Signal components below the masking threshold are deemed insignificant (can be quantized to zero)  Components are computed from overlapping 1024-long Hanning windows

Advanced Audio Coding (AAC)  Successor of MP3  Better audio quality than MP3 at most bit rates  Perceptually lossless at 320 kbps for 5-channel surround sound (64 kbps/channel)  Almost CD quality at 96 kbps (48 kbps/channel)  AAC is part of the MPEG4 Standard  Default audio format of Apple’s iPhone, iPod, iTunes; Sony PlayStation 3; Nintendo Wii  MDCT – Scalar Quantization – Huffman Coding

Transformation Stage in AAC H (z) channel 256-tap MDCT H (z) channel 2048-tap MDCT for transient signals for steady-state signals  AAC adaptively switches between  8 blocks of 128-point MDCT with 256-point windows  1 block of 1024-point MDCT with 2048-point window  All windows have 50% overlap x[n] x[n]

JPEG Still Image Coding Standard Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD 21218

Overall Structure of JPEG Color Converter Level Offset 8x8 DCT Uniform Quant. DC Pred. DC VLC Zigzag Scan Run -Level AC VLC DC AC  Color converter  RGB to YUV  Level offset  subtract 2^(N-1). N: bits / pixel.  Quantization  Different step size for different coefficients  DC  Predict from DC of previous block  AC:  Zigzag scan to get 1-D data  Run-level: joint coding of non-zero coeffs and number of zeros before

JPEG Quantization  Uniform mid-tread quantizer  Larger step sizes for chroma components  Different coefficients have different step sizes  Smaller steps for low frequency coefficients (more bits)  Larger steps for high frequency coefficients (less bits)  Human visual system is not sensitive to error in high frequency  Chroma Quantization Table  Luma Quantization Table  Actual step size: Scale the basic table by a quality factor

Scaling of Quantization Table  Actual Q table = scaling x Basic Q table:  quality factor ≤ 50: scaling = 50/quality  quality factor > 50: scaling = 2 - quality/50 Quality Factor Scaling

DC Prediction  DC Coefficients: average of a block  DC of neighboring blocks are still similar to each others: redundancy  The redundancy can be removed by differential coding:  e(n) = DC(n) – DC(n-1)  Only encode the prediction error e(n) 8x8 DC coeffs of Lena

Coefficient Category  Divide coefficients into categories of exponentially increased sizes  Use Huffman code to encode category ID  Use fixed length code within each category  Similar to Exponential Golomb code RangesRange SizeDC Cat. IDAC Cat. ID 010N/A -1, , -2, 2, , -6, -5, -4, 4, 5, 6, , …, -8, 8, …, , …, -16, 16, …, , …, -32, 32, …, ………… [-32767, ], [16384, 32767]

Coding of DC Coefficients  Encode e(n) = DC(n) – DC(n-1) 8x8 DC Cat.Prediction ErrorsBase Codeword , , -2, 2, , -6, -5, -4, 4, 5, 6, , …, -8, 8, …, , …, -16, 16, …, , …, -32, 32, …, ……… Our example: DC: 8. Assume last DC: 5  e = 8 – 5 = 3. Cat.: 2, index 3  Bitstream: 10011

Coding of AC Coefficients  Most non-zero coefficients are in the upper-left corner  Zigzag scanning  Example  Zigzag scanning result (DC is coded separately): EOB

A Complete Example  Original data:2-D DCT  Quantized by basic table  Zigzag scanning EOB Q table: … 12 … 14 … floor(39.8/ ) = 2 floor(6.5/ ) = 1 -floor(102.4/ ) = -9 floor(37.7/ ) = 3

A Complete Example  Zigzag scanning EOB  Inverse Quantization  Reconstructed block  MSE: 5.67

Progressive JPEG  Baseline JPEG encodes the image block by block:  Decoder has to wait till the end to decode and display the entire image  Progressive: Coding DCT coefficients in multiple scans  The first scan generates a low-quality version of the entire image  Subsequent scans refine the entire image gradually.  Two procedures defined in JPEG:  Spectral selection:  Divide all DCT coefficients into several bands (low, middle, high frequency subbands…)  Bands are coded into separate scans  Successive approximation:  Send MSB of all coefficients first  Send lower significant bits in subsequent scans

JPEG Coding Result for Lena Quality factor: QF 25 QF 5 Blocking artifact

Summary  Transformation  Karhunen-Loeve Transform (KLT): optimal linear transform  Discrete Cosine Transform (DCT): for images & video  MDCT: overlapped higher frequency resolution for audio  Discrete Wavelet Transform (DWT): multi-resolution representation  MP3 & AAC  Audio coding: FB/MDCT – Quantization – Huffman  JPEG: first international compression standard for still images  DCT – Quantization – Run-length – Huffman  JPEG2000: latest technology, wavelet-based  Scalable, progressive coding with flexible intelligent functionalities