MPEG-1 Overview of MPEG-1 Standard

Presentation transcript:

MPEG-1
Overview of MPEG-1 Standard
Introduction to perceptual and entropy coding

Contents
- History
- Psychoacoustics and perceptual coding
- Entropy coding
- MPEG-1
- Layer I/II
- Layer III (MP3)
- Comparison and Audio Quality
18 January 2019, MPEG-1 Presentation

Introduction
- Digitizing an analog signal is itself (lossy) compression
- Digitizing introduces quantization noise
- Quantization noise implies a loss of quality
- Linear quantization at 16 bit or more (about 98 dB SNR): inaudible noise (CD)
- Linear quantization at 4 bit (about 26 dB SNR)
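
The 98 dB and 26 dB figures match the usual rule of thumb for an ideal uniform quantizer driven by a full-scale sine wave, SNR ≈ 6.02 N + 1.76 dB for N bits. A minimal sketch of that arithmetic follows; the function name is illustrative and does not come from the slides.

```python
import math

def linear_quantization_snr_db(bits: int) -> float:
    """Theoretical SNR (dB) of an ideal N-bit uniform quantizer, full-scale sine input."""
    # 20*log10(2) is about 6.02 dB per bit; 10*log10(1.5) is the 1.76 dB sine-wave offset.
    return 20 * math.log10(2) * bits + 10 * math.log10(1.5)

for bits in (16, 4):
    print(f"{bits} bit: {linear_quantization_snr_db(bits):.1f} dB")
```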

History
Moving Picture Experts Group (MPEG)
- Created in January 1988
- Started the development of MPEG-1 in May 1988
- Published the MPEG-1 standard in November 1992 (ISO/IEC 11172-3 for audio)
MPEG-1 standard
- Defines the bit-stream
- Defines the decoding functions
- DOES NOT define encoding techniques
- Inspired by MUSICAM (Masking pattern Universal Subband Integrated Coding And Multiplexing)

Psychoacoustics
- Masking effect
- Critical bands
Figure: Time domain masking (Pohlmann 2000)
Figure: Frequency domain masking (Pohlmann 2000)

Critical bands (Brandenburg), all frequencies in Hz:

z/Bark   lower boundary   higher boundary   bandwidth   central frequency
0        0                100               100         50
1        100              200               100         150
2        200              300               100         250
3        300              400               100         350
4        400              510               110         450
5        510              630               120         570
6        630              770               140         700
7        770              920               150         840

Figure: Idealized critical bands (Painter & Spanias 2000)
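
As a small illustration of how the critical-band boundaries can be used, the sketch below maps a frequency to its band number for the eight bands listed on this slide (0 to 920 Hz). The function name and the choice to treat each lower boundary as inclusive are assumptions made for the example.

```python
from bisect import bisect_right

# Higher boundaries (Hz) of critical bands z = 0..7, taken from the slide.
UPPER_EDGES_HZ = [100, 200, 300, 400, 510, 630, 770, 920]

def critical_band_index(freq_hz: float) -> int:
    """Return the band number z whose range [lower, higher) contains freq_hz."""
    if not 0 <= freq_hz < UPPER_EDGES_HZ[-1]:
        raise ValueError("frequency outside the bands listed on the slide")
    # Count how many upper edges are <= freq_hz; that count is the band number.
    return bisect_right(UPPER_EDGES_HZ, freq_hz)

print(critical_band_index(450))  # central frequency of band 4
```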

Perceptual Coding
- Dividing the signal into subbands
- Ignoring masked audio information
- Introducing inaudible quantization noise
Figure: Bit allocation according to masking threshold (Pohlmann 2000)
Figure: Quantization noise added according to masking threshold (Pohlmann 2000)

Perceptual Coding
Figure: Perceptual encoder/decoder (Kahrs & Brandenburg 1998)

Entropy Coding
- Use information about the signal to code it efficiently
- Entropy of a signal: H = -sum_i p_i log2(p_i), in bits/symbol
- Example 1: {0, 2, 2, 2, 0, 0, 0, 0, 0, 2, 0, 3, 2, 2, 0, 0, 0, 3, 0, 0}
  - 20 symbols: twelve 0 (0.6), zero 1 (0), six 2 (0.3), two 3 (0.1)
  - Entropy H = 1.30
- Example 2: {1, 2, 3, 0, 2, 1, 1, 2, 3, 0, 0, 1, 0, 3, 3, 3, 2, 0, 1, 2}
  - 20 symbols: five 0 (0.25), five 1 (0.25), five 2 (0.25), five 3 (0.25)
  - Entropy H = 2
- Shannon theorem
  - It is impossible to code with fewer than H bits/symbol
  - It is possible to code with fewer than H+1 bits/symbol
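
The two entropy values can be checked numerically. The sketch below estimates symbol probabilities from relative frequencies and applies the standard Shannon entropy formula; the function and variable names are illustrative.

```python
import math
from collections import Counter

def entropy_bits_per_symbol(symbols):
    """Shannon entropy, using relative frequencies as probability estimates."""
    counts = Counter(symbols)
    total = len(symbols)
    # Symbols that never occur contribute nothing, so only observed symbols are summed.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

example_1 = [0, 2, 2, 2, 0, 0, 0, 0, 0, 2, 0, 3, 2, 2, 0, 0, 0, 3, 0, 0]
example_2 = [1, 2, 3, 0, 2, 1, 1, 2, 3, 0, 0, 1, 0, 3, 3, 3, 2, 0, 1, 2]

print(f"{entropy_bits_per_symbol(example_1):.2f}")  # 1.30
print(f"{entropy_bits_per_symbol(example_2):.2f}")  # 2.00
```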

Entropy Coding
Huffman coding
- Example 1: {0, 2, 2, 2, 0, 0, 0, 0, 0, 2, 0, 3, 2, 2, 0, 0, 0, 3, 0, 0}
  - 20 symbols: twelve 0 (0.6), zero 1 (0), six 2 (0.3), two 3 (0.1)
  - Entropy H = 1.30
- Fixed-length coding: 0 → "00", 1 → "01", 2 → "10", 3 → "11"
  - "0010101000000000001000111010000000110000" (40 bits)
- Huffman coding: 0 → "0", 1 → "111", 2 → "10", 3 → "110"
  - "010101000000100110101000011000" (30 bits)
- Efficiency:
  - Fixed-length coding: 2 bits/symbol
  - Huffman coding: 1.5 bits/symbol (on average)
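
The bit counts above can be reproduced with the two code tables from the slide. The sketch below only encodes and decodes with those given tables; it does not construct a Huffman tree, and the helper names are illustrative.

```python
FIXED = {0: "00", 1: "01", 2: "10", 3: "11"}
HUFFMAN = {0: "0", 1: "111", 2: "10", 3: "110"}  # code table as given on the slide

def encode(symbols, table):
    """Concatenate the codeword of every symbol."""
    return "".join(table[s] for s in symbols)

def decode(bits, table):
    """Decode a bit string; valid because both tables are prefix-free."""
    inverse = {code: sym for sym, code in table.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:
            out.append(inverse[current])
            current = ""
    if current:
        raise ValueError("trailing bits do not form a complete codeword")
    return out

example_1 = [0, 2, 2, 2, 0, 0, 0, 0, 0, 2, 0, 3, 2, 2, 0, 0, 0, 3, 0, 0]
fixed_bits = encode(example_1, FIXED)
huffman_bits = encode(example_1, HUFFMAN)
print(len(fixed_bits) / len(example_1))    # 2.0 bits/symbol
print(len(huffman_bits) / len(example_1))  # 1.5 bits/symbol
```

The 1.5 bits/symbol result lies between H = 1.30 and H + 1, as the Shannon bounds require.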

MPEG-1
- Sampling rates: 32, 44.1 and 48 kHz
- Four modes:
  - Mono: 1 channel
  - Stereo: 2 channels
  - Dual: 2 independent channels (e.g. bilingual programmes)
  - Joint stereo: 2 channels coded together
- 2 perceptual models
- Floating point quantization (normalization)
- Error checking: cyclic redundancy check (CRC)

MPEG-1 Layer I
- From 32 to 448 kbps
- 32-subband polyphase filterbank
- Bit allocation (0-15)
- Max dynamic range > 120 dB
- Linear quantization
- 1 frame = 384 samples
- Example: Philips Digital Compact Cassette
Figure: Example of Layer I encoder (Pohlmann 2000)
Figure: Layer I frame format (Pohlmann 2000)

MPEG-1 Layer II
- From 32 to 384 kbps
- Improvement of Layer I:
  - Improved FFT analysis
  - Scale factor redundancy
  - Finer quantization
- 1 frame = 1152 samples
- Example: Digital Audio Broadcasting (DAB)
Figure: Example of Layer II encoder (Pohlmann 2000)
Figure: Layer II frame format (Pohlmann 2000)
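
The frame sizes quoted for Layers I and II (384 and 1152 samples) translate into frame durations that depend on the sampling rate. A minimal sketch of that conversion, with illustrative names:

```python
SAMPLES_PER_FRAME = {"Layer I": 384, "Layer II": 1152}  # frame sizes from the slides
SAMPLING_RATES_HZ = (32_000, 44_100, 48_000)            # MPEG-1 sampling rates

def frame_duration_ms(layer: str, sampling_rate_hz: int) -> float:
    """Duration of one frame in milliseconds."""
    return 1000.0 * SAMPLES_PER_FRAME[layer] / sampling_rate_hz

for layer in SAMPLES_PER_FRAME:
    for rate in SAMPLING_RATES_HZ:
        print(f"{layer} at {rate} Hz: {frame_duration_ms(layer, rate):.2f} ms")
```

At 48 kHz this gives 8 ms per Layer I frame and 24 ms per Layer II frame.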

MPEG-1 Layer III (MP3)
- From 32 to 320 kbps
- Improvements:
  - Finer psychoacoustic model
  - Alias reduction (MDCT filters)
  - Nonuniform quantization
  - Entropy coding
  - Adaptive block size
- Only layer with patents
- Inspired by:
  - ASPEC (audio spectral perceptual entropy coding)
  - OCF (optimal coding in the frequency domain)
Figure: Example of Layer III encoder (Pohlmann 2000)
Figure: Layer III frame format (Pohlmann 2000)

MPEG-1 Layer III (MP3)
- Filtering: hybrid polyphase filter/MDCT
  - Steady-state signals: 18-point MDCT on every subband
    - Frequency resolution: 41.67 Hz
    - Time resolution: 24 ms
  - Transient signals: 6-point MDCT
    - Frequency resolution: 125 Hz
    - Time resolution: 8 ms
  - 3 block modes
  - Pre-echo detection
- Quantization: nonuniform, power law with exponent 3/4
- Entropy coding:
  - Huffman tables
  - Run-length coding
Figure: Filtering stage in Layer III encoder (Pohlmann 2000)
Figure: MDCT filterbank in Layer III encoder (Pohlmann 2000)
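
The resolution figures on this slide are consistent with a 48 kHz sampling rate. The sketch below reproduces them under two assumptions that are not stated on the slide: the sampling rate is 48 kHz, and "time resolution" means the span of one MDCT window (twice the number of MDCT lines, counted in subband samples, each covering 32 input samples).

```python
def hybrid_filterbank_resolution(sampling_rate_hz, mdct_lines, subbands=32):
    """Return (frequency resolution in Hz, window span in ms) for the hybrid filterbank."""
    # Each polyphase subband covers an equal share of the 0..fs/2 range.
    subband_width_hz = sampling_rate_hz / 2 / subbands
    freq_resolution_hz = subband_width_hz / mdct_lines
    # Assumption: an MDCT with N output lines uses a window of 2N subband samples,
    # and every subband sample corresponds to `subbands` input samples.
    window_input_samples = 2 * mdct_lines * subbands
    time_resolution_ms = 1000.0 * window_input_samples / sampling_rate_hz
    return freq_resolution_hz, time_resolution_ms

long_block = hybrid_filterbank_resolution(48_000, 18)   # steady-state signals
short_block = hybrid_filterbank_resolution(48_000, 6)   # transient signals
print(f"long:  {long_block[0]:.2f} Hz, {long_block[1]:.0f} ms")
print(f"short: {short_block[0]:.2f} Hz, {short_block[1]:.0f} ms")
```

Short blocks trade a threefold loss in frequency resolution for a threefold gain in time resolution, which limits how far quantization noise spreads ahead of a transient (pre-echo).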

Joint Stereo Coding
- Intensity coding
  - Sum of left/right channels
  - Coding of the sum and of the left/right scale factors
  - Usually only for high-frequency subbands
  - Efficient for redundant audio channels
- MS (mid/side) stereo coding
  - Sum and difference of left/right channels
  - Coding of the two values
  - Stereo masking
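
Mid/side coding can be sketched as a simple invertible transform. The example below uses a 1/sqrt(2) normalization, which is one common convention and an assumption here, since the slide does not give a scaling; the function names are illustrative.

```python
import math

SCALE = 1 / math.sqrt(2)  # assumed normalization; keeps the transform orthogonal

def ms_encode(left, right):
    """Convert left/right samples to mid (sum) and side (difference) signals."""
    mid = [(l + r) * SCALE for l, r in zip(left, right)]
    side = [(l - r) * SCALE for l, r in zip(left, right)]
    return mid, side

def ms_decode(mid, side):
    """Recover left/right samples from mid/side signals."""
    left = [(m + s) * SCALE for m, s in zip(mid, side)]
    right = [(m - s) * SCALE for m, s in zip(mid, side)]
    return left, right

left = [1.0, 0.5, -0.25]
right = [1.0, 0.25, 0.25]
mid, side = ms_encode(left, right)
print(side[0])  # 0.0 where the two channels are identical
```

When the channels are highly correlated, the side signal is small and needs few bits, which is why the technique is efficient for redundant channels.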

Audio Quality
Comparison with CD quality at 44.1 kHz (16 bit, stereo: 1.412 Mbps)
- Layer I: no perceptual difference at 384 kbps (stereo), about 3.7:1 compression
- Layer II: no perceptual difference at 256 kbps (stereo), about 5.5:1 compression
- Layer III: increase of mean opinion score compared to Layer II at 256 kbps (stereo), for 128 kbps (stereo), about 11:1 compression

Comparison between Layers
- Layers I and II are very similar
- Each layer has its own defined decoder design
- Encoding/decoding complexity: Layers I/II → broadcasting
- Encoding/decoding quality: Layer III → audio storage

Conclusion
Bibliography:
- Ambikairajah, E. et al. Auditory masking and MPEG-1 audio compression. Electronics & Communication Engineering Journal, 1997.
- Brandenburg, K. & Bosi, M. Overview of MPEG Audio: Current and Future Standards for Low-Bit-Rate Audio Coding. Journal of the Audio Engineering Society, 1997, Vol. 45, No. 1/2.
- Painter, T. & Spanias, A. Perceptual Coding of Digital Audio. Proceedings of the IEEE, 2000, Vol. 88, No. 4.
- Painter, T. & Spanias, A. A Review of Algorithms for Perceptual Coding of Digital Audio Signals. Digital Signal Processing, 1997.
- Pan, D. A Tutorial on MPEG/Audio Compression. IEEE MultiMedia, IEEE Computer Society, 1995, Vol. 2, No. 2, pp. 60-74.
- Pan, D.Y. Digital Audio Compression. Digital Technical Journal, 1993, Vol. 5.
- Kahrs, M. & Brandenburg, K. Applications of Digital Signal Processing to Audio and Acoustics. Kluwer Academic Publishers, 1998.
- Mallat, S. Traitement du Signal. Ecole Polytechnique, 2000.
- Pohlmann, K.C. Principles of Digital Audio. McGraw-Hill Professional, 2000.