Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.

Slides:



Advertisements
Similar presentations
Alex Chen Nader Shehad Aamir Virani Erik Welsh
Advertisements

Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.
MP3 Overview John Ehrhardt Elena Silenok CSE228 – Spring 03.
Copyright 2003, Marchany Hiding Text in MP3 Files Randy Marchany VA Tech Computing Center Blacksburg, VA
Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew
Psycho-acoustics and MP3 audio encoding
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 11 – MP3 and MP4 Audio (Part 7) Klara Nahrstedt Spring 2012.
Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.6 (Fr Oct 24) The MP3 algorithm with PAC.
Developement and Implementation of an MPEG1 Layer III Decoder on x86 and TMS320C6711 platforms Braidotti Enrico (Farina Simone)
MPEG/Audio Compression Tutorial Mike Blackstock CPSC 538a January 11, 2004.
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
Audio Coding Team Member: ChungMing Yan, Chun Tong.
CGMB324: Multimedia System Design
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Digital Audio Compression
Digital Audio Coding – Dr. T. Collins Standard MIDI Files Perceptual Audio Coding MPEG-1 layers 1, 2 & 3 MPEG-4.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
DIGITAL COMMUNICATIONS.  The modern world is dependent on digital communications.  Radio, television and telephone systems were essentially analog in.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
PAC/AAC audio coding standard A. Moreno Georgia Institute of Technology ECE8873-Spring/2004
1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG Further.
Audiovisual digital documents Adolf Knoll National Library of the Czech Republic
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
MPEG-3 For Audio Presented by: Chun Lui Sunjeev Sikand.
Department of Computer Engineering University of California at Santa Cruz Data Compression (3) Hai Tao.
Department of Computer Engineering University of California at Santa Cruz Video Compression Hai Tao.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Audio Coding MPEG1 Layers I, II, III MPEG2MPEG4 Sherida Subrati Anthony Caliendo.
Department of Computer Engineering University of California at Santa Cruz Data Compression (2) Hai Tao.
11/11/03CSE 100 – Info Technology & Its Impact on Society1 MP-3 Compression: How it works.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
Audio CompressiontMyn1 Audio Compression Audio compression has become well entrenched in consumer and professional digital audio products such as the compact.
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
Introduction to Digital Audio
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 15 – MP3 and MP4 Audio Klara Nahrstedt Spring 2014.
Audio Fundamentals Lesson 2 Sound, Sound Wave and Sound Perception
DIGITAL WATERMARKING OF AUDIO SIGNALS USING A PSYCHOACOUSTIC AUDITORY MODEL AND SPREAD SPECTRUM THEORY By: Ricardo A. Garcia University of Miami School.
Psycho- acoustics and MP3 audio encoding Physics of Music PHY103.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
By: T’quoia Boyd Science Glossary Encoder- a part in MP3 that turns messages into codes Polyphase filter bank-a part used in MP3 to separate sound.
Media Representations - Audio
A Tutorial on MPEG/Audio Compression Davis Pan, IEEE Multimedia Journal, Summer 1995 Presented by: Randeep Singh Gakhal CMPT 820, Spring 2004.
Multimedia Data Speech and Audio Dr Sandra I. Woolley Electronic, Electrical and Computer Engineering.
MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.
Dhatchaini Rajendran Student ID: Date :
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
MPEG-1Standard By Alejandro Mendoza. Introduction The major goal of video compression is to represent a video source with as few bits as possible while.
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
Guerino Mazzola (Fall 2015 © ): Introduction to Music Technology IIIDigital Audio III.7 (M Nov 04) The MP3 frame format.
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.
EE5359 Multimedia Processing Project Study and Comparison of AC3, AAC and HE-AAC Audio Codecs Dhatchaini Rajendran Student ID: Date :
Submitted To: Submitted By: Seminar On Digital Audio Broadcasting.
MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD
MP3 and MP4 Audio By: Krunal Tailor
III Digital Audio III.7 (W Nov 04) The MP3 frame format.
Objective and Subjective Audio Assessment of MP3 Players’ Quality
III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.
III Digital Audio III.7 (F Oct 20) The MP3 frame format.
Chapter 2 Signal Sampling and Quantization
III Digital Audio III.7 (Mo Oct 22) The MP3 frame format.
MPEG-1 Overview of MPEG-1 Standard
III Digital Audio III.6 (Mo Oct 22) The MP3 algorithm with PAC.
Govt. Polytechnic Dhangar(Fatehabad)
DIGITAL WATERMARKING OF AUDIO SIGNALS USING A PSYCHOACOUSTIC AUDITORY MODEL AND SPREAD SPECTRUM THEORY By: Ricardo A. Garcia University of Miami School.
Presentation transcript:

Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao

Department of Computer Engineering University of California at Santa Cruz History n MP3 originally developed by Fraunhofer IIS partly under EUREKA project EU147, Digital Audio Broadcasting (DAB). Then adopted as ISO-MPEG Audio Layer-3 (1991, There are Layer 1 and 2 also) n For 44.1 KHz sampling, stereo 16 bit audio, the bit rate is 44100x2x16=1.41Mbit/s, Using MP3, good quality sound can be achieved even with >1:24 compression ration n Layer 3 is the most advanced in all three methods 1:4by Layer 1 (corresponds to 384 kbps for a stereo signal), 1:6...1:8by Layer 2 (corresponds to kbps for a stereo signal), 1:10...1:1 2 by Layer 3 (corresponds to kbps for a stereo signal),

Department of Computer Engineering University of California at Santa Cruz Performance n Typical performance for various sources sound qualitybandwidthmodebitratereduction ratio telephone sound2.5 kHzmono8 kbps *96:1 better than short wave4.5 kHzmono16 kbps48:1 better than AM radio7.5 kHzmono32 kbps24:1 similar to FM radio11 kHzstereo kbps :1 near-CD15 kHzstereo96 kbps16:1 CD>15 kHzstereo kbps14..12:1

Department of Computer Engineering University of California at Santa Cruz Basic Diagram

Department of Computer Engineering University of California at Santa Cruz Psycho-Acoustic Perceptual Model n The basic perceptual model used in MP3 is that louder frequencies mask out adjacent quieter ones. People can not hear a quiet sound at one frequency if there is a loud sound at another n This can be explained better by the following figures presented by Rapha Depke

Department of Computer Engineering University of California at Santa Cruz Audio perception n Human perception of audio signal

Department of Computer Engineering University of California at Santa Cruz Perceptual model n People can not hear a quiet sound at one frequency if there is a loud sound at another

Department of Computer Engineering University of California at Santa Cruz Perceptual model n The louder the sound, the larger the mask area n The bottom area is called absolute threshold of hearing (ATH), defining the weakest sound that can be heard

Department of Computer Engineering University of California at Santa Cruz Filter Banks n The audio signal passes through 32 filters with different frequency

Department of Computer Engineering University of California at Santa Cruz Dominant band and the mask n Dominant band is found and the corresponding mask is applied

Department of Computer Engineering University of California at Santa Cruz Quantization of Audible Sound n The components exceed the mask are quantized and encoded using the Huffman coding method

Department of Computer Engineering University of California at Santa Cruz Joint Stereo n Joint stereo coding takes advantage of the fact that both channels of a stereo channel pair contain similar information n These stereophonic irrelevancies and redundancies are exploited to reduce the total bitrate n Joint stereo is used in cases where only low bitrates are available but stereo signals are desired.

Department of Computer Engineering University of California at Santa Cruz Encoder n A typical solution has two nested iteration loops Distortion/Noise control loop (outer loop) Rate control loop (inner loop)

Department of Computer Engineering University of California at Santa Cruz Rate control loop n For a given bit rate allocation, adjust the quantization steps to achieve the bit rate. This loop checks if the number of bits resulting from the coding operation exceeds the number of bits available to code a given block of data. If it is true, then the quantization step is increased to reduce the total bits. This can be achieved by adjusting the global gain

Department of Computer Engineering University of California at Santa Cruz Distortion control loop n This loop shape the quantization steps according to the perceptual mask threshold Start with a default factor 1.0 for every band If the quantization error in a band exceeds the mask threshold, the scale factor is adjusted to reduce this quantization error This will cause more bits and the rate control loop has to be invoked every time the scale factors are changed The distortion control is executed until the noise level is below the perceptual mask for every band

Department of Computer Engineering University of California at Santa Cruz Decoder n Decoder side relatively easier. The gain, scale factor, quantization steps recovered and used for reconstruct the filter bank responses n Filter bank responses are combined to reconstruct the decoded audio signal