III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.



The MP3 encoder chain

1. Digital Datastream
2. FFT with Filter Bank
3. Psychoacoustical Model (Perceptual-Audio-Coding Model PAC)
4. Quantization
5. Huffman Compression
6. Frame Outputstream Formatting

[Block diagram. Blocks: Audio Data, Filter Bank (32 Subbands), Psychoacoustical Model, Quantization and Encoding (Check of Quantization loop), External Check, Encoding of Additional Information, Datastream Formatting to Frames etc., Additional Data, Data Stream. Further labels: 2*16, to Line.]

The MP3 encoder chain

1. Digital Datastream

2 channels ~ stereo
768 kbit/s ~ 48 000 × 16 b/s per channel
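The bit-rate figure on this slide can be checked with a few lines of arithmetic. This is only a sketch: the function name `pcm_bitrate` is made up for illustration, and the stereo total assumes that both channels are sampled at 48 000 Hz with 16 bits each.

```python
def pcm_bitrate(sample_rate_hz: int, bits_per_sample: int, channels: int = 1) -> int:
    """Return the uncompressed PCM bit rate in bits per second."""
    return sample_rate_hz * bits_per_sample * channels


# One channel at 48 000 samples/s with 16 bits per sample.
mono = pcm_bitrate(48_000, 16)
# Assumption: stereo simply doubles the single-channel rate.
stereo = pcm_bitrate(48_000, 16, channels=2)

print(mono // 1000, "kbit/s per channel")
print(stereo // 1000, "kbit/s for stereo")
```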

The MP3 encoder chain

2. FFT with Filter Bank

Important: Since the number of samples per second equals the number of Fourier coefficients per second, we speak of "Fourier samples per second".

2.1 Cut the spectrum 0 – 20 kHz into 32 subbands of 625 Hz each (32 × 625 = 20 000) for 1/40 sec windows.
2.2 Use the MDCT (Modified Discrete Cosine Transform, a variant of the FFT) to split each 625 Hz band into 18 subbands with variable widths, according to psychoacoustical criteria. This gives 576 = 18 × 32 "lines".
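The band layout of steps 2.1 and 2.2 can be sketched as follows, using the slide's own numbers (32 subbands of 625 Hz, 18 lines per subband). The helper names `subband_edges` and `subband_of` are hypothetical, and the sketch only does the bookkeeping; it does not implement the filter bank or the MDCT.

```python
BANDWIDTH_HZ = 625        # width of one subband, as stated on the slide
NUM_SUBBANDS = 32         # number of filter-bank subbands
LINES_PER_SUBBAND = 18    # MDCT lines per subband


def subband_edges(index: int) -> tuple[int, int]:
    """Return the (low, high) frequency edges in Hz of subband `index` (0-based)."""
    if not 0 <= index < NUM_SUBBANDS:
        raise ValueError("subband index out of range")
    low = index * BANDWIDTH_HZ
    return low, low + BANDWIDTH_HZ


def subband_of(frequency_hz: float) -> int:
    """Return the 0-based index of the subband that contains `frequency_hz`."""
    index = int(frequency_hz // BANDWIDTH_HZ)
    if not 0 <= index < NUM_SUBBANDS:
        raise ValueError("frequency outside the 0-20 kHz range")
    return index


total_lines = NUM_SUBBANDS * LINES_PER_SUBBAND

print(subband_edges(0), subband_edges(NUM_SUBBANDS - 1))
print(subband_of(4000), total_lines)
```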

The MP3 encoder chain

4. Lossy Quantization
5. Huffman lossless Compression

Already discussed. 40 % of compression.
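As a reminder of what step 5 does, here is a minimal Huffman coding sketch. It builds a code from the symbol counts of a short list of quantized values; the input values are invented for illustration. Note the assumption: this shows the general technique only, whereas MP3 itself selects from predefined Huffman tables rather than building a new code for each signal.

```python
import heapq
from collections import Counter


def huffman_code(symbols):
    """Build a prefix-free code {symbol: bit string} from symbol frequencies."""
    counts = Counter(symbols)
    if not counts:
        return {}
    if len(counts) == 1:
        # A single distinct symbol still needs one bit per occurrence.
        return {next(iter(counts)): "0"}
    # Heap entries: (count, tie-breaker, partial code table for that subtree).
    heap = [(count, i, {sym: ""}) for i, (sym, count) in enumerate(counts.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        count0, _, table0 = heapq.heappop(heap)
        count1, _, table1 = heapq.heappop(heap)
        # Merge the two rarest subtrees, prefixing their codes with 0 and 1.
        merged = {sym: "0" + code for sym, code in table0.items()}
        merged.update({sym: "1" + code for sym, code in table1.items()})
        heapq.heappush(heap, (count0 + count1, tie, merged))
        tie += 1
    return heap[0][2]


def encode(symbols, code):
    """Concatenate the code words of all symbols."""
    return "".join(code[sym] for sym in symbols)


def decode(bits, code):
    """Invert `encode`; works because the code is prefix-free."""
    inverse = {word: sym for sym, word in code.items()}
    decoded, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:
            decoded.append(inverse[current])
            current = ""
    return decoded


# Hypothetical quantized values: small magnitudes dominate after quantization.
quantized = [0, 0, 0, 0, 1, 0, -1, 0, 0, 2, 0, 1]
code = huffman_code(quantized)
bits = encode(quantized, code)
print(len(bits), "bits instead of", 2 * len(quantized), "with a fixed 2-bit code")
```

Frequent values get short code words, so the total length drops without losing any information.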

The MP3 encoder chain

3. Psychoacoustical Model (Perceptual-Audio-Coding Model PAC)

The MP3 encoder chain

Psychoacoustical Model (Perceptual-Audio-Coding Model PAC) = core feature of MP3; it covers 60 % of MP3 compression.

The PAC Model is based upon three limitations of human audio perception:

PAC 1: hearing thresholds
PAC 2: auditory masking
PAC 3: temporal masking

All three PAC components generate lossy compression.

The MP3 encoder chain

PAC 1: hearing thresholds

You don't hear sinusoidal sounds below this threshold of loudness.

[Figure: hearing threshold curve; axes: loudness against frequency (kHz).]
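The threshold curve can be approximated numerically. The sketch below uses Terhardt's widely quoted approximation of the threshold in quiet (in dB SPL); this formula is not taken from the slides and is an assumption here, as are the function names. It only illustrates the idea that a component below the curve can be dropped.

```python
import math


def threshold_in_quiet_db(frequency_hz: float) -> float:
    """Approximate hearing threshold in quiet (dB SPL), Terhardt's formula."""
    if frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    khz = frequency_hz / 1000.0
    return (
        3.64 * khz ** -0.8
        - 6.5 * math.exp(-0.6 * (khz - 3.3) ** 2)
        + 1e-3 * khz ** 4
    )


def is_audible(frequency_hz: float, level_db: float) -> bool:
    """A sinusoid is treated as audible only if it lies above the threshold."""
    return level_db > threshold_in_quiet_db(frequency_hz)


for f in (100, 1000, 3300, 15000):
    print(f, "Hz:", round(threshold_in_quiet_db(f), 1), "dB")
```

The ear is most sensitive around 3 to 4 kHz, so the threshold is lowest there and rises towards both ends of the spectrum.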

The MP3 encoder chain

PAC 2: auditory masking

For every sinusoidal frequency component of frequency f and loudness l, there is a surrounding masking surface, where other frequency/loudness components cannot be heard together with the given one.

Example: the 4 kHz/40 dB component (red) masks the blue one.

[Figure: masking surface around the red component; axes: loudness against frequency.]
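A toy version of the masking surface can be written as a triangle around the masker on the Bark scale. Everything numeric here is an assumption for illustration: the slopes and the offset are hypothetical values, not those of the MP3 psychoacoustic model, and the probe components are invented. Only the Hz-to-Bark conversion is the standard Zwicker approximation.

```python
import math

# Hypothetical parameters of the toy masking triangle (not from the slides).
LOWER_SLOPE_DB_PER_BARK = 27.0   # steep fall-off below the masker frequency
UPPER_SLOPE_DB_PER_BARK = 10.0   # shallower fall-off above the masker frequency
MASKER_OFFSET_DB = 10.0          # peak of the triangle lies this far below the masker


def hz_to_bark(frequency_hz: float) -> float:
    """Zwicker's approximation of the critical-band (Bark) scale."""
    return 13.0 * math.atan(0.00076 * frequency_hz) + 3.5 * math.atan(
        (frequency_hz / 7500.0) ** 2
    )


def masking_threshold_db(masker_hz: float, masker_db: float, probe_hz: float) -> float:
    """Level below which a probe at `probe_hz` is hidden by the masker."""
    distance = hz_to_bark(probe_hz) - hz_to_bark(masker_hz)
    slope = UPPER_SLOPE_DB_PER_BARK if distance >= 0 else LOWER_SLOPE_DB_PER_BARK
    return masker_db - MASKER_OFFSET_DB - slope * abs(distance)


def is_masked(masker_hz, masker_db, probe_hz, probe_db) -> bool:
    """True if the probe lies under the masker's triangle."""
    return probe_db < masking_threshold_db(masker_hz, masker_db, probe_hz)


# The slide's masker: 4 kHz at 40 dB. The probes are invented examples.
print(is_masked(4000, 40, 4200, 20))  # nearby, quieter component
print(is_masked(4000, 40, 1000, 20))  # far-away component
```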

The MP3 encoder chain

PAC 3: temporal masking

For every sinusoidal frequency component of frequency f and loudness l (red), a subsequent component (blue) cannot be heard if it lies below the given curve of loudness in time, because the ear needs some time to "recover" from perceiving the first component.

This is even true for sounds shortly before the given one (red curve), because the perception needs time to build up!

[Figure: temporal masking curve; axes: loudness against time.]
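The time curve can be mimicked with a very crude model: the masking level falls off linearly (in dB) on both sides of the masker, slowly after it and quickly before it. The window lengths below are rough assumed orders of magnitude, not values from the slides or from the MP3 standard, and the linear shape is a simplification chosen only to keep the sketch short.

```python
# Assumed window lengths (illustrative only).
PRE_MASKING_MS = 20.0     # masking of sounds shortly before the masker
POST_MASKING_MS = 200.0   # masking of sounds after the masker


def temporal_masking_threshold_db(masker_db: float, dt_ms: float) -> float:
    """Masking level at time offset `dt_ms` relative to the masker.

    Negative `dt_ms` means the probe comes before the masker.
    Outside the masking window nothing is masked (minus infinity).
    """
    window = POST_MASKING_MS if dt_ms >= 0 else PRE_MASKING_MS
    remaining = 1.0 - abs(dt_ms) / window
    if remaining <= 0:
        return float("-inf")
    return masker_db * remaining


def is_temporally_masked(masker_db: float, dt_ms: float, probe_db: float) -> bool:
    """True if the probe lies under the masker's curve in time."""
    return probe_db < temporal_masking_threshold_db(masker_db, dt_ms)


print(is_temporally_masked(60, 50, 30))    # 50 ms after a 60 dB masker
print(is_temporally_masked(60, 300, 30))   # 300 ms after: ear has recovered
print(is_temporally_masked(60, -10, 20))   # 10 ms before the masker
```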

The MP3 encoder chain

6. Frame Outputstream Formatting