PAC/AAC audio coding standard A. Moreno Georgia Institute of Technology ECE8873-Spring/2004

Slides:



Advertisements
Similar presentations
Frequency analysis.
Advertisements

Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.
MP3 Overview John Ehrhardt Elena Silenok CSE228 – Spring 03.
Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.
Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew
Psycho-acoustics and MP3 audio encoding
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 11 – MP3 and MP4 Audio (Part 7) Klara Nahrstedt Spring 2012.
Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.6 (Fr Oct 24) The MP3 algorithm with PAC.
MPEG/Audio Compression Tutorial Mike Blackstock CPSC 538a January 11, 2004.
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
MPEG Audio Formats Jason Leung Wednesday, February 5, 2014.
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Digital Audio Compression
Digital Audio Coding – Dr. T. Collins Standard MIDI Files Perceptual Audio Coding MPEG-1 layers 1, 2 & 3 MPEG-4.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG Further.
Week 6 – Psychoacoustics ESE 250 S’13 DeHon Kadric Kod Wilson-Shah 1 ESE250: Digital Audio Basics Week 6 February 19, 2013 Human Psychoacoustics.
Speech & Audio Processing
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
MPEG-3 For Audio Presented by: Chun Lui Sunjeev Sikand.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
Lecture 14: Spring 2007 MPEG Audio Compression
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Dolby AC-3 Audio Encoding & THX Wai Kam (Winnie) Henele Adams Peter Boettcher.
Audio Coding MPEG1 Layers I, II, III MPEG2MPEG4 Sherida Subrati Anthony Caliendo.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
An Overview of Perceptual Audio Coding and MPEG AAC
Introduction Research Methods Fall 2010 Tamás Bőhm.
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
A Full Frequency Masking Vocoder for Legal Eavesdropping Conversation Recording R. F. B. Sotero Filho, H. M. de Oliveira (qPGOM), R. Campello de Souza.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 15 – MP3 and MP4 Audio Klara Nahrstedt Spring 2014.
Ni.com Data Analysis: Time and Frequency Domain. ni.com Typical Data Acquisition System.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Digital Audio Watermarking: Properties, characteristics of audio signals, and measuring the performance of a watermarking system نيما خادمي کلانتري
Psycho- acoustics and MP3 audio encoding Physics of Music PHY103.
1 Basics of Audio Signal Processing Sudhir K. 2 Summary Slide  Digital Representation of Audio  Psycho-Acoustic principles  Lossy Compression of Audio.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Concepts of Multimedia Processing and Transmission IT 481, Lecture 6 Dennis McCaughey, Ph.D. 26 February, 2007.
AUDIO COMPRESSION msccomputerscience.com. The process of digitizing audio signals is called PCM PCM involves sampling audio signal at minimum rate which.
Week 7 Psychoacoustic Compression1ESE 250 – S’12 Kod & DeHon ESE250: Digital Audio Basics Week 7 February 23, 2012 Psychoacoustic Compression.
CMPT 365 Multimedia Systems
Media Representations - Audio
A Tutorial on MPEG/Audio Compression Davis Pan, IEEE Multimedia Journal, Summer 1995 Presented by: Randeep Singh Gakhal CMPT 820, Spring 2004.
Speech and Audio Coding Heejune AHN Embedded Communications Laboratory Seoul National Univ. of Technology Fall 2013 Last updated
MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.
Dhatchaini Rajendran Student ID: Date :
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
MPEG-1Standard By Alejandro Mendoza. Introduction The major goal of video compression is to represent a video source with as few bits as possible while.
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.
Perceptual Audio Coding The AT&T/Bell Labs view James D. Johnston Chief Scientist Neural Audio, Kirkland, Wa.
EE5359 Multimedia Processing Project Study and Comparison of AC3, AAC and HE-AAC Audio Codecs Dhatchaini Rajendran Student ID: Date :
Fletcher’s band-widening experiment (1940) Present a pure tone in the presence of a broadband noise. Present a pure tone in the presence of a broadband.
Fundamentals of Multimedia 2 nd ed., Chapter 14 Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Audio Codecs 14.4 MPEG-7.
Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.
MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD
MP3 and MP4 Audio By: Krunal Tailor
Fletcher’s band-widening experiment (1940)
PSYCHOACOUSTICS A branch of psychophysics
III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.
PCM (Pulse Code Modulation)
MPEG-1 Overview of MPEG-1 Standard
III Digital Audio III.6 (Mo Oct 22) The MP3 algorithm with PAC.
Govt. Polytechnic Dhangar(Fatehabad)
Presentation transcript:

PAC/AAC audio coding standard A. Moreno Georgia Institute of Technology ECE8873-Spring/2004

Overview  Audio Recording  Coding-ultimate goal  AAC Encoder Block Diagram  Principles of Psychoacoustics  Perceptual Entropy  Quantization and Coding  Samples

Introduction "If a tree falls in the forest with no one around to hear it, does it make a sound?"

Audio Recording  Edison, 1877

Audio Recording  Philips, 1978 A/D Converter PCM

Coding  Ultimate Goal: reduce the number of bits needed to represent the data. Bitrate = F sa x Wordlength

AAC Encoder Block Diagram Perceptual Model Gain Control MDCTTNS Multi-Channel M/S, Intensity Predictionz^-1 Quant Scale Factor Extract Iterative Rate Control Loop Entropy Coding Side information coding, Bitstream channel s(n)

Principles of Psychoacoustics  Source localization. Two ears are necessary. Brain uses intensity differences, and time delays between the two perceived signals.

Principles of Psychoacoustics inaudible audible Absolute Hearing Threshold

Principles of Psychoacoustics  Human Ear Loudness characteristic Robinson and Dadson equi-loudness contours.

Principles of Psychoacoustics  Critical Bands Concept introduced by Harvey Fletcher Frequency to Place Transform. Function of frequency that quantifies the cochlear filter passbands. Example: The critical band for a 1kHz is about 160Hz in width. A narrow band noise centered at 1kHz is perceived with the same loudness as long as the width < 160Hz.

Principles of Psychoacoustics  Simultaneous Masking: Frequency inaudible audible

Principles of Psychoacoustics Simplified Paradigms: Noise Masking Tone Tone Masking Noise 1Bark TH N 1Bark TH T K=3dB...5dB (constant)

Principles of Psychoacoustics 1Bark th Spread of Masking

Principles of Psychoacoustics  Masking: Temporal

Perceptual Entropy  Perceptual Entropy, objective metric of perceptually relevant introduced by J. Johnston The perceived information from an audio signal is only a fraction of the total information emanated by the source.

Perceptual Entropy  Procedure: 1.Window and transform to frequency. 2.Masking Threshold is computed using perceptual rules 3.A determination is made of the number of bits required to quantize the spectrum, without injecting perceptible noise.

Perceptual Entropy s(n) Hann Window MDCT Determine nature (Noise-like) (Tone-like) Apply Thresholding rules Spectral Flatness Measure Coefficient of ‘Tonality’ Offset JND Estimates

Perceptual Entropy i:index of critical band; bli, blh: lower and upper bounds of band i; ki:number of transform component in band i; Ti:masking threshold in band i; nint:rounding to the nearest integer.

Returning  "If a tree falls in the forest with no one around to hear it, does it make a sound?" From a Perceptual Coding standpoint, if no one can hear it, THERE IS NO TREE.

AAC Encoder Block Diagram Perceptual Model Gain Control MDCTTNS Multi-Channel M/S, Intensity Predictionz^-1 Quant Scale Factor Extract Iterative Rate Control Loop Entropy Coding Side information coding, Bitstream channel s(n)

Quantization and Coding  Power-law quantizer  Huffman Coding (table can be chosen)  Global Gain -> Quantization step size  Scale Factors-> noise shaping factor

Quantization and Coding while NOISE_CTL while FINDING_RATE Nr_bits= get_bits_needed(); if (Nr_bits > max_bits) adjust_global_gain(); else FINDING_RATE=0; end q_noise=get_quant_noise_level(); if (q_noise> Th(band)) adjust_band_scale_factor(); else NOISE_CTL=0; end

Samples Castanets Original 48kHz Stereo 128kbps AAC Stereo (48kHz) Piano Timpani

References [1] Ted Painter and Andreas Spanias. Perceptual coding of digital audio. Proceedings of the IEEE, 88(4): Abril [2] Karlheinz Brandenburg, MP3 and AAC explained, AES 17 th International Conference on High Quality Audio Coding, [3] J.D. Johnston, A.J. Ferreira, Sum-Difference Stereo Transform Coding, Proc. ICASSP [4] Deepen Sinha, James D. Johnston. Audio Compression at low bit rates using a Signal Adaptive switched Filterbank. Proc. of the ICASSP 1996, pp