MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch



Purpose of MPEG encoding
To decrease the data rate. How? Two choices:
– decrease the sample rate, but this would reduce the available bandwidth
– decrease the word size, but this introduces noise into the signal (lower S/N ratio)
Solution:
– perceptual coding → reduce the word size based on signal conditions
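The cost of shrinking the word size can be put in numbers. The sketch below uses the textbook result for an ideal uniform quantizer driven by a full-scale sine wave (roughly 6 dB of S/N per bit). That formula is not on the slides and the function name is made up for illustration.

```python
import math

def quantization_snr_db(bits: int) -> float:
    """Theoretical S/N of an ideal uniform quantizer with a full-scale sine input.

    Equivalent to the familiar 6.02 * bits + 1.76 dB rule of thumb.
    """
    return 20 * math.log10(2 ** bits) + 10 * math.log10(1.5)

# Each bit removed from the word costs about 6 dB of S/N.
for bits in (16, 12, 8):
    print(f"{bits:2d} bits -> {quantization_snr_db(bits):.1f} dB")
```

Perceptual coding accepts this added noise but tries to keep it below what the ear can detect under the current signal conditions.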

Quick Overview
MPEG removes “irrelevancy” and statistical redundancy
– lossy (but not perceptibly so)
– 1.41 Mbps (CD audio) → between 64 and 448 kbps (95% to 68% reduction)
– ratios of 4:1 and 6:1 can be transparent in advanced listening tests
– supports 32, 44.1, and 48 kHz sampling rates
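The figures above can be checked with a few lines of arithmetic. This sketch assumes CD audio is 44.1 kHz, 16-bit, stereo PCM. The helper names are illustrative only.

```python
def pcm_bitrate_bps(sample_rate_hz: int, bits_per_sample: int, channels: int) -> int:
    """Raw PCM data rate in bits per second."""
    return sample_rate_hz * bits_per_sample * channels

# Assumed CD parameters: 44.1 kHz, 16 bits, 2 channels.
CD_BPS = pcm_bitrate_bps(44_100, 16, 2)  # 1,411,200 bps, i.e. about 1.41 Mbps

def reduction_percent(coded_bps: int, source_bps: int = CD_BPS) -> float:
    """Percentage of the source data rate removed by coding."""
    return 100 * (1 - coded_bps / source_bps)

for kbps in (64, 448):
    print(f"{kbps} kbps: {reduction_percent(kbps * 1000):.0f}% reduction, "
          f"ratio {CD_BPS / (kbps * 1000):.1f}:1")
```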

MPEG-1 Types
3 layers: I, II, and III
– I is the simplest → III is the most complex
– a decoder for a given layer can also play encodings of the layers beneath it, e.g. a Layer III decoder can play I, II, and III; a Layer II decoder can play only I and II

Components
There are two “components” to MPEG-1: the encoder and the decoder.
– the decoder is what the specification actually describes; the encoder is not
– improvements to the encoder take effect immediately in quality, without requiring corresponding changes to the decoder

Encoder vs. Decoder
Encoder
– does all the work
– forward adaptive encoding: all allocation of bits is performed by the encoder
– the psychoacoustic model used to determine “irrelevant” data is contained here
– improving the psychoacoustic model or otherwise changing the encoder doesn’t require changing the decoder
Decoder
– does less work

Encoder Details
audio (PCM) passes through a polyphase filter bank, splitting the signal into 32 bands
– the filter bank outputs one sample per band for every 32 input samples
– layer I: after each band has received 12 samples, the encoder determines the bit allocation for that band
– layer II: operates on 12 x 3 = 36 samples per band (larger frame). Lower bands may receive 15 bits, middle bands 7 bits, and high bands 3 bits max
– layer III is different; we’ll come back to it
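The sample counts above fix the frame sizes. The following sketch works out how many PCM samples one frame covers and how wide each band is, assuming the 32 bands split the range from 0 to half the sample rate equally. The names are illustrative only.

```python
SUBBANDS = 32
SAMPLES_PER_BAND = {"I": 12, "II": 36}  # per frame, as stated on the slide

def frame_length_samples(layer: str) -> int:
    """PCM input samples covered by one frame (one band sample per 32 inputs)."""
    return SUBBANDS * SAMPLES_PER_BAND[layer]

def frame_duration_ms(layer: str, sample_rate_hz: int) -> float:
    """Duration of one frame in milliseconds."""
    return 1000 * frame_length_samples(layer) / sample_rate_hz

def subband_width_hz(sample_rate_hz: int) -> float:
    """Width of each band, assuming 32 equal bands from 0 to fs/2."""
    return sample_rate_hz / 2 / SUBBANDS

print(frame_length_samples("I"), frame_duration_ms("I", 48_000))    # layer I at 48 kHz
print(frame_length_samples("II"), frame_duration_ms("II", 48_000))  # layer II at 48 kHz
print(subband_width_hz(48_000))
```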

Encoder Details
An FFT is performed (with a Hann window)
– 512-point for layer I
– 1024-point for layer II
A psychoacoustic model analyzes the FFT output to calculate masking thresholds, which determine which components are audible (i.e. the signal-to-mask ratio, SMR)
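As a rough illustration of the analysis step, the sketch below applies a Hann window to a 512-sample frame and computes its power spectrum in dB. It uses a plain DFT from the standard library instead of a real FFT, and it stops short of any masking calculation. The function names and the test tone are assumptions made for the example, not part of the standard.

```python
import cmath
import math

def hann(n: int) -> list[float]:
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]

def power_spectrum_db(frame: list[float]) -> list[float]:
    """Hann-windowed power spectrum (bins 0..n/2) in dB, via a naive DFT."""
    n = len(frame)
    windowed = [s * w for s, w in zip(frame, hann(n))]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(10 * math.log10(max(abs(acc) ** 2, 1e-12)))  # floor avoids log(0)
    return spectrum

# Hypothetical input: a pure tone centred on bin 32 of a 512-point frame.
N = 512
tone = [math.sin(2 * math.pi * 32 * t / N) for t in range(N)]
spec = power_spectrum_db(tone)
peak_bin = spec.index(max(spec))
print(peak_bin)
```

A psychoacoustic model would take a spectrum like this as input and derive a masking threshold per band from it. That part is deliberately left out here.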

More details...

How bits are allocated
– the data in the band is coded, NOT the FFT data
– the more “audible” components (i.e. those highest above the masking threshold) are assigned the most bits
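One way to picture this is a greedy loop that keeps giving one more bit to whichever band's quantization noise sits furthest above its masking threshold. The sketch below is a simplification under stated assumptions, not the allocation procedure of the standard: it assumes about 6 dB of S/N per bit and a flat 15-bit cap, it stops once all noise is masked, and it ignores the real quantizer tables. All names and input values are hypothetical.

```python
def allocate_bits(smr_db, bit_pool, samples_per_band=12, max_bits=15, db_per_bit=6.02):
    """Greedy bit allocation driven by the signal-to-mask ratio (SMR) of each band.

    Assumptions: each extra bit per sample buys ~6 dB of S/N, and one extra bit
    for a band costs `samples_per_band` bits from the pool.
    """
    bits = [0] * len(smr_db)
    while bit_pool >= samples_per_band:
        # Mask-to-noise ratio: negative means the noise is still audible.
        mnr = [b * db_per_bit - s for b, s in zip(bits, smr_db)]
        candidates = [i for i, b in enumerate(bits) if b < max_bits]
        if not candidates:
            break
        worst = min(candidates, key=lambda i: mnr[i])
        if mnr[worst] >= 0:
            break  # noise in every band is already below its masking threshold
        bits[worst] += 1
        bit_pool -= samples_per_band
    return bits

# Hypothetical SMR values: one strongly audible band, one weak, one fully masked.
print(allocate_bits([20.0, 5.0, -3.0], bit_pool=120))
```

The band with the highest SMR ends up with the most bits, and the band whose signal is already below the mask gets none.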

Encoder Details
A scale factor is calculated
– the largest sample value in the band for each frame is found; each of the 12 samples in the band is divided by this factor
– layer II has 3 scale factors (for 3 groups of 12 samples), but one may suffice if the differences are small
– the scale factor corresponds to the max. SPL in each band
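The normalization step can be sketched as follows. This version uses the raw peak value as the scale factor. A real encoder would pick the scale factor from a fixed table, which is omitted here. The function names and sample values are illustrative only.

```python
def normalize_band(samples):
    """Return (scale, normalized): divide the band's samples by their peak magnitude."""
    scale = max(abs(s) for s in samples)
    if scale == 0:
        return 0.0, list(samples)  # silent band: nothing to normalize
    return scale, [s / scale for s in samples]

def restore_band(scale, normalized):
    """Decoder side: multiply the samples by the scale factor again."""
    return [scale * s for s in normalized]

# Hypothetical group of 12 band samples.
band = [0.1, -0.4, 0.2] + [0.0] * 9
scale, normalized = normalize_band(band)
restored = restore_band(scale, normalized)
print(scale, max(abs(s) for s in normalized))
```

After normalization every band uses the full range of the quantizer, whatever its original level.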

Encoder Schematic

Encoder Details (layer III)
layer III:
– each band is transformed into 18 spectral coefficients with an MDCT (50% overlap)
– this gives 576 coefficients, each representing a BW of […] at 48 kHz → 24 ms
– the window size of the MDCT is variable: long windows (36 samples) for steady-state signals, short windows (12 samples) for transients
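The layer III numbers can be worked through as arithmetic. The sketch assumes the 576 coefficients evenly divide the range from 0 to half the sample rate. It also assumes the 24 ms figure refers to a 1152-sample frame (two blocks of 576), which comes from outside the slide. The function names are illustrative.

```python
SUBBANDS = 32
COEFFS_PER_BAND = 18

def mdct_coefficients() -> int:
    """Spectral coefficients per block: 32 bands x 18 coefficients."""
    return SUBBANDS * COEFFS_PER_BAND

def coefficient_bandwidth_hz(sample_rate_hz: int) -> float:
    """Bandwidth per coefficient, assuming an even split of 0..fs/2."""
    return (sample_rate_hz / 2) / mdct_coefficients()

def duration_ms(num_samples: int, sample_rate_hz: int) -> float:
    """Time covered by a run of PCM samples, in milliseconds."""
    return 1000 * num_samples / sample_rate_hz

print(mdct_coefficients())
print(round(coefficient_bandwidth_hz(48_000), 2))
# Assumed frame of 2 x 576 = 1152 samples.
print(duration_ms(2 * mdct_coefficients(), 48_000))
```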

Encoder Details (layer III)
– framerate varies in layer III
– can also use a bit reservoir if more accuracy is needed
– Huffman encoding is employed

Encoder
a portion of the data stream is consumed by coding info:
– headers
– bit allocation info
– scale factors
– samples from each band

Other Features
stereo joint coding
– stereophonic irrelevance/redundancy eliminated
– sum and difference signals (layer III)
– L/R high-frequency band samples summed into one channel, but scale factors remain independent

Decoder Details
Put the signal back together:
– decode bit allocation info
– samples are multiplied by scale factors and run through an inverse filter bank
– delays typically range from 10 to 30 ms

Decoder schematic

Summary
– split the signal into 32 bands
– determine max. SPL levels for each band
– FFT to calculate masking thresholds; determine the global masking curve
– calculate SMR for each band and assign bits accordingly