CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.

Slides:



Advertisements
Similar presentations
Part II (MPEG-4) Audio TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jörgen Ahlberg.
Advertisements

Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.
Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.
Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 11 – MP3 and MP4 Audio (Part 7) Klara Nahrstedt Spring 2012.
Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.6 (Fr Oct 24) The MP3 algorithm with PAC.
MPEG/Audio Compression Tutorial Mike Blackstock CPSC 538a January 11, 2004.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
Audio Coding Team Member: ChungMing Yan, Chun Tong.
Sound can make multimedia presentations dynamic and interesting.
4.1Different Audio Attributes 4.2Common Audio File Formats 4.3Balancing between File Size and Audio Quality 4.4Making Audio Elements Fit Our Needs.
2 Information Processing
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Digital Audio Compression
Digital Audio Coding – Dr. T. Collins Standard MIDI Files Perceptual Audio Coding MPEG-1 layers 1, 2 & 3 MPEG-4.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 
Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG Further.
Speech & Audio Processing
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
MPEG-3 For Audio Presented by: Chun Lui Sunjeev Sikand.
Lecture 14: Spring 2007 MPEG Audio Compression
Audio compression zAlgorithms. zStandards.. Coding gain zRatio of uncompressed size to compressed size. zSources: yLossless coding. yLossy perceptual.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
EET 450 Chapter 18 – Audio. Analog Audio Sound is analog Consists of air pressure that has a variety of characteristics  Frequencies  Amplitude (loudness)
T.Sharon-A.Frank 1 Multimedia Image Compression 2 T.Sharon-A.Frank Coding Techniques – Hybrid.
Chapter 9 Audio.
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
Audio CompressiontMyn1 Audio Compression Audio compression has become well entrenched in consumer and professional digital audio products such as the compact.
Digital Audio Multimedia Systems (Module 1 Lesson 1)
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Digital Sound and Video Chapter 10, Exploring the Digital Domain.
Fundamentals Rawesak Tanawongsuwan
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.
MPEG: (Moving Pictures Expert Group) A Video Compression Standard for Multimedia Applications Seo Yeong Geon Dept. of Computer Science in GNU.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Media Representations - Audio
A Tutorial on MPEG/Audio Compression Davis Pan, IEEE Multimedia Journal, Summer 1995 Presented by: Randeep Singh Gakhal CMPT 820, Spring 2004.
Multimedia Data Speech and Audio Dr Sandra I. Woolley Electronic, Electrical and Computer Engineering.
Multimedia Elements: Sound, Animation, and Video.
MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.
Dhatchaini Rajendran Student ID: Date :
Multimedia Technology and Applications Chapter 2. Digital Audio
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 9 This presentation © 2004, MacAvon Media Productions Sound.
Chapter 4 Audio and video compression
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
More Meaningful Jargon Or, All You Need to Know to Speak Like a Geek Sound.
Physics 1251 The Science and Technology of Musical Sound Unit 4 Session 42 MWF Computer Music Unit 4 Session 42 MWF Computer Music.
Image Processing Architecture, © 2001, 2002, 2003 Oleh TretiakPage 1 ECE-C490 Image Processing Architecture MP-3 Compression Course Review Oleh Tretiak.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
MMDB-8 J. Teuhola Audio databases About digital audio: Advent of digital audio CD in Order of magnitude improvement in overall sound quality.
MPEG-1Standard By Alejandro Mendoza. Introduction The major goal of video compression is to represent a video source with as few bits as possible while.
CSCI-100 Introduction to Computing Hardware Part II.
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
Interactive Multimedia Sound Mikael Fernström. Data sources Microphones and transducers –Sample acoustic reality Synthesis –Simulate reality (and beyond.
EE5359 Multimedia Processing Project Study and Comparison of AC3, AAC and HE-AAC Audio Codecs Dhatchaini Rajendran Student ID: Date :
UNIT V. Linear Predictive coding With the advent of inexpensive digital signal processing circuits, the source simply analyzing the audio waveform to.
Fundamentals of Multimedia 2 nd ed., Chapter 14 Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Audio Codecs 14.4 MPEG-7.
MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD
MP3 and MP4 Audio By: Krunal Tailor
Audio Compression.
III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.
Sound Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman
MPEG-1 Overview of MPEG-1 Standard
III Digital Audio III.6 (Mo Oct 22) The MP3 algorithm with PAC.
Govt. Polytechnic Dhangar(Fatehabad)
Presentation transcript:

CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007

CS335 Principles of Multimedia Systems Digital Audio  Audio comes from different sources: –Speech. –Sounds of instruments, Music. –Sounds of all other kinds (the sound of wind, train and ocean).  Audio needs new methods for coding and processing.  Audio processing is a key task in multimedia systems –Audio coding (MPEG audio, mp3, AAC and others) –Authoring and representation (composition) –Analysis and searching (retrieval and database) –3D sound, etc.  We will focus on basic audio processing, MPEG audio and related topics.

CS335 Principles of Multimedia Systems Audio Processing  Audio authoring Audio file formats: Waveform files and MIDI. MIDI: Musical Instrument Digital Interface. Instead of storing the waveform samples, MIDI file has a sequence of commands to control an audio device to generate a specified note with given properties.

CS335 Principles of Multimedia Systems Audio Processing Using Matlab  To load a wave in Windows: audat = wavread(‘filename.wav’) ; Or, directly open the file and load a stream of “words” (2 bytes) or bytes depending on the wav format.  To play a sound, use sound(audat, samplingrate).  To display the spectrogram, use specgram.  Audio analysis are done in frames of 20ms – 40ms long.

CS335 Principles of Multimedia Systems Frequency Domain Analysis  Fourier transform can be used to decompose any signal into summation of sinusoidal waves.  In Matlab, we can use fft (Fast Fourier Transform) for frequency domain analysis. The time domain waveform The frequency Domain components. Base frequency ¼ 1/T T

CS335 Principles of Multimedia Systems MP3 and Others  MPEG (Motion Picture Expert Group) and ISO (International Standard Organization) have published several standards about digital audio coding. –MPEG-1 Layer 1,2 and 3 (MP3) –MPEG2 AAC –MPEG4 AAC and TwinVQ  Other standards –Dolby AC3  They have been widely used in consumer electronics, digital audio broadcasting, DVD and movies etc.

CS335 Principles of Multimedia Systems Perceptual Coding in MPEG Encoder FFT Masking Threshold Dynamic bit allocation MUX Encoder Dynamic bit allocation Bit stream audio

CS335 Principles of Multimedia Systems Simultaneous Masking Hz dB Masking threshold Masker Sound pressure level Threshold in quiet  A strong audio component can mask its nearby frequency components.

CS335 Principles of Multimedia Systems Masking and Quantization Hz dB Masker Sound pressure level 20 Critical band A Neighbor critical band Minimum masking threshold for band A. Signal To mask ratio m-bit quantizer SNR m+1-bit quantizer SNR A critical band defines the “resolution” of the hearing at some frequency location.

CS335 Principles of Multimedia Systems Temporal Masking time Amplitude Pre-masking curve Post-masking curve

CS335 Principles of Multimedia Systems MPEG Perceptual Model  A matlab demo.

CS335 Principles of Multimedia Systems MPEG Audio Layer 1  MPEG (1 and 2) audio allows sampling rate at , 32, 22.05, 24 and 16KHz.  MPEG filters the input audio into 32 bands. Filtering And downsampling Audio 384 samples 12 samples Perceptual coder Normalize By scale factor

CS335 Principles of Multimedia Systems MPEG Audio Layer 2  Layer 2 is very similar to Layer 1, but groups samples together in coding.  It also improves the scaling factor quantization and also groups 3 audio samples together in bit assignment. Filtering And downsampling Audio 3x384 samples 36 samples Perceptual coder Normalize By scale factor

CS335 Principles of Multimedia Systems Overlapped Transform and MDCT 2N Window 1 Window 2 Window 3 Window 4 In overlapped transform, 2N samples are transformed to N elements Reconstructed result. In reverse Transform:

CS335 Principles of Multimedia Systems Some Matlab Codes  The program compares DCT and MDCT in audio processing.  Code is available on the course website as a tar ball mdct_and_dct.tar.

CS335 Principles of Multimedia Systems MP3  MP3 is another layer built on top of MPEG audio layer 2.  MP3 further does MDCT on each band and tries to encode the MDCT coefficients.  MP3 then uses Huffman coding to further compress the bit streams losslessly.

CS335 Principles of Multimedia Systems File Format HeaderCRC Bit Allocation Scale factors Subband DataHeaderCRC Bit Allocation Scale factors Subband Data Mpeg audio puts header in each of the frame, so that they can be decoded separately. Frame 1 Frame 2

CS335 Principles of Multimedia Systems Other Audio Coding Standards  MPEG 2 and MPEG 4 ACC (advanced audio coding) –Not backward compatible –Use MDCT without bandpass filtering  Dolby AC3 –MDCT based codec –Similar to MPEG ACC but uses a different quantization and coding scheme –A de-facto standard for DVD and Digital audio in Movie.

CS335 Principles of Multimedia Systems Realtime Audio Systems Audio Processing Unit Audio input circular queue Write pointer Read pointer Audio I/O Process Audio output circular queue