1 Introduction to MPEG Surround 韓志岡 2/9/2005. 2 Outline Background – Motivation – Perception of sound in space Pricicple of MPEG Surround – Downmixing.

Slides:



Advertisements
Similar presentations
Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.
Advertisements

Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.6 (Fr Oct 24) The MP3 algorithm with PAC.
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
MPEG Audio Formats Jason Leung Wednesday, February 5, 2014.
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Digital Audio Compression
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
EE2F2 - Music Technology 2. Stereo and Multi-track Recording.
INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
Spatial Perception of Audio J. D. (jj) Johnston Neural Audio Corporation.
Digital Audio Coding – Dr. T. Collins Standard MIDI Files Perceptual Audio Coding MPEG-1 layers 1, 2 & 3 MPEG-4.
SWE 423: Multimedia Systems Chapter 3: Audio Technology (2)
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 
1 © NOKIA Audio Codecs Audio Codecs Miikka Vilermo Nokia Research Center – Audio Visual Systems Laboratory.
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
MPEG-3 For Audio Presented by: Chun Lui Sunjeev Sikand.
3-D Spatialization and Localization and Simulated Surround Sound with Headphones Lucas O’Neil Brendan Cassidy.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Dolby AC-3 Audio Encoding & THX Wai Kam (Winnie) Henele Adams Peter Boettcher.
Audio Coding MPEG1 Layers I, II, III MPEG2MPEG4 Sherida Subrati Anthony Caliendo.
Multi-Shift Principal Component Analysis based Primary Component Extraction for Spatial Audio Reproduction Jianjun HE, and Woon-Seng Gan 23 rd April 2015.
Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,
Binaural Sound Localization and Filtering By: Dan Hauer Advisor: Dr. Brian D. Huggins 6 December 2005.
1 Ambisonics: The Surround Alternative Richard G. Elen The Ambisonic Network.
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
Audio CompressiontMyn1 Audio Compression Audio compression has become well entrenched in consumer and professional digital audio products such as the compact.
MPEG-2 Digital Video Coding Standard
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
DIGITAL WATERMARKING OF AUDIO SIGNALS USING A PSYCHOACOUSTIC AUDITORY MODEL AND SPREAD SPECTRUM THEORY * By: Ricardo A. Garcia *Research done at: University.
DIGITAL WATERMARKING OF AUDIO SIGNALS USING A PSYCHOACOUSTIC AUDITORY MODEL AND SPREAD SPECTRUM THEORY By: Ricardo A. Garcia University of Miami School.
LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.
Media Representations - Audio
Speech Coding Submitted To: Dr. Mohab Mangoud Submitted By: Nidal Ismail.
MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Outline Kinds of Coding Need for Compression Basic Types Taxonomy Performance Metrics.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Basic Concepts of Audio Watermarking. Selection of Different Approaches Embedding Domain  time domain  frequency domain DFT, DCT, etc. Modulation Method.
Jens Blauert, Bochum Binaural Hearing and Human Sound Localization.
TIME-SHIFTED PRINCIPAL COMPONENT ANALYSIS BASED CUE EXTRACTION FOR STEREO AUDIO SIGNALS Jianjun HE, Ee-Leng Tan, Woon-Seng Gan Digital Signal Processing.
Study on Frequency Domain Primary-Ambient Extraction (PAE) HE Jianjun PhD Candidate, DSP Lab, School of EEE, Nanyang Technological University, Singapore.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 4 – Audio and Digital Image Representation Klara Nahrstedt Spring 2010.
(B1) What are the advantages and disadvantages of digital TV systems? Hint: Consider factors on noise, data security, VOD etc. 1.
EE5359 Multimedia Processing Project Study and Comparison of AC3, AAC and HE-AAC Audio Codecs Dhatchaini Rajendran Student ID: Date :
Fletcher’s band-widening experiment (1940)
3-D Sound and Spatial Audio MUS_TECH 348. What do these terms mean? Both terms are very general. “3-D sound” usually implies the perception of point sources.
MP3 and MP4 Audio By: Krunal Tailor
mp3DirectCut Audio recording.
Audio Compression.
Introduction to Audio Watermarking Schemes N. Lazic and P
III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.
What is stereophony? Stereos = solid (having dimensions: length width, height) Phonics = study of sound stereophony (stereo) is an aural illusion – a.
3) determine motion and sound perceptions.
Data Compression.
Basic Concepts of Audio Watermarking
Nokia Research Center – Audio Visual Systems Laboratory
MPEG-1 Overview of MPEG-1 Standard
III Digital Audio III.6 (Mo Oct 22) The MP3 algorithm with PAC.
Govt. Polytechnic Dhangar(Fatehabad)
3 primary cues for auditory localization: Interaural time difference (ITD) Interaural intensity difference Directional transfer function.
DIGITAL WATERMARKING OF AUDIO SIGNALS USING A PSYCHOACOUSTIC AUDITORY MODEL AND SPREAD SPECTRUM THEORY By: Ricardo A. Garcia University of Miami School.
Presentation transcript:

1 Introduction to MPEG Surround 韓志岡 2/9/2005

2 Outline Background – Motivation – Perception of sound in space Pricicple of MPEG Surround – Downmixing to one channel – Estimation of spatial cues – Synthesis of spatial cues Conclusions & Reference

3 Motivation The vast majority of audio playback equipment use traditional two-channel presentations (stereo) More reproduction channels ( “ multi-channel audio ” or “ surround sound ” ) is quite visible in the market place A non-disruptive transition from stereo to multi-channel audio requires media formats that can serve both those using conventional stereo equipment and those using next-generation multi-channel equipment.

4 Perception of sound in space HRTF(Head Related Transfer Function) modeling the path of sound from a source to the left and right ear entrances.

5 Perception of sound in space(cont.) Three parameters(cues) describing how human localize sound in the horizontal plane: – Interaural level difference (ILD) – Interaural time difference (ITD) – Interaural coherence (IC)

6 ITD (Interaural time difference) & ILD (Interaural level difference)

7 ITD (Interaural time difference) & ILD (Interaural level difference) (cont.) ITD and ILD between a pair of headphone signals determine the location of the auditory event which appears in the frontal section of the upper head.

8 IC (Interaural coherence) The spatial impression of the auditory enent is related to IC

9 Two sound source: Summing localization Inter-channel time difference (ICTD) Inter-channel level difference (ICLD) Inter-channel coherence (ICC)

10 Two sound source: Summing localization (cont.)

11 MPEG Surround MPEG Surround exploits inter-channel differences in level, phase and coherence equivalent to the ILD, ITD and IC cues to capture the spatial image of a multi-channel audio signal Downmix signal and encodes these cues in a very compact form such that the cues and the transmitted signal can be decoded to synthesize a high quality multi-channel representation. Provide backward compatibility with stereo/mono audio systems.

12 Coding Scheme

13 Downmixing to one channel (1/2) The sum signal is generated by adding the input channels in a subband domain Multiplying the sum with a factor in order to preserve signal power

14 Downmixing to one channel (2/2)

15 Estimation of spatial cues (1/4) The spatial cues, ICTD, ICLD, and ICC are estimated in a subband domain. The spatial cue estimation is applied independently to each subband

16 Estimation of spatial cues(2/4) ICTD (samples): with a short-time estimate of normalized cross- correlation function where and is a short-time estimate of the mean of

17 Estimation of spatial cues(3/4) ICLD (dB): ICC :

18 Estimation of spatial cues(4/4) For multi-channel audio signals, ICTD and ICLD are defined between the reference channel and each other C-1 channels

19 Synthesis of spatial cues(1/3) ICTD are synthesized by imposing delays, ICLD by scaling, and ICC by applying de-correlation filters.

20 Synthesis of spatial cues(2/3) The delays are determined by the ICTDs

21 Synthesis of spatial cues(3/3) The scale factors are determined by the ICLDs satisfying: After delays and scaling, we need to reduce correlation between the subbands. This is achieved by designing the filters h c controlled as a function of ICC.

22 Conclusions (1/2) Well-known perceptual audio coders, such as MP3, primarily exploit a single channel ’ s ability to mask its own quantization noise. In contrast, spatial perception is primarily attributed to three parameters : ILD, ITD, and IC.

23 Conclusions (2/2) MPEG Surround provides an extremely efficient method for coding of multi-channel sound via the transmission of a compressed stereo (or even mono) audio program plus a low-rate side-information channel. MPEG Surround is the latest technology for bitrate efficient and backward compatible presentation of multi-channel audio.

24 Reference ISO/IEC JTC1/SC29/WG11 (MPEG), Document N7390, “ Tutorial on MPEG Surround Audio Coding ”, July 2005, Poznan, Poland C. Faller, “ Parametric coding of spatial audio, ” in Proc. DAFx (Digital Audio Effects), October 2004.