3-D Sound and Spatial Audio MUS_TECH 348. Main Types of Errors Front-back reversals Angle error Some Experimental Results Most front-back errors are front-to-back.

Slides:



Advertisements
Similar presentations
Researches and Applications for Automotive Field Andrea Azzali, Eraldo Carpanoni, Angelo Farina University of Parma.
Advertisements

Evaluation of Dummy-Head HRTFs in the Horizontal Plane based on the Peak-Valley Structure in One-degree Spatial Resolution Wersényi György SZÉCHENYI ISTVÁN.
Binaural Hearing Or now hear this! Upcoming Talk: Isabelle Peretz Musical & Non-musical Brains Nov. 12 noon + Lunch Rm 2068B South Building.
Spatial Perception of Audio J. D. (jj) Johnston Neural Audio Corporation.
1 Media Processing – Audio Part Dr Wenwu Wang Centre for Vision Speech and Signal Processing Department of Electronic Engineering
Listening Tests and Evaluation of Simulated Sound Fields Using VibeStudio Designer Wersényi György Hesham Fouad SZÉCHENYI ISTVÁN UNIVERSITY, Hungary VRSonic,
SWE 423: Multimedia Systems Chapter 3: Audio Technology (2)
3-D Sound and Spatial Audio MUS_TECH 348. Psychology of Spatial Hearing There are acoustic events that take place in the environment. These can give rise.
Extracting the pinna spectral notches Vikas C. Raykar | Ramani Duraiswami University of Maryland, CollegePark B. Yegnanaryana Indian Institute of Technology,
3-D Sound and Spatial Audio MUS_TECH 348. Wightman & Kistler (1989) Headphone simulation of free-field listening I. Stimulus synthesis II. Psychophysical.
ELEC 407 DSP Project Algorithmic Reverberation – A Hybrid Approach Combining Moorer’s reverberator with simulated room IR reflection modeling Will McFarland.
Localizing Sounds. When we perceive a sound, we often simultaneously perceive the location of that sound. Even new born infants orient their eyes toward.
SYED SYAHRIL TRADITIONAL MUSICAL INSTRUMENT SIMULATOR FOR GUITAR1.
Back to Stereo: Stereo Imaging and Mic Techniques Huber, Ch. 4 Eargle, Ch. 11, 12.
Source Localization in Complex Listening Situations: Selection of Binaural Cues Based on Interaural Coherence Christof Faller Mobile Terminals Division,
3-D Sound and Spatial Audio MUS_TECH 348. Cathedral / Concert Hall / Theater Sound Altar / Stage / Screen Spiritual / Emotional World Subjective Music.
Sound Field Reproduction Peter Goss. Outline What is sound field reproduction? Free-field theory and simulation results Reverberant theory Implementation.
Auralization Lauri Savioja (Tapio Lokki) Helsinki University of Technology, TKK.
3-D Spatialization and Localization and Simulated Surround Sound with Headphones Lucas O’Neil Brendan Cassidy.
Implementing 3D Digital Sound In “Virtual Table Tennis” By Alexander Stevenson.
EE311: Junior EE Lab Phase Locked Loop J. Carroll 9/3/02.
Signal processing and Audio storage Equalization Effect processors Recording and playback.
Hearing & Deafness (3) Auditory Localisation
1 Manipulating Digital Audio. 2 Digital Manipulation  Extremely powerful manipulation techniques  Cut and paste  Filtering  Frequency domain manipulation.
STUDIOS AND LISTENING ROOMS
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
Binaural Sound Localization and Filtering By: Dan Hauer Advisor: Dr. Brian D. Huggins 6 December 2005.
So far: Historical overview of speech technology  basic components/goals for systems Quick review of DSP fundamentals Quick overview of pattern recognition.
3-D Sound and Spatial Audio MUS_TECH 348. Multi-Loudspeaker Reproduction: Surround Sound.
 Principles of Digital Audio. Analog Audio  3 Characteristics of analog audio signals: 1. Continuous signal – single repetitive waveform 2. Infinite.
3-D Sound and Spatial Audio MUS_TECH 348. What are Some Options for Creating DTFs?
Audio in VEs Ruth Aylett. Use of audio in VEs n Important but still under-utilised channel for HCI including virtual environments. n Speech recognition.
1 Recording Fundamentals INART 258 Fundamentals of MIDI & Digital Audio Mark BalloraMark Ballora, instructor 1.
0 - 1 © 2010 Texas Instruments Inc Practical Audio Experiments using the TMS320C5505 USB Stick “FIR Filters” Texas Instruments University Programme Teaching.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Lecture 1 Signals in the Time and Frequency Domains
Mono and Stereo Miking Techniques. Choosing Microphones Limited collection: useful for broad range of applications  Neumannn KM 184’s (desert island.
Binaural Sonification of Disparity Maps Alfonso Alba, Carlos Zubieta, Edgar Arce Facultad de Ciencias Universidad Autónoma de San Luis Potosí.
Spatial Perception of Audio vs. The Home Theatre James D. Johnston Chief Scientist, DTS, Inc.
Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.
SOUND IN THE WORLD AROUND US. OVERVIEW OF QUESTIONS What makes it possible to tell where a sound is coming from in space? When we are listening to a number.
Issac Garcia-Munoz Senior Thesis Electrical Engineering Advisor: Pietro Perona.
Rumsey Chapter 16 Day 3. Overview  Stereo = 2.0 (two discreet channels)  THREE-DIMENSIONAL, even though only two channels  Stereo listening is affected.
Audio for Telepresence and Virtual Reality Colin S. Harrison.
Audio Systems Survey of Methods for Modelling Sound Propagation in Interactive Virtual Environments Ben Tagger Andriana Machaira.
Simulation of small head-movements on a Virtual Audio Display using headphone playback and HRTF synthesis Wersényi György SZÉCHENYI ISTVÁN UNIVERSITY,
3-D Sound and Spatial Audio MUS_TECH 348. Physical Modeling Problem: Can we model the physical acoustics of the directional hearing system and thereby.
L INKWITZ L AB S e n s i b l e R e p r o d u c t i o n & R e c o r d i n g o f A u d i t o r y S c e n e s Hearing Spatial Detail in Stereo Recordings.
Spatial and Spectral Properties of the Dummy-Head During Measurements in the Head-Shadow Area based on HRTF Evaluation Wersényi György SZÉCHENYI ISTVÁN.
Hearing Research Center
3-D Sound and Spatial Audio
3-D Sound and Spatial Audio MUS_TECH 348. Stereo Loudspeaker Reproduction.
Microcomputer Systems Final Project “Speaker and Sound Modulation”
Detection of Affective States in Human-Computer Interaction Ying Gao “Affective Sensing” needed for Affective Computing Aim: To give computers the capability.
On the improvement of virtual localization in vertical directions using HRTF synthesis and additional filtering Wersényi György SZÉCHENYI ISTVÁN UNIVERSITY,
Microphones. How Microphones Work Sound is created when a vibrating object (such as a guitar string, drum skin etc..) causes the air around it to vibrate.
3-D Sound and Spatial Audio MUS_TECH 348. Are IID and ITD sufficient for localization? No, consider the “Cone of Confusion”
On the manifolds of spatial hearing
3-D Sound and Spatial Audio MUS_TECH 348. Environmental Acoustics, Perception and Audio Processing: Envelopment.
Fletcher’s band-widening experiment (1940)
Generalized Linear Phase Quote of the Day The mathematical sciences particularly exhibit order, symmetry, and limitation; and these are the greatest forms.
3-D Sound and Spatial Audio MUS_TECH 348. What do these terms mean? Both terms are very general. “3-D sound” usually implies the perception of point sources.
What is stereophony? Stereos = solid (having dimensions: length width, height) Phonics = study of sound stereophony (stereo) is an aural illusion – a.
Voice Removal from Music
Volume 62, Issue 1, Pages (April 2009)
Volume 62, Issue 1, Pages (April 2009)
Localizing Sounds.
Using HRTFs for virtual sound source positioning Lecture Example
3-D Sound and Spatial Audio
ARTIS Bluetooth Headphone
Presentation transcript:

3-D Sound and Spatial Audio MUS_TECH 348

Main Types of Errors Front-back reversals Angle error Some Experimental Results Most front-back errors are front-to-back Substantial individual differences Most evident in elevation judgments Performance varies with region Best localization: side / front? Worst localization: top rear More front-back reversals with headphones than free-field Quick Review

Headphone Issue: Externalization “ Lateralization” vs “Localization” Affected by: Head movement –Need head tracking system and dynamic updating of cues –Head-tracked ITD and IID are sufficient for front/back differentiation!!!

Headphone Issue: Externalization “ Lateralization” vs “Localization” Also affected by: Reverberation –Stereo reverberation, spatially diffuse –Interaural incoherence (more later) HRTFs –Needed especially if head is static

Headphone Issue: Externalization “ On the Externalization of sound images” Hartmann and Wittenberg (1996) Externalization depends on: Interaural phase below 1K Hz –Constant ITD (not frequency-dependent differences) IID at all frequencies HRTFs at each ear (not interaural differences) There is a continuous range of percepts from inside the head to outside

Review: Reality vs Ideal From Begault and Wenzel, 1993

Headphone Issue: Repeatability with Reseating

Headphone Reproduction: Comparison of Reproduction Modes

Binaural Recording

Classical Recording Aesthetic is to attempt to recreate the experience of being in a concert hall. All sound sources are contained in a signal acoustic space. The spatial layout attempts to replicate reality.

Rock/Pop Recording Aesthetic is to create a stereo recording with maximum clarity and impact. Sound sources are treated in ways that are most appropriate to each: strings are reverberated, but electric bass is not; voice is center and interfering instruments are to the side. The spatial design does not match any reality.

Dummy Head Recording Recording Device Aesthetic is most closely related to classical recording and extends to virtual reality. All sound sources are contained in a signal acoustic space.

Dummy Heads B&K 4128 Aachen Head

Dummy Heads KEMAR

Dummy Heads Neumann KU-100

Simulating Binaural Recording Through Digital Signal Processing (DSP) R: 14,-27,102,128,-10,-43… L: 0,0,0,27,14,-16,102,-3,28… Impulse Response is captured As a series of numbers HRTFs can be captured and stored

Simulating Binaural Recording Through Digital Signal Processing (DSP) System Overview

Simulating Binaural Recording Through Digital Signal Processing (DSP) The HRTFs are represented as HRIRs. These are simply a series of numbers, h(nT).

Simulating Binaural Recording Through Digital Signal Processing (DSP) The HRIR, h(nT), can be used directly to recreate the HRTFs as FIR (finite impulse response) filters. Of course, you need filters for both ears.

Simulating Binaural Recording Through Digital Signal Processing (DSP) The amount of computation for an FIR filter is proportional to the number of data points, ie. filter coefficients. Reducing the number of coefficients alters the HRTF.

Simulating Binaural Recording Through Digital Signal Processing (DSP) The HRIR can be processed to create a ‘minimum phase’ version that has the fewest possible coefficients, but the overall HRIR delay has to be recreated by the system.

Simulating Binaural Recording Through Digital Signal Processing (DSP) Here the system selects a pair of HRTFs and an interaural delay on the basis of source azimuth and elevation. The delay is implemented in series with the FIR filter.

Simulating Binaural Recording Through Digital Signal Processing (DSP) Each sound source needs its own pair of HRTF filters with delay. The output for each sound source is added together to create a composite output.

Simulating Binaural Recording Through Digital Signal Processing (DSP) Since sound sources change position in space, the HRTF filters need to be updated. This is especially true if one includes head-tracking!

Simulating Binaural Recording Through Digital Signal Processing (DSP) Instantaneous locations do not always match the location of the HRTFs in the library. Therefore, for locations that fall inbetween the recorded ones, new filters need to be interpolated.

Simulating Binaural Recording Through Digital Signal Processing (DSP) FIR interpolation can produce unexpected results. Best results are obtained with minimum-phase HRIRs.