Acoustic Localization by Interaural Level Difference Rajitha Gangishetty.

Slides:



Advertisements
Similar presentations
Computer Vision Lecture 7: The Fourier Transform
Advertisements

Underwater Acoustic MIMO Channel Capacity
1. INTRODUCTION In order to transmit digital information over * bandpass channels, we have to transfer the information to a carrier wave of.appropriate.
6.5 & 6.7 Notes Writing equations of trigonometric functions given the transformations.
S Digital Communication Systems Bandpass modulation II.
Fourier Transform – Chapter 13. Image space Cameras (regardless of wave lengths) create images in the spatial domain Pixels represent features (intensity,
Alternatives to Spherical Microphone arrays: Hybrid Geometries Aastha Gupta & Prof. Thushara Abhayapala Applied Signal Processing CECS To be presented.
Hough Transform Jeremy Wyatt. The story so far We know how to find edges by convolving with the derivative of a Gaussian filter in two directions We then.
MFT: Theoretical Basis
Automatic Position Calibration of Multiple Microphones
Image Enhancement in the Frequency Domain Part I Image Enhancement in the Frequency Domain Part I Dr. Samir H. Abdul-Jauwad Electrical Engineering Department.
Some Properties of the 2-D Fourier Transform Translation Distributivity and Scaling Rotation Periodicity and Conjugate Symmetry Separability Convolution.
APPROXIMATE EXPRESSIONS FOR THE MEAN AND COVARIANCE OF THE ML ESTIMATIOR FOR ACOUSTIC SOURCE LOCALIZATION Vikas C. Raykar | Ramani Duraiswami Perceptual.
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
3.1 Chapter 3 Data and Signals Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Chapter 5. Operations on Multiple R. V.'s 1 Chapter 5. Operations on Multiple Random Variables 0. Introduction 1. Expected Value of a Function of Random.
Section 1.1 The Distance and Midpoint Formulas. x axis y axis origin Rectangular or Cartesian Coordinate System.
Abstract In this project we expand our previous work entitled "Design of a Robotic Platform and Algorithms for Adaptive Control of Sensing Parameters".
Empirical Bayes approaches to thresholding Bernard Silverman, University of Bristol (joint work with Iain Johnstone, Stanford) IMS meeting 30 July 2002.
Standard Form for the Equation of the Circle
In general, H 1 (f) and H 2 (f) can be chosen to best suit the application. For example, to accentuate the signals at the frequencies in which the signal-to-noise.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
October 8, 2013Computer Vision Lecture 11: The Hough Transform 1 Fitting Curve Models to Edges Most contours can be well described by combining several.
Honors Calculus I Chapter P: Prerequisites Section P.1: Lines in the Plane.
Modulation, Demodulation and Coding Course
Absolute error. absolute function absolute value.
Kent Bertilsson Muhammad Amir Yousaf. DC and AC Circuit analysis  Circuit analysis is the process of finding the voltages across, and the currents through,
Hough Transform on the GPU Presented by Javier Garcia November 29, 2010.
DelayRatio: A Gravitational Wave Event Physical Likelihood Estimator Based on Detection Delays and SNR Ratios Amber L. Stuver LIGO Livingston ObservatoryCalifornia.
Module 2 SPECTRAL ANALYSIS OF COMMUNICATION SIGNAL.
REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.
Auditory and Visual Spatial Sensing Stan Birchfield Department of Electrical and Computer Engineering Clemson University.
7.6 Phase Shift; Sinusoidal Curve Fitting
A Acoustic Source Direction by Hemisphere Sampling Stanley T. Birchfield Daniel K. Gillmor Quindi Corporation Palo Alto, California.
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
TIME-SHIFTED PRINCIPAL COMPONENT ANALYSIS BASED CUE EXTRACTION FOR STEREO AUDIO SIGNALS Jianjun HE, Ee-Leng Tan, Woon-Seng Gan Digital Signal Processing.
Pressure-particle velocity coherence Environmental effects on coherence of pressure–vertical particle velocity David Dall’Osto and Peter H. Dahl Applied.
Topic: Quadratics and Complex Numbers Grade: 10 Key Learning(s): Analyzes the graphs of and solves quadratic equations and inequalities by factoring, taking.
1 8. One Function of Two Random Variables Given two random variables X and Y and a function g(x,y), we form a new random variable Z as Given the joint.
Section 6.6 Graphs of Transformed Sine and Cosine Functions Copyright ©2013, 2009, 2006, 2001 Pearson Education, Inc.
LOCUS IN THE COMPLEX PLANE The locus defines the path of a complex number Recall from the start of the chapter that the modulus and the argument defines.
Course 8 Contours. Def: edge list ---- ordered set of edge point or fragments. Def: contour ---- an edge list or expression that is used to represent.
Section 11.1 Section 11.2 Conic Sections The Parabola.
11.5: Circles in the Coordinate Plane OBJECTIVES: STUDENTS WILL BE TO… IDENTIFY THE CENTER AND THE RADIUS OF A CIRCLE FROM ITS EQUATION WRITE AND GRAPH.
Fast Bayesian Acoustic Localization
Section 4.5 Graphs of Sine and Cosine. Sine Curve Key Points:0 Value: π 2π2π π 2π2π 1.
8.1 The Rectangular Coordinate System and Circles Part 2: Circles.
One Function of Two Random Variables
2D Fourier Transform.
1 The Coordinate Plane Just as points on a line can be identified with real numbers to form the coordinate line, points in a plane can be identified with.
Environmental and Exploration Geophysics II tom.h.wilson
Circles A review?. Let's review what we already know about circles. Definition: A circle is a locus (set) of points in a plane equidistant from a fixed.
Warm Up Find the slope of the line that connects each pair of points. – (5, 7) and (–1, 6) 2. (3, –4) and (–4, 3)
10-8 Equations of Circles 1.Write the equation of a circle. 2.Graph a circle on the coordinate plane.
Biointelligence Laboratory, Seoul National University
A Unifying Framework for Acoustic Localization
5.6 Phase Shift; Sinusoidal Curve Fitting
ECE 7251: Signal Detection and Estimation
Frequency Domain Analysis
Copyright © 2009 Pearson Education, Inc.
Volume 62, Issue 1, Pages (April 2009)
4. Image Enhancement in Frequency Domain
EE513 Audio Signals and Systems
Volume 62, Issue 1, Pages (April 2009)
POLAR COORDINATES Dr. Shildneck.
Correlation, Energy Spectral Density and Power Spectral Density
Lesson 2-4 The Coordinate Plane
7.4 Periodic Graphs & Phase Shifts Objectives:
Coordinate Plane y axis x axis.
Presentation transcript:

Acoustic Localization by Interaural Level Difference Rajitha Gangishetty

7/14/2005Acoustic Localization by ILD2 Acoustic Localization compact microphone array   sound source d Acoustic Localization: Determining the location of a sound source by comparing the signals received by an array of microphones. Issues: reverberation noise

7/14/2005Acoustic Localization by ILD3 Overview What is Interaural Level Difference (ILD)? ILD Formulation ILD Localization Simulation Results Conclusion and Future Work

7/14/2005Acoustic Localization by ILD4 Techniques Interaural time difference (ITD): relative time shift ITD ILD sound source microphones Interaural level difference (ILD): relative energy level All previous methods (TDE, beamforming, etc.) use ITD alone.

7/14/2005Acoustic Localization by ILD5 Previous Work Time Delay Estimation [M. S. Brandstein, H. F. Silverman, ICASSP 1997; P. Svaizer, M. Matassoni, M. Omologo, ICASSP 1997] Beamforming J. L. Flanagan, J.D. Johnston, R. Zahn, JASA 1985; R. Duraiswami, D. Zotkin, L.Davis, ICASSP 2001] Accumulated Correlation [Stanley T. Birchfield, EUSIPCO 2004] Microphone arrays [Michael S. Brandstein, Harvey F. Silverman, ICASSP 1995; P. Svaizer, M. Matassoni, M. Omologo, ICASSP 1997] Hilbert Envelope Approach [David R. Fischell, Cecil H. Coker, ICASSP 1984]

7/14/2005Acoustic Localization by ILD6 A sneak peek at the results Likelihood plots, Estimation error, Comparison of different approaches likelihood function computed by horizontal and vertical microphone pairs contour plots of likelihood functions (overlaid and combined) microphonestrue location

7/14/2005Acoustic Localization by ILD7 ILD Formulation N microphones and a source signal s(t) Signal received by the i th microphone d i = distance from source to the i th microphone = additive white Gaussian noise Energy received by i th microphone

7/14/2005Acoustic Localization by ILD8 ILD Formulation For 2 mics the relation between energies and distances is Given E1 and E2 the sound source lies on a locus of points (a circle or line) described by where,

7/14/2005Acoustic Localization by ILD9 ILD Formulation For E1 ≠ E2 the equation becomes which is a circle with center and radius In 3D the circle becomes a sphere For E1= E2 the equation becomes which becomes a plane in 3D

7/14/2005Acoustic Localization by ILD10 Isocontours for 10log(delta E)

7/14/2005Acoustic Localization by ILD11 ILD Localization With only two microphones source is constrained to lie on a curve The microphones cannot pinpoint the sound source location We use multiple microphone pairs The intersection of the curves yield the sound source location Why multiple microphone pairs?

7/14/2005Acoustic Localization by ILD12 Combined Likelihood Approach Then the estimate for the energy ratio at candidate location is Then the estimate for the energy ratio at candidate location is Define the energy ratio as Define the energy ratio as where is the location of the ith microphone is treated as a Gaussian is treated as a Gaussian random variable random variable Joint probability from multiple microphone Joint probability from multiple microphone pairs is computed by combining the pairs is computed by combining the individual log likelihoods individual log likelihoods Localize sound source by computing likelihood at a number of candidate locations:

7/14/2005Acoustic Localization by ILD13 Hilbert Transform The Hilbert transform returns a complex sequence, from a real data sequence. The complex signal x = x r + i*x i has a real part, x r, which is the original data, and an imaginary part, x i, which contains the Hilbert transform. The imaginary part is a version of the original real sequence with a 90° phase shift. Sines are therefore transformed to cosines and vice versa.

7/14/2005Acoustic Localization by ILD14 Hilbert Transformer x r [n] Complex Signal x[n] Hilbert Transformer h[n] x r [n] x i [n] -j, 0<w<pi j, -pi<w<0 H(e jw ) = where ‘w’ is the angular frequency The Hilbert transformed series has the same amplitude and frequency content as the original real data and includes phase information that depends on the phase of the original data. In Frequency domain, X i (e jw ) = H(e jw )X r (e jw )

7/14/2005Acoustic Localization by ILD15 Hilbert Envelope Approach All-pass filter circuit produces two signals with equal amplitude but 90 degrees out of phase. Square root of the sum of squares is taken. (90 o ) (90 o ) 2 (0 o ) 2

7/14/2005Acoustic Localization by ILD16 Simulated Room

7/14/2005Acoustic Localization by ILD17 Simulation Results The algorithm Accurately estimates the angle to the sound source in some scenarios Exhibits bias toward far locations (unable to reliably estimate the distance to the sound source) Is sensitive to noise and reverberation

7/14/2005Acoustic Localization by ILD18 Results of delta E Estimation The estimation is highly dependent upon the sound source location amount of reverberation amount of noise size of the room relative positions of source and microphones

7/14/2005Acoustic Localization by ILD19 Likelihood plots 5x5 m room, theta = 45 deg, no noise, no reverberation, d = 2m

7/14/2005Acoustic Localization by ILD20 Likelihood plots 5x5 m room, theta = 90 deg, SNR = 0db, reflection coefficient = 9, d = 2m

7/14/2005Acoustic Localization by ILD21 Likelihood plots 5x5 m room, theta = 0 deg, SNR = 0db, reflection coefficient = 9, d = 1m

7/14/2005Acoustic Localization by ILD22 Likelihood plots 10x10 m room, theta = 0 deg, SNR = 0db, reflection coefficient = 9, d = 1m angle error = 6.5 degrees

7/14/2005Acoustic Localization by ILD23 Likelihood plots 5x5 m room, theta = 36 deg, SNR = 0db, reflection coefficient = 9, d = 2m angle error = 9 degrees

7/14/2005Acoustic Localization by ILD24 Angle Errors in a 5x5 m room d = 1m, only noised = 2m, only noise d = 1m, only reverberationd = 2m, only reverberation 0.7 = solid line, blue 0.8 = dotted, red 0.9 = dashed, green 20 dB = solid line, blue 10 dB = dotted, red 0 dB = dashed, green

7/14/2005Acoustic Localization by ILD25 Angle error in degrees for the 10x10 m room when the source is at a distance of 1m Angle error in degrees for the 5x5 m room when the source is at a distance of 1m

7/14/2005Acoustic Localization by ILD26 Angle error in degrees for the 10x10 m room when the source is at a distance of 2m Angle error in degrees for the 5x5 m room when the source is at a distance of 2m

7/14/2005Acoustic Localization by ILD27 Comparison of errors with the Hilbert Envelope Approach in a 5x5 m room dB10 dB0 dB Without Hilbert = solid line, blue Matlab Hilbert = dotted line, red Kaiser Hilbert = dashed line, green Reflection coefficient

7/14/2005Acoustic Localization by ILD28 Comparison of errors with the Hilbert Envelope Approach in a 10x10 m room dB10 dB0 dB Without Hilbert = solid line, blue Matlab Hilbert = dotted line, red Kaiser Hilbert = dashed line, green Reflection coefficient

7/14/2005Acoustic Localization by ILD29 Likelihood plots without Hilbert Envelope angle error = 27 degrees 5x5 m room, theta = 18 deg, SNR = 0db, reflection coefficient = 9, d = 2m

7/14/2005Acoustic Localization by ILD30 Frames approach Signal divided into 50 frames Frame size = 92.8ms 50% overlap in each frame 5x5 m room, theta = 18 deg, SNR = 0db, reflection coefficient = 9, d = 2m (left), d = 1m (right) Mean error = 15 deg Std Dev = 11 deg Mean error = 7 deg Std Dev = 6 deg

7/14/2005Acoustic Localization by ILD31 Conclusion and Future Work ILD is an important cue for acoustic localizationILD is an important cue for acoustic localization Preliminary results indicate potential for ILD (Algorithm yields accurate results for several configurations, even with noise and reverberation) Preliminary results indicate potential for ILD (Algorithm yields accurate results for several configurations, even with noise and reverberation) Future work: Future work: Investigate issues (e.g., bias toward distant locations, sensitivity to reverberation) Investigate issues (e.g., bias toward distant locations, sensitivity to reverberation) Experiment in real environments Experiment in real environments Investigate ILDs in the case of occlusion Investigate ILDs in the case of occlusion Combine with ITD to yield more robust results Combine with ITD to yield more robust results

7/14/2005Acoustic Localization by ILD32