Mid-Term Review John W. Worley AudioGroup, WCL

Slides:



Advertisements
Similar presentations
Islamic university of Gaza Faculty of engineering Electrical engineering dept. Submitted to: Dr.Hatem Alaidy Submitted by: Ola Hajjaj Tahleel.
Advertisements

MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
1 3D sound reproduction with “OPSODIS” and its commercial applications Takashi Takeuchi, PhD Chief Technical Officer OPSODIS Limited Institute of Sound.
Sound source segregation Development of the ability to separate concurrent sounds into auditory objects.
Timbre perception. Objective Timbre perception and the physical properties of the sound on which it depends Formal definition: ‘that attribute of auditory.
Spatial Perception of Audio J. D. (jj) Johnston Neural Audio Corporation.
Learning the cues associated with non-individualised HRTFs John Worley and Jonas Braasch Binaural and Spatial Hearing Group.
School of Electrical and Computer Engineering Weldon School of Biomedical Engineering Thomas M. Talavage, PhD Associate Professor, Electrical and Computer.
Room Acoustics: implications for speech reception and perception by hearing aid and cochlear implant users 2003 Arthur Boothroyd, Ph.D. Distinguished.
Effect of reverberation on loudness perceptionInsert footer on Slide Master© University of Reading Department of Psychology 12.
3-D Sound and Spatial Audio MUS_TECH 348. Wightman & Kistler (1989) Headphone simulation of free-field listening I. Stimulus synthesis II. Psychophysical.
3-D Sound and Spatial Audio MUS_TECH 348. Cathedral / Concert Hall / Theater Sound Altar / Stage / Screen Spiritual / Emotional World Subjective Music.
Watkins, Raimond & Makin (2011) J Acoust Soc Am –2788 temporal envelopes in auditory filters: [s] vs [st] distinction is most apparent; - at higher.
Ultrasonic Nonlinear Imaging- Tissue Harmonic Imaging.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
Relationship between perception of spectral ripple and speech recognition in cochlear implant and vocoder listeners L.M. Litvak, A.J. Spahr, A.A. Saoji,
A.Diederich– International University Bremen – USC – MMM – Spring Onset and offset Sounds that stop and start at different times tend to be produced.
Sound source segregation (determination)
L INKWITZ L AB Accurate sound reproduction from two loudspeakers in a living room 13-Nov-07 (1) Siegfried Linkwitz.
Modernising Children’s Hearing Aid Services Sound Field Testing MCHAS TEAM Wave 4 SFR 17/05/04.
Sub-band Mixing and Addition of Digital Effects for Consumer Audio ELECTRICAL & ELECTRONIC ENGINEERING FINAL YEAR PROJECTS 2012/2013 Presented by Fionn.
Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.
Review Exam III. Chapter 10 Sinusoidally Driven Oscillations.
Dual-Channel FFT Analysis: A Presentation Prepared for Syn-Aud-Con: Test and Measurement Seminars Louisville, KY Aug , 2002.
LINKWITZ LAB What are the On-axis & Off-axis
Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer.
Applied Psychoacoustics Lecture: Binaural Hearing Jonas Braasch Jens Blauert.
Rumsey Chapter 16 Day 3. Overview  Stereo = 2.0 (two discreet channels)  THREE-DIMENSIONAL, even though only two channels  Stereo listening is affected.
Basics of Neural Networks Neural Network Topologies.
speech, played several metres from the listener in a room - seems to have the same phonetic content as when played nearby - that is, perception is constant.
Calibration of Consonant Perception in Room Reverberation K. Ueno (Institute of Industrial Science, Univ. of Tokyo) N. Kopčo and B. G. Shinn-Cunningham.
Developing a model to explain and stimulate the perception of sounds in three dimensions David Kraljevich and Chris Dove.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Timo Haapsaari Laboratory of Acoustics and Audio Signal Processing April 10, 2007 Two-Way Acoustic Window using Wave Field Synthesis.
L INKWITZ L AB S e n s i b l e R e p r o d u c t i o n & R e c o r d i n g o f A u d i t o r y S c e n e s Hearing Spatial Detail in Stereo Recordings.
Jens Blauert, Bochum Binaural Hearing and Human Sound Localization.
Hearing Research Center
This research was supported by Delphi Automotive Systems
Scaling Studies of Perceived Source Width Juha Merimaa Institut für Kommunikationsakustik Ruhr-Universität Bochum.
Predicting the Intelligibility of Cochlear-implant Vocoded Speech from Objective Quality Measure(1) Department of Electrical Engineering, The University.
IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.
On the improvement of virtual localization in vertical directions using HRTF synthesis and additional filtering Wersényi György SZÉCHENYI ISTVÁN UNIVERSITY,
On the manifolds of spatial hearing
Subjective Assessments of Real-Time Room Dereverberation and Loudspeaker Equalisation Panagiotis Hatziantoniou and John Mourjopoulos AudioGroup, WCL Department.
2/17/2007DSP Course-IUST-Spring Semester 1 Digital Signal Processing Electrical Engineering Department Iran University of Science & Tech.
PSYC Auditory Science Spatial Hearing Chris Plack.
Fletcher’s band-widening experiment (1940)
THE SIGNIFICANCE OF SOUND DIFFRACTION EFFECTS IN PREDICTING ACOUSTICS IN ANCIENT THEATRES Presented by : Panos Economou P.E. Mediterranean Acoustics Research.
SPATIAL HEARING Ability to locate the direction of a sound. Ability to locate the direction of a sound. Localization: In free field Localization: In free.
Franssen illusion within large rooms John W. Worley AudioGroup, WCL Department of Electrical and Computer Engineering University of Patras, Greece
H IGH E FFICIENCY L OUDNESS M ODEL FOR BROADCAST CONTENT C ONVENTION P APER 8612 T RAVAGLINI, A LEMANNO, U NCINI 134° AES.
distance, m (log scale) -25o 0o +25o C50, dB left right L-shaped
Auditory Localization in Rooms: Acoustic Analysis and Behavior
Precedence-based speech segregation in a virtual auditory environment
3D sound reproduction with “OPSODIS” and its commercial applications
1. BASIC AUDIO SYSTEM 2 Pre- Amplifier (Voltage Amp.) Pre- Amplifier (Voltage Amp.) Power Amplifier Mic Speaker.
III. Analysis of Modulation Metrics IV. Modifications
FM Hearing-Aid Device Checkpoint 2
Term Project Presentation By: Keerthi C Nagaraj Dated: 30th April 2003
Attentional Tracking in Real-Room Reverberation
Frequency Domain Perceptual Linear Predicton (FDPLP)
Loudness asymmetry in real-room reverberation: cross-band effects
Mark Sayles, Ian M. Winter  Neuron 
Outline Linear Shift-invariant system Linear filters
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
Hearing Spatial Detail
COPYRIGHT © All rights reserved by Sound acoustics Germany
Now that we can store audio with high resolution, what will it take to reproduce it with high accuracy? 10/29/2019
Presentation transcript:

Mid-Term Review John W. Worley AudioGroup, WCL Department of Electrical and Computer Engineering University of Patras, Greece http://www.wcl.ee.upatras.gr/AudioGroup/

2.2 Reliability of auditory cues in multi-source scenarios Tasks 2.1 The precedence effect Franssen illusion 2.2 Reliability of auditory cues in multi-source scenarios Learning non-individualised HRTFs 2.3 Perceptual models of room reverberation with application to speech recognition Complex smoothed room responses Perceptual factors in room responses

Task 2.1 Franssen illusion Reverberant environments = cue to multiple directions. The precedence effect = stable directional percept. Franssen illusion (F.I.) Precedence effect. ITD/ILD dependant

Task 2.1 Franssen illusion Hypothesis Localisation requires transients. Signal spectral density. Room differences. ITD/ILD dependant. Solution Various onset transitions. Sinusoid & Harmonic complex’s. Large vs. small rooms At present: F.I. in reverberation chamber. No transition effect. Increasing spectral density = Increased localisability. F.I. dependant on poor stimuli localisability. Future: F.I. with Grouping cues?? USE loudspeakers Use s/p stereo file with one left channel and one right (F0=0) Compare with same files but from one loudspeaker

Task 2.2 Learning non-individualised HRTFs Cone-of-confusion MVP HRTFs Individual HRTFs

Task 2.2 Learning non-individualised HRTFs: Results Type – I (2 listeners) Type - II (3 listeners) Response bias significantly determines reversal type = No reversal predisposition. = Majority of front-to-back reversals.

Task 2.3 Complex Smoothing Room Impulse Response (RIR): time domain frequency domain Original RIR Smoothed

perceptual smoothing profiles Start with a “smoothed” room response Use smoothing based on perception variable spectral resolution variable frequency-dependent windowing Employ “room masking models”

Task 2.3 Inverse filtering using smoothed filters time domain frequency domain modification compensation from: “Results for Room Acoustics Equalisation Based on Smoothed Responses” Panagiotis D. Hatziantoniou and John N. Mourjopoulos,114th AES Convention, Amsterdam, March 2003

Task 2.3 Smoothed filters physical metrics Tests in 6 rooms of Volume 60m3 – 11000m3 EDT reduced by up to 0,5 sec C80 improves by up to 5 dB D50 improves by up to 20% Spectral deviation is reduced up to 4 dB from: “Results for Room Acoustics Equalisation Based on Smoothed Responses” Panagiotis D. Hatziantoniou and John N. Mourjopoulos, 114th AES Convention, Amsterdam, March 2003

Task 2.3 Perceptual factors in room responses Real-time perception test. Various stimuli types (steady-state & transients). Assess multiple perceptual factors.

Task 2.3 Perceptual factors in room responses Source width. Source distance. Envelopment.

Task 2.3 Perceptual factors in room responses Anchor end-points with illustrative demonstrations and explanation. Results subjected to factor analysis

Perceptual factors in room responses (2.3). Future work Perceptual factors in room responses (2.3). ITD/ILD plausibility cues (2.1, 2.2). The combination of the cues is still debated. Use F0 grouping with FI for hierarchy of cues (2.2).

Department of Electrical and Computer Engineering AudioGroup, WCL Department of Electrical and Computer Engineering University of Patras, Greece http://www.wcl.ee.upatras.gr/AudioGroup/ …. Thank you very much...