D1 - 29/05/2014 France Télécom Recherche & Développement Workshop « From 5.1 to Sound Field Synthesis..." AES 120th Convention, Paris 2006 Higher Order.

Slides:



Advertisements
Similar presentations
Covalent Bonding: Orbitals
Advertisements

Pitch, Timbre, Source Separation, and the Myths of Sound Localization
Acoustical parameters ISO 3382
Balanced Device Characterization. Page 2 Outline Characteristics of Differential Topologies Measurement Alternatives Unbalanced and Balanced Performance.
Today • Diffraction from periodic transparencies: gratings
0 - 0.
Environmental Remote Sensing GEOG 2021
Developing Event Reconstruction for CTA R D Parsons (Univ. of Leeds) J Hinton (Univ. of Leicester)
Points, Vectors, Lines, Spheres and Matrices
Adding value through knowledge © NNC Limited September 2002International PHOENICS User Conference 1 A PHOENICS model of the hotbox region of an advanced.
Researches and Applications for Automotive Field Andrea Azzali, Eraldo Carpanoni, Angelo Farina University of Parma.
Spatial Sound Encoding Including Near Field Effect: Introducing Distance Coding Filters and a Viable, New Ambisonic Format Jérôme Daniel, France Telecom.
2D and 3D image surface descriptors for fish otoliths classification
1 Photometric Stereo Reconstruction Dr. Maria E. Angelopoulou.
3-D Sound and Spatial Audio MUS_TECH 348. Multi-Loudspeaker Reproduction: Surround Sound.
Waves and Sound Review.
Basic Audio Production
Group Meeting Presented by Wyman 10/14/2006
TASK: Skill Development A proportional relationship is a set of equivalent ratios. Equivalent ratios have equal values using different numbers. Creating.
UH page: 1 / Sept nd Intl. AES Conference “DSP for Loudspeakers” Hillerod, Denmark Application of Linear-Phase Digital Crossover.
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
1 3D sound reproduction with “OPSODIS” and its commercial applications Takashi Takeuchi, PhD Chief Technical Officer OPSODIS Limited Institute of Sound.
Spatial Perception of Audio J. D. (jj) Johnston Neural Audio Corporation.
Listening Tests and Evaluation of Simulated Sound Fields Using VibeStudio Designer Wersényi György Hesham Fouad SZÉCHENYI ISTVÁN UNIVERSITY, Hungary VRSonic,
Comparison of energy-preserving and all-round Ambisonic decoders
Back to Stereo: Stereo Imaging and Mic Techniques Huber, Ch. 4 Eargle, Ch. 11, 12.
Creating The Recorded Image of Turkish Art Music: The Decision Making Process in a Recording Session Doç. Dr. CAN KARADOĞAN İTÜ TMDK.
Sound Field Reproduction Peter Goss. Outline What is sound field reproduction? Free-field theory and simulation results Reverberant theory Implementation.
Alternatives to Spherical Microphone arrays: Hybrid Geometries Aastha Gupta & Prof. Thushara Abhayapala Applied Signal Processing CECS To be presented.
Auralization Lauri Savioja (Tapio Lokki) Helsinki University of Technology, TKK.
1 Introduction to MPEG Surround 韓志岡 2/9/ Outline Background – Motivation – Perception of sound in space Pricicple of MPEG Surround – Downmixing.
Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.
Project Presentation: March 9, 2006
STUDIOS AND LISTENING ROOMS
1 Dong Lu, Peter A. Dinda Prescience Laboratory Computer Science Department Northwestern University Virtualized.
Binaural Sound Localization and Filtering By: Dan Hauer Advisor: Dr. Brian D. Huggins 6 December 2005.
1 Ambisonics: The Surround Alternative Richard G. Elen The Ambisonic Network.
3-D Sound and Spatial Audio MUS_TECH 348. Multi-Loudspeaker Reproduction: Surround Sound.
Philip Coleman, Alice Duque, Philip Jackson, Marek Olik University of Surrey, Guildford, UK BBC Audio Research Showcase 2013 start The binaural.
1/19 Philip Coleman, Philip J. B. Jackson, Marek Olik Centre for Vision, Speech and Signal Processing, University.
L INKWITZ L AB Accurate sound reproduction from two loudspeakers in a living room 13-Nov-07 (1) Siegfried Linkwitz.
Digital Sound and Video Chapter 10, Exploring the Digital Domain.
Music Tech Away Day 2012 Jonathan P Wakefield Overview of current research supervisions: Mark Mynett – PhD p/t - Music Production for Contemporary Metal.
Mono and Stereo Miking Techniques. Choosing Microphones Limited collection: useful for broad range of applications  Neumannn KM 184’s (desert island.
THEORETICAL STUDY OF SOUND FIELD RECONSTRUCTION F.M. Fazi P.A. Nelson.
AUDIO SPOTLIGHTING PRESENTED BY: NAMRATA MAURYA EI-4 rth Year
Issac Garcia-Munoz Senior Thesis Electrical Engineering Advisor: Pietro Perona.
Virtual Worlds: Audio and Other Senses. VR Worlds: Output Overview Visual Displays: –Visual depth cues –Properties –Kinds: monitor, projection, head-based,
Rumsey Chapter 16 Day 3. Overview  Stereo = 2.0 (two discreet channels)  THREE-DIMENSIONAL, even though only two channels  Stereo listening is affected.
Timo Haapsaari Laboratory of Acoustics and Audio Signal Processing April 10, 2007 Two-Way Acoustic Window using Wave Field Synthesis.
Modal Analysis of Rigid Microphone Arrays using Boundary Elements Fabio Kaiser.
3-D Sound and Spatial Audio MUS_TECH 348. Physical Modeling Problem: Can we model the physical acoustics of the directional hearing system and thereby.
L INKWITZ L AB S e n s i b l e R e p r o d u c t i o n & R e c o r d i n g o f A u d i t o r y S c e n e s Hearing Spatial Detail in Stereo Recordings.
An Alternative Ambisonics Formulation: Modal Source Strength Matching and the Effect of Spatial Aliasing Franz Zotter Hannes Pomberger Matthias Frank.
3-D Sound and Spatial Audio MUS_TECH 348. Stereo Loudspeaker Reproduction.
CARE / ELAN / EUROTeV Feedback Loop on a large scale quadrupole prototype Laurent Brunetti* Jacques Lottin**
MRI Physics: Spatial Encoding Anna Beaumont FRCR Part I Physics.
3-D Sound and Spatial Audio MUS_TECH 348. What do these terms mean? Both terms are very general. “3-D sound” usually implies the perception of point sources.
What can Ambisonics do for you?
Microphone Array Projects
27th Tonmeistertagung nd November 2012, Cologne
Fantasound Developed by Disney for “Fantasia” (1940)
Recording for Surround Sound
3D sound reproduction with “OPSODIS” and its commercial applications
Crafting Sound and Space
Hearing Spatial Detail
Does Spatialized Audio Change Everything?
Intensity Stereo Uses only differences in intensity between two channels to create stereo image Two microphone diaphragms are placed as close together.
Using HRTFs for virtual sound source positioning Lecture Example
Wave Field Synthesis Roger Vargas PHYS 536.
Presentation transcript:

D1 - 29/05/2014 France Télécom Recherche & Développement Workshop « From 5.1 to Sound Field Synthesis..." AES 120th Convention, Paris 2006 Higher Order Ambisonics: promises and reality Jérôme Daniel, France Telecom R&D

#2 Traditional 1st order Ambisonics: B-Format encoding Panoramic sound recording Coincident omni (W) and bidirectional (X,Y) microphones Front-back, Left-Right separation Directional information = amplitude relationships Description of wave propagation direction & speed localization Independent of any loudspeaker layout Front (X) Back Left (Y) Right

#3 Reproduction over loudspeakers : spatial decoding Simulate any coincident mic setup Recombine B-Format directivity patterns Decoding operation: matrix signals W,X,Y One virtual microphone per loudspeaker... as many as wanted, but... … sound image blur remains the same + - = + = B-Format Front (X) Back Left (Y) Right

#4 Reproduction over loudspeakers : spatial decoding Simulate any coincident mic setup Recombine B-Format directivity patterns Decoding operation: matrix signals W,X,Y One virtual microphone per loudspeaker... as many as wanted, but... … sound image blur remains the same Optimized decoding for localization (LF < z) Reproduce true wave propagation at the listener scale ( good ITD) HF (> Hz) Concentrate energy contributions in the expected direction ( less altered ILD, ITD) Front (X) Back Left (Y) Right minimise opposite contributions Compromise for large area [Malham] Optimize localization at the sweet spot [Gerzon]

#5 "Traditional" 1st order Ambisonics: pros & cons Pros Compact multichannel format (no redundancy) Spatial homogeneity Acoustic fidelity (regarding propagation properties) Easily extended to 3D (additional Z) Flexibility: sound field transformation; reproduction setups Commercialized B-Format microphones (eg SoundField) Cons Blurred / unstable sound images ("tiny" sweet spot) Not well adapted to irregular/unbalanced loudspeaker arrangements (esp. ITU setup) Limitations due to low directivity of usual mikes, esp. at LF... thats why non-coincident microphone approaches might be preferred

#6 Introducing Higher Order Ambisonics (HOA) Increase angular discrimination in spatial encoding add directivities with "faster" angular variation Front (X) Back Left (Y) Right 1st order 2nd order3rd order4th order

#7 Introducing Higher Order Ambisonics (HOA) Increase angular discrimination in spatial encoding add directivities with "faster" angular variation Increase angular selectivity of loudspeakers contributions selective virtual microphone directivities better use of narrowed loudspeakers Front (X) Back Left (Y) Right ++++ = = ==

#8 Introducing Higher Order Ambisonics (HOA) Increase angular discrimination in spatial encoding add directivities with "faster" angular variation Increase angular selectivity of loudspeakers contributions selective virtual microphone directivities better use of narrowed loudspeakers Front (X) Back Left (Y) Right 1st order2nd order 3rd order4th order

#9 Rendering properties of higher spatial resolution Acoustic reconstruction Enlarged sweet area "Holophony" [Nicol, Daniel] Enhanced distance encoding control of the wave curvature monochromatic plane wave (f=600Hz) 1st order 2 nd order 5th order10th order Quality of sound images: localization clues for a centred listener spherical wave (R=1m) (gaussian pulse) Order M1234 f lim 700 Hz1300 Hz1900 Hz2500 Hz E 45°30°22.5°18° good reconstruction ( good ITD) up to f lim blur angle due to HF clues alteration (ILD&ITD) above f lim

#10 Compatibility with irregular/unbalanced arrangements Synthesize directivities adapted to ITU inter-loudspeaker angles From 4th order ambisonics [Craven, 2003] Using 5th order resolution [Laborie et al]: better front channels separation Possible decoding criterion (among others): imitate pair-wise pan-pot [Craven, 2003] [Laborie et al]

#11 Compatibility with irregular/unbalanced arrangements Synthesize directivities adapted to ITU inter-loudspeaker angles From 4th order ambisonics [Craven, 2003] Using 5th order resolution [Laborie et al]: better front channels separation Possible decoding criterion (among others): imitate pair-wise pan-pot 4th order decoding over enriched ITU setup (5+2+1) C (0°), L&R(+-30°), S L &S R (+-120°) … + L&R(+-70°) … + B (180°) Demonstration on a 8-loudspeaker setup (kindly provided by Cabasse) = "energy vector" (* = target, ie ideal sound image)

#12 Extension to 3D encoding and reproduction 3D encoding and decoding Dynamic binaural reproduction Virtual loudspeakers doesnt sound so good Enhanced method: better efficiency (CPU) & rendering Sound field rotation driven by head-tracker Demo : Poster session P31, Tuesday, 14: :30 Encoding into 3D HOA Format Reproduction over a 3D rig Reproduction over headphones Spatial decoding (similar to 2D) Head-tracker Virtualization: HRTF filtering K N LdSpksignals K HOAsignals Sound Field Rotation

#13 First conclusion on Higher Order Ambisonics Pros Scalable multichannel format Spatial homogeneity Acoustic fidelity + "high spatial definition" Wave field reconstruction Easily extended to 3D – Efficient binaural spatialisation Even more flexibility: sound field transformation; reproduction setups, including irregular arrangements like ITU Cons nothing? What do we need in practice? HOA (or « high spatial resolution ») microphone systems Spatial processing tools

#14 Higher Order Ambisonics Microphone Systems Synthesis of Spherical Harmonics Extension of differential microphones: Pressure gradient and higher order derivatives using non-coincident acoustic sensors! Non concentric sensor distribution (Trinnov) Distribution over a rigid sphere (FT) [Meyer, Elko, Kubli] [Rafaely] [Ward, Abhayapala]… Trade-off on the size of the array –bigger is better to have spatial resolution at LF –smaller is better to reduce spatial aliasing (at HF) A few words on FT prototype Designed for "proof of concept" (homogeneous 3D) 32 sensors 4th order 3D (and even 5th order 2D) Objective measurements & validation [Moreau et al] Poster session P31, Tuesday, 14: :30

#15 Tools and applications Recording and mixing tools Prototypes of HOA mic (FT, Trinnov) Suite of VST plug-ins demo Use in common audio edition tools, or interactive audio progr. Applications Music, documentary, fictions Sharing of events/ambiances (eg familial use), teleconferences Interactive audio and multi-media: –A flexible multi-channel 3D audio format –Games, Virtual/Mixt Reality –News nodes for virtual scene description in MPEG4 (AudioBIFSV3) –label a multi-channel stream as a HOA content (AudioChannelConfig) –a new kind of sound object that describes a Surrounding Sound Field (SurroundingSound)

#16 Demonstrations Loudspeaker reproduction Reproduction of 4th order 3D recordings over enriched ITU setup (5 to 8 ldspk) Acknowledment: Many thanks to Cabasse and R&D manager Yvon Kernéis Head-tracked binaural reproduction [Moreau et al] Poster session P31, Tuesday, 14: :30 Could also be shown after this workshop