Design of a Coincident Microphone Array for 5.1- Channel Audio Recording Using the Mid-Side Recording Technique Jong Kun Kim, Chan Jun Chun, and Hong Kook.

Slides:



Advertisements
Similar presentations
How to record the sound of works of art Robert Hermann Diplom Tonmeister
Advertisements

Figures for Chapter 7 Advanced signal processing Dillon (2001) Hearing Aids.
Hearing relative phases for two harmonic components D. Timothy Ives 1, H. Martin Reimann 2, Ralph van Dinther 1 and Roy D. Patterson 1 1. Introduction.
EE2F2 - Music Technology 2. Stereo and Multi-track Recording.
A practical DSP solution for mobile phone stereo sound recording.
Listening Tests and Evaluation of Simulated Sound Fields Using VibeStudio Designer Wersényi György Hesham Fouad SZÉCHENYI ISTVÁN UNIVERSITY, Hungary VRSonic,
SWE 423: Multimedia Systems Chapter 3: Audio Technology (2)
Acoustic Model Adaptation Based On Pronunciation Variability Analysis For Non-Native Speech Recognition Yoo Rhee Oh, Jae Sam Yoon, and Hong Kook Kim Dept.
Chapter 7 Principles of Analog Synthesis and Voltage Control Contents Understanding Musical Sound Electronic Sound Generation Voltage Control Fundamentals.
Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.
Top Level System Block Diagram BSS Block Diagram Abstract In today's expanding business environment, conference call technology has become an integral.
Phasors ET 242 Circuit Analysis II Electrical and Telecommunication
Series Resonance ET 242 Circuit Analysis II
Parallel Resonance ET 242 Circuit Analysis II Electrical and Telecommunication Engineering Technology Professor Jang.
3-D Spatialization and Localization and Simulated Surround Sound with Headphones Lucas O’Neil Brendan Cassidy.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
Dolby AC-3 Audio Encoding & THX Wai Kam (Winnie) Henele Adams Peter Boettcher.
Integrated Circuits Design for Applications in Communications Dr. Charles Surya Department of Electronic and Information Engineering DE636  6220
Project Presentation: March 9, 2006
Sample rate conversion At times, it will be necessary to convert the sampling rate in a source signal to some other sampling rate Consider converting from.
1 Ambisonics: The Surround Alternative Richard G. Elen The Ambisonic Network.
3-D Sound and Spatial Audio MUS_TECH 348. Multi-Loudspeaker Reproduction: Surround Sound.
A Full Frequency Masking Vocoder for Legal Eavesdropping Conversation Recording R. F. B. Sotero Filho, H. M. de Oliveira (qPGOM), R. Campello de Souza.
Sarah Middleton Supervised by: Anton van Wyk, Jacques Cilliers, Pascale Jardin and Florence Nadal 3 December 2010.
Alternating-Current Circuits Chapter 22. Section 22.2 AC Circuit Notation.
infinity-project.org Engineering education for today’s classroom 53 Design Problem - Digital Band Build a digital system that can create music of any.
ADVANCED RADIO PRODUCTION Books: “Modern Radio Production” by Hausman, Benoit, Messere, & O’Donnell: Chapter 15 Pertemuan 12 Matakuliah: O Dasar-Dasar.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
Getting the most with a Stereo pair of mics. Spaced Pair (or A-B Stereo) Often used with Omnidirectional Mics, captures the room tone The ear senses time.
CSP Auditory input processing 1 Auditory input processing Lecturer: Smilen Dimitrov Cross-sensorial processing – MED7.
 Process of recording audio on set  Starts in pre-production  Specific crew positions  Goal is to capture the cleanest possible recording of set dialogue.
Mono and Stereo Miking Techniques. Choosing Microphones Limited collection: useful for broad range of applications  Neumannn KM 184’s (desert island.
ELECTRICAL CIRCUIT ET 201 Define and explain characteristics of sinusoidal wave, phase relationships and phase shifting.
Wireless and Mobile Computing Transmission Fundamentals Lecture 2.
MULTIMEDIA DEFINITION OF MULTIMEDIA
VID102 DAY 3.  Schedule Change  Viewing Examples  Audio Principles Class Schedule.
Rumsey Chapter 16 Day 3. Overview  Stereo = 2.0 (two discreet channels)  THREE-DIMENSIONAL, even though only two channels  Stereo listening is affected.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Timo Haapsaari Laboratory of Acoustics and Audio Signal Processing April 10, 2007 Two-Way Acoustic Window using Wave Field Synthesis.
Estimation of Sound Source Direction Using Parabolic Reflection Board 2008 RISP International Workshop on Nonlinear Circuits and Signal Processing (NCSP’08)
Chapter 6. Effect of Noise on Analog Communication Systems
MONO SOUND. In everyday life we listen with two ears. As we compare these two separate sound images of the external world, they build a three dimensional.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
Panorama Space Modeling Method for Observing an Object Jungil Jung, Heunggi Kim, Jinsoo Cho Gachon Univ., Bokjeong-dong, Sujeong-gu, Seongnam-si, Gyeonggi-do,
Analysis of Security Communication Based on Digital Retrodirective Antenna Junyeong Bok 1,1, Heung-Gyoon Ryu 1 1 Department of Electronics and Engineering,
Active Microphone with Parabolic Reflection Board for Estimation of Sound Source Direction Tetsuya Takiguchi, Ryoichi Takashima and Yasuo Ariki Organization.
Longitudinal Motion Characteristics between a Non- Matched Piezoelectric Sensor and Actuator Pair Young-Sup Lee Department of Embedded Systems Engineering,
Surround-Adaptive Local Contrast Enhancement for Preserved Detail Perception in HDR Images Geun-Young Lee 1, Sung-Hak Lee 1, Hyuk-Ju Kwon 1, Tae-Wuk Bae.
Advanced Science and Technology Letters Vol.32 (Architecture and Civil Engineering 2013), pp A Study on.
Effect Analysis of Electric Vehicle Charging to Smart Grid with Anti-Islanding Method Bum-Sik Shin, Kyung-Jung Lee, Sunny Ro, Young-Hun Ki and Hyun-Sik.
Advanced Science and Technology Letters Vol.31 (ACN 2013), pp Measurement Method for Mean Opinion Score.
Advanced Science and Technology Letters Vol.28 (EEC 2013), pp Histogram Equalization- Based Color Image.
Robot Velocity Based Path Planning Along Bezier Curve Path Gil Jin Yang, Byoung Wook Choi * Dept. of Electrical and Information Engineering Seoul National.
Development and Test of Simultaneous Power Analysis System for Three-Phase and Four-Wire Power System Hun Oh 1, In Ho Ryu 2 and Jeong-Chay Jeon 3 * 1 Department.
Advanced Science and Technology Letters Vol.74 (ASEA 2014), pp Development of Optimization Algorithm for.
Speech Enhancement using Excitation Source Information B. Yegnanarayana, S.R. Mahadeva Prasanna & K. Sreenivasa Rao Department of Computer Science & Engineering.
Advanced Science and Technology Letters Vol.28 (EEC 2013), pp Fuzzy Technique for Color Quality Transformation.
VLSI Design of View Synthesis for 3DVC/FTV Jongwoo Bae' and Jinsoo Cho 2, 1 Department of Information and Communication Engineering, Myongji University.
Antenna Arrays and Automotive Applications
Presentation 8 – Surround Sound
1 Angle Demodulator using AM FM demodulators first generate an AM signal and then use an AM demodulator to recover the message signal.  To transform the.
Electronics & communication sem: 5TH year: AUDIO VIDEO SYSTEM
Fantasound Developed by Disney for “Fantasia” (1940)
4aPPa32. How Susceptibility To Noise Varies Across Speech Frequencies
Recording for Surround Sound
Seunghui Cha1, Wookhyun Kim1
Objective % Select and utilize tools for digital audio production.
Path Curvature Sensing Methods for a Car-like Robot
Kocaeli University Introduction to Engineering Applications
Intensity Stereo Uses only differences in intensity between two channels to create stereo image Two microphone diaphragms are placed as close together.
Presentation transcript:

Design of a Coincident Microphone Array for 5.1- Channel Audio Recording Using the Mid-Side Recording Technique Jong Kun Kim, Chan Jun Chun, and Hong Kook Kim School of Information and Communications Gwangju Institute of Science and Technology (GIST), Gwangju , Korea {jonggun, cjchun, Abstract. In this paper, a 5.1-channel audio recording method is proposed by applying the mid-side (MS) stereo recording technique to a coincident micro-phone array. The coincident microphone array consists of a backward-facing cardioid microphone and the MS microphone array which is composed of a pair of forward-facing cardioid and sideways-facing figure-eight microphones. These 3 microphones produce two bi-directional signals and one uni-directional signal through additive and subtractive procedures. Using these signals, the ro-tated uni-directional signals are produced and synthesized at each angle accord-ing to the 5.1-channel speaker array. The performance of the proposed 5.1-channel audio recording method is evaluated using a directional response test and a localization test. It is shown from the tests that the proposed recording method provides a beam pattern well at each direction on the basis of the 5.1-channel speaker configuration and has localization performance within ±8.2 de-grees. Keywords: Multi-channel audio recording, coincident microphone array, mid- side recording 1 Introduction There has been increased development in video and audio fields. In particular, the technologies related to video systems, such as 3DTV and high definition television (HDTV), have advanced [1]. With these recent developments in video, there has also been an increased demand for even more realistic audio services [2]. In order to en-hance the performance of audio fields, multi-channel audio systems should be consid-ered. Although the demand for multi-channel audio systems has increased, there is still a shortage of multi-channel audio content. Instead of recording multi-channel audio contents, an upmixing method that converts mono- or stereo-audio into 5.1 channel audio signals is widely used for providing various content [3][4]. However, such a method degrades the localization performance because of a lack of directional information. As an alternative, some spaced microphone configurations have been 61

primarily used for 5.1-channel recording [5]. However, these configurations need a Fig. 1. (a) A microphone array using coincident three microphones and (b) a block diagram of the proposed 5.1-channel audio recording method large space to allocate five microphones and expensive recording equipments. In this paper, a cost-efficient method of 5.1-channel audio recording is proposed by applying the mid-side (MS) stereo recording technique to a coincident microphone array. By doing this, we can achieve improved localization performance for 5.1-channel audio content. 2 Proposed 5.1-Channel Audio Recording Method The MS stereo recording technique can generate stereo signals by combining two different microphones such as a coincident-forward-facing cardioid microphone, U F, which is called as M, and a sideways-facing figure-eight microphone, B S, which is called S [5]. In other words, right channel signal is made by adding signals from U F and B S, while left channel signal is made by subtracting the U F micro- phone signal from the B S microphone signal. In order to record 5.1 channels, a backward-facing cardioid microphone, U B, is added to the MS configuration. As shown in Fig. 1(a), the bold solid line indicates a beam pattern of U B. Fig. 1(b) shows a block diagram of the proposed 5.1-channel audio recording method using this microphone array. A bi-directional signal with a directional axis of 0°, b p (n), is first obtained by subtracting signals from U F and U B. In addition, an omni-directional signal, o p (n) is obtained by adding the U F microphone signal from U B. These two signals, b p (n) and o p (n), are processed with a bi-directional signal, bac (n), in the direction formation module. In fact, since three microphones are placed very closely, the phase and time difference between three signals are indistinguishable. Instead, there are the amplitude differences of 62

three microphone signals depending on the direction of audio source. Therefore, for a Fig. 2. Results of(a) the directional response test and (b) sound localization test. given direction, b p ( n ) and b ac ( n ) are panned using the amplitude panning law [6]. Next, the panned signal is added by o p ( n ) to generate a uni-directional signal, y  ( n ). That is, y  ( n ) is represented as [7] y ( n ) ( o ( n ) cos ( ) sin ( )) / 2  p    p  b n   ac  b n (1) where  is an arbitrary direction. In the channel synthesis, 5.1-channel signals are obtained by changing  on the basis of a speaker configuration defined by the ITU-R Recommendation BS [8]. In other words, five signals for center (C), front left (FL), front right (FR), rear left (RL), and rear right (RR) are obtained using Eq. (1) by setting  to 0°, 30°, - 30°, 110°, and - 110°, respectively. Finally, a signal for the low-frequency enhancement (LFE) channel is generated by applying a low-pass filter (LPF) to the omnidirectional signal o p ( n ). In this paper, the cut-off frequency of the LPF is set to 120 Hz since the LFE channel is used for improving bass quality without any directional information. 3 Performance Evaluation In order to demonstrate the performance of the proposed recording method, we per- formed two different tests such as a directional response test and a sound localization test. For the first test, a white noise signal was prepared to analyze all frequency bands. The coincidence microphone set as mentioned in Section 2 was positioned at the center, and a mono-speaker was arranged at 0° in the horizontal plane. The coin-cident microphone set was rotated from 0° to 360° at a step of 10° in a non- 63

reverberant audio booth. For the second test, three audio files including speech, mu-sic, and white noise were also prepared and played using the 5.1-channel speaker configuration defined by the ITU-R BS Next, we recorded these audio files by the coincidence microphone set and measured the perceived source directions of the recorded audio files. Fig. 2(a) shows the directional response of each channel. As shown in the figure, the proposed recording method formed a beam pattern well at each direction. In addition, Fig. 2(b) shows the sound localization test result. It was shown from the figure that the proposed recording method could localize audio within an average of ±8.2 degrees. 4 Conclusion In this paper, we proposed a 5.1-channel audio recording method using a coincident microphone array that was composed of a backward-facing cardioid microphone and an MS microphone array. First of all, we constructed a relationship between three signals from the coincident microphone array. After that, five channel signals were synthesized by differently setting the relationship according to an angle corresponding to a 5.1- channel speaker configuration defined by the ITU-R BS It was shown from a directional response test and a sound localization test that the proposed record-ing method provided a well-formed beam pattern and had a localization performance within an average of ±8.2 degrees. Acknowledgments. This work was supported in part by the National Research Foun- dation of Korea (NRF) grant funded by the Korea government (MEST) (No ). References 1.Hamasaki, K., Hiyama, K., Nishiguchi, T., Ono, K.: Advanced multichannel audio systems with superior impression of presence and reality. In: Proceedings of 116th AES Convention, (2004) Preprint Begault, D. R.: 3-D Sound for Virtual Reality and Multimedia. Academic Press, Cambridge, MA, (1994) 3.Irwan, R., Aarts, R. M.: Two-to-five channel sound processing. J. Audio Eng. Soc. 50(11), (2002) 4.Chun, C. J., Kim, Y. G., Yang, J. Y., Kim, H. K.: Real-time conversion of stereo audio to 5.1 channel audio for providing realistic sounds. International Journal of Signal processing, Im-age processing and Pattern Recognition 2(4), (2009) 5.Rumsey, F.: Spatial Audio. Focal Press, Oxford, UK (2001) 6.West, J. R.: Five-channel Panning Laws: an Analytical and Experimental Comparison. M.S. Thesis, Department of Music Engineering, University of Miami, Coral Gables, FL (1998) 7.Streicher, R., Dooley, W.: Basic Stereo Microphone Perspectives-A Review. J. Audio Eng. Soc. 33(7), (1985) 8.ITU-R Recommendation BS.775-1: Multi-channel Stereophonic Sound System with or without Accompanying Picture, (1994) 64