Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.

Slides:

Advertisements

Similar presentations

Change-Point Detection Techniques for Piecewise Locally Stationary Time Series Michael Last National Institute of Statistical Sciences Talk for Midyear.

Advertisements

The Fully Networked Car Geneva, 4-5 March Jean-Pierre Jallet Car Active Noise Cancellation for improved car efficiency, From/In/To car voice communication.

A Phonetician ’ s Guide to Audio Formats Chilin Shih University of Illinois at Urbana Champaign LSA 2006January 5-8, 2006.

Digital Signal Processing

Chunyi Peng, Guobin Shen, Yongguang Zhang, Yanlin Li, Kun Tan BeepBeep: A High Accuracy Acoustic Ranging System using COTS Mobile Devices.

Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),

Speech Enhancement through Noise Reduction By Yating & Kundan.

Spectral envelope analysis of TIMIT corpus using LP, WLSP, and MVDR Steve Vest Matlab implementation of methods by Tien-Hsiang Lo.

Guitar Effects Processor Using DSP

The frequency spectrum

SIMS-201 Characteristics of Audio Signals Sampling of Audio Signals Introduction to Audio Information.

IT-101 Section 001 Lecture #8 Introduction to Information Technology.

SYED SYAHRIL TRADITIONAL MUSICAL INSTRUMENT SIMULATOR FOR GUITAR1.

Image and Sound Editing Raed S. Rasheed Sound What is sound? How is sound recorded? How is sound recorded digitally ? How does audio get digitized.

6/3/20151 Voice Transformation : Speech Morphing Gidon Porat and Yizhar Lavner SIPL – Technion IIT December

1 A Tool for System Simulation: SIMULINK Can be used for simulation of various systems: – Linear, nonlinear; Input signals can be arbitrarily generated:

Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.

Effective Bits. An ideal model of a digital waveform recorder OffsetGain Sampling Timebase oscillator Fs ADC Waveform Memory Address counter Compute Engine.

Our Goal for Today Record vowel sounds onto PC’s Record vowel sounds onto PC’s Analyze using Matlab Analyze using Matlab Identify characteristics of what.

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

EE513 Audio Signals and Systems Wiener Inverse Filter Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.

0 - 1 © 2007 Texas Instruments Inc, Content developed in partnership with Tel-Aviv University From MATLAB ® and Simulink ® to Real Time with TI DSPs Measuring.

Basics of Signal Processing. frequency = 1/T  speed of sound × T, where T is a period sine wave period (frequency) amplitude phase.

LE 460 L Acoustics and Experimental Phonetics L-13

Sine Waves. Notation s 0 … s n … s N or s(0), … s(n), … s(N) Sketch the following digital signals: δ (n) = 1, n = 0 = 0, otherwise u (n) = 1, n >= 0 =

SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.

Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.

Sound Localization PART 2 Ali Javed, Josh Manuel, Brunet Breaux, Michael Browning.

Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.

Automatic Pitch Tracking September 18, 2014 The Digitization of Pitch The blue line represents the fundamental frequency (F0) of the speaker’s voice.

Digital Speech Transmission and Recovery. Overall System Output (speaker) Channel (coax cable) Receiver Circuit Input (microphone) Transmitter Circuit.

Time Series Analysis of Elephant Acoustic and Seismic Signals Alex Williamson Physics Dept.

Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.

Automatic Pitch Tracking January 16, 2013 The Plan for Today One announcement: Starting on Monday of next week, we’ll meet in Craigie Hall D 428 We’ll.

Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.

Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.

ECE 4710: Lecture #9 1 PCM Noise  Decoded PCM signal at Rx output is analog signal corrupted by “noise”  Many sources of noise:  Quantizing noise »Four.

Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.

Sound on the Web. Using Sound on a Web Site Conveying information  pronounce a word or describe a product Set a mood  music to match the web page scene.

Group Members: Sam Marlin, Jonathan Brown Faculty Adviser: Tom Miller.

Jacob Zurasky ECE5526 – Spring 2011

Introduction to SOUND.

Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.

MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES INTRODUCTION 6/1/ A.Aruna, Assistant Professor, Faculty of Information Technology.

Chaparral Physics Research

ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska

EE513 Audio Signals and Systems

Sonia Hingorany & Liza Cyriac EE113D – Professor Rajeev Jain & TA Rick Huang– Winter 2008.

Indoor Location Detection By Arezou Pourmir ECE 539 project Instructor: Professor Yu Hen Hu.

CSCI-100 Introduction to Computing Hardware Part II.

Automatic Equalization for Live Venue Sound Systems Damien Dooley, Final Year ECE Progress To Date, Monday 21 st January 2008.

Automatic Equalization for Live Venue Sound Systems Damien Dooley, Final Year ECE Initial Presentation, Tuesday 2 nd October 2007.

IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.

Digital Oscillators. Everything is a Table A table is an indexed list of elements (or values) A digital oscillator or soundfile is no different.

Project-Final Presentation Blind Dereverberation Algorithm for Speech Signals Based on Multi-channel Linear Prediction Supervisor: Alexander Bertrand Authors:

Impulse Response Measurement and Equalization Digital Signal Processing LPP Erasmus Program Aveiro 2012 Digital Signal Processing LPP Erasmus Program Aveiro.

1 What is Multimedia? Multimedia can have a many definitions Multimedia means that computer information can be represented through media types: – Text.

Adobe AuditionProject 4 guide © 2012 Adobe Systems IncorporatedHow to record narration1 You can record narration for your video directly into Adobe Audition.

Speech Recognition Created By : Kanjariya Hardik G.

LIGO-G Z S5 calibration: time dependent coefficients  Myungkee Sung, Gabriela González, Mike Landry, Brian O’Reilly, Xavier Siemens,…

ARENA08 Roma June 2008 Francesco Simeone (Francesco Simeone INFN Roma) Beam-forming and matched filter techniques.

ADAPTIVE BABY MONITORING SYSTEM Team 56 Michael Qiu, Luis Ramirez, Yueyang Lin ECE 445 Senior Design May 3, 2016.

Spectral subtraction algorithm and optimize Wanfeng Zou 7/3/2014.

Hi-Fi Digital Audio Compensation System Patrick Cronin Robert Galvin Matt Saterbak Kent Thomson Nick Turner Ryan Twaddle Advisor: Professor Jaijeet Roychowdhury.

Measurement and Instrumentation

XP Practical PC, 3e Chapter 14 1 Recording and Editing Sound.

CS 591 S1 – Computational Audio -- Spring, 2017

ARTIFICIAL NEURAL NETWORKS

Linear Predictive Coding Methods

Bandwidth Extrapolation of Audio Signals

Presentation transcript:

Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This makes speech recognition less reliable. Choosing a Test Sound Two different test sounds were tested to find which sound worked better. One test sound was a sine wave increasing in frequency from 80Hz to 8000Hz. The other test sound was pink noise. Speech was recorded and filtered from four different people. These files were filtered using the coefficients produced by the least mean square algorithm. These recordings were then tested against two different speech recognizers, the one built into Windows XP and the one built into Windows 7. The Windows 7 recognizer had a higher baseline success rate than the XP recognizer. Overall, the filter created from the pink noise fixed more speech recognition errors than the other filter. Also, all but one of the phrases fixed by the filter from the sine wave were also fixed by the pink noise. For the Future Test more filter lengths, iterations, gains, sound files Insert filter into Windows Mobile recording stack Add options to the program to change the filter creation parameters Jonathan Brown: <> Sam Marlin: <> Advisor: W. T. Miller Proposed Solution Using digital signal processing, a filter will be created to “undo” the distortion caused by the poor quality microphone. This process will be able to generate a filter for any handheld that uses the Windows Mobile platform, creating a custom tailored filter based on the acoustic characteristics of each device. Reference audio files, with known frequency components, will be used to find what frequencies are attenuated by the handheld. Testing the Code All the code was first done in Matlab for testing purposes. The code was then ported to C# for final deployment. Save the Filter The filter coefficients are then saved into the registry of the handheld device for use by any audio recording or voice recognition application. Record Test Sound Play an ideal test sound from the computer while recording it on the handheld. Create the Filter The program on the computer will compare the test sound and the recorded sound to create the filter. Setup Setup computer, speakers and handheld device. Steps of the Solution Process Lining up the Sound Files Each test sound file had 10 cycles of a 440Hz sine wave at its start. This knowledge was used to line up the two sound files through cross- correlation. Problem The above equation did not line up the sound files for all time. The time steps in each of the sounds are different, after 1000 samples the files would noticeably unaligned. To fix this, cross-correlation was used again to match the indexes in one file to another. Creating the Filter The least mean square algorithm was used to create the filter coefficients. For this algorithm to work, the test files have to be lined up in time. This algorithm has many different variables, so tests were done to find best filter parameters to solve the problem. The sine wave test sound file was used in these parameter tests. Choosing the numbers depended on two values, the RMS of the error value used in the algorithm and if the filter coefficients changed by varying the iterations. Numbers used in testing: Gain: 0.001, , Iterations: 500 to 3900 in steps of 200 Filter Size: 257 I = the ideal waveform NI = the non ideal waveform FC = the filter coefficients FS = filter size e = equalization error g = the gain Windows 7 Speech Recognizer NoiseUnfilteredFilter from Sine WaveFilter from Pink Noise Recognized Broke-56 Fixed-615 Final Values: Gain: Iterations: 1500 Filter Size: 257 Conclusions The filter developed using the pink noise test signal resulted in a statistically significant improvement in speech recognizer performance at the 90% confidence level (from 79% to 83.5 % correct). This indicates that the technique could provide a functionally significant improvement in practice, and warrants further investigation.