Craig Holmes Brad Klippstein Andrew Pottkotter Dustin Osborn.

Slides:



Advertisements
Similar presentations
| Page Angelo Farina UNIPR | All Rights Reserved | Confidential Digital sound processing Convolution Digital Filters FFT.
Advertisements

DCSP-12 Jianfeng Feng
Building an ASR using HTK CS4706
Masters Presentation at Griffith University Master of Computer and Information Engineering Magnus Nilsson
CMPS1371 Introduction to Computing for Engineers PROCESSING SOUNDS.
Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.
Auto-tuning for Electric Guitars using Digital Signal Processing Pat Hurney, 4ECE 31 st March 2009.
Digital Signal Processing
Speech Sound Production: Recognition Using Recurrent Neural Networks Abstract: In this paper I present a study of speech sound production and methods for.
CMSC Assignment 1 Audio signal processing
F 鍾承道 Acoustic Features for Speech Recognition: From Mel-Frequency Cepstrum Coefficients (MFCC) to BottleNeck Features(BNF)
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Introduction to Wavelets
A PRESENTATION BY SHAMALEE DESHPANDE
 Principles of Digital Audio. Analog Audio  3 Characteristics of analog audio signals: 1. Continuous signal – single repetitive waveform 2. Infinite.
Representing Acoustic Information
Graphic Equalizer Table By Jose Lerma. Main Idea The main idea of this table is to display the frequencies of any sound or audio input, either by microphone.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Chapter 2. Image Analysis. Image Analysis Domains Frequency Domain Spatial Domain.
Lecture 1 Signals in the Time and Frequency Domains
(2.1) Fundamentals  Terms for magnitudes – logarithms and logarithmic graphs  Digital representations – Binary numbers – Text – Analog information 
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling V. Karjigi , P. Rao Dept. of Electrical Engineering,
The Wavelet Tutorial: Part3 The Discrete Wavelet Transform
Is there a relationship between the lengths of body parts ?
CSC361/661 Digital Media Spring 2002
1 CS 551/651: Structure of Spoken Language Lecture 8: Mathematical Descriptions of the Speech Signal John-Paul Hosom Fall 2008.
Dual-Channel FFT Analysis: A Presentation Prepared for Syn-Aud-Con: Test and Measurement Seminars Louisville, KY Aug , 2002.
Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
Spectral Analysis AOE March 2011 Lowe 1. Announcements Lectures on both Monday, March 28 th, and Wednesday, March 30 th. – Fracture Testing –
Implementing a Speech Recognition System on a GPU using CUDA
Complex Variables & Transforms 232 Presentation No.1 Fourier Series & Transforms Group A Uzair Akbar Hamza Saeed Khan Muhammad Hammad Saad Mahmood Asim.
Jacob Zurasky ECE5526 – Spring 2011
Wireless Sensor Project Search Triangulation Aerial Rescue Team (START)
Speech Signal Representations I Seminar Speech Recognition 2002 F.R. Verhage.
Speaker Recognition by Habib ur Rehman Abdul Basit CENTER FOR ADVANCED STUDIES IN ENGINERING Digital Signal Processing ( Term Project )
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Overview ► Recall ► What are sound features? ► Feature detection and extraction ► Features in Sphinx III.
Speech Recognition Feature Extraction. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
Look who’s talking? Project 3.1 Yannick Thimister Han van Venrooij Bob Verlinden Project DKE Maastricht University.
Noise Reduction Two Stage Mel-Warped Weiner Filter Approach.
Ch4 Short-time Fourier Analysis of Speech Signal z Fourier analysis is the spectrum analysis. It is an important method to analyze the speech signal. Short-time.
CSCI-100 Introduction to Computing Hardware Part II.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
AUDIOFILES Harika Basana ), Elizabeth Chan ), Nikolai ), Frank Zhang ) 6100.
Automatic Equalization for Live Venue Sound Systems Damien Dooley, Final Year ECE Progress To Date, Monday 21 st January 2008.
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
Speech Processing Using HTK Trevor Bowden 12/08/2008.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 20,
Team 6. Guitar Audio Amplifier Audio Codec DSP Wireless Adapter Motor Array PC LCD Display LED Arrays Pushbutton or RPG Input Device
Sound Source Location Stand Group 72: Hiroshi Fujii Chase Zhou Bill Wang TA: Katherine O’Kane.
ADAPTIVE BABY MONITORING SYSTEM Team 56 Michael Qiu, Luis Ramirez, Yueyang Lin ECE 445 Senior Design May 3, 2016.
ΨΗΦΙΑΚΟΣ ΕΛΕΓΧΟΣ (22Δ802) Β΄ ΕΞΑΜΗΝΟ Καθηγητής Πέτρος Π. Γρουμπός  Ώρες Γραφείου: Τετάρτη Πέμπτη Παρασκευή 11:00- 12:00 Γραφείο: 1.
Sound Controlled Smoke Detector Group 67 Meng Gao, Yihao Zhang, Xinrui Zhu 1.
TUTORIAL FOR USING AUDACITY DESIGNED BY ND SINH/ND CHUONG Name of the tool: Audacity URL Link: Audacity Download (
Emotional Intelligence Vivian Tseng, Matt Palmer, Jonathan Fouk Group #41.
GCSE COMPUTER SCIENCE Topic 3 - Data 3.2 Data Representation.
PATTERN COMPARISON TECHNIQUES
The Physics of Sound.
ARTIFICIAL NEURAL NETWORKS
Speech Processing AEGIS RET All-Hands Meeting
Spoken Digit Recognition
Cepstrum and MFCC Cepstrum MFCC Speech processing.
Number Systems and Codes
Ala’a Spaih Abeer Abu-Hantash Directed by Dr.Allam Mousa
Digital Systems: Hardware Organization and Design
AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION
Recap In previous lessons we have looked at how numbers can be stored as binary. We have also seen how images are stored as binary. This lesson we are.
Keyword Spotting Dynamic Time Warping
Presentation transcript:

Craig Holmes Brad Klippstein Andrew Pottkotter Dustin Osborn

Wind Turbines have become a great source of alternative “green” energy Lake Erie and North West Ohio have the potential to be a great source of wind energy. North West Ohio also happens to be one of the largest migratory bird fly-ways in the country Before wind turbines can be installed questions need to be answered about the effect they will have on the avian population in the area

“As wind energy facilities becomes substantially more numerous and as wind development continues to grow, fatalities and thus the potential for biologically significant impacts to local populations increases” – National Wind Coordinating Collaborative The BBCER strives to create a way to identify what avian species are being impacted by the wind turbines and to allow for conclusions to be made about what effect they are having on the species.

Bird/Bat Calls Embedded Recognition System Bird SystemBat System Hardware 10 bit ATMega 328 A/D microcontrollerAR125 Ultrasonic Receiver Microphones StandFR125-III Field Recorder Four UEM-88 Mini Shotgun Microphones MAYA-44 PC Software Bird Call LibrarySPECT'R Software Raven-XSCAN'R Software MatlabSonoBat Software

Incoming Unknown Bird Call Take First 512 Data Points from Call Apply Hamming Window to Data Points Derive Fourier Transform of Data Points Find the Magnitude of the Fourier Transform Multiply by the Mel Frequency Scale Gather Mel Frequency Cepstral Coefficients Apply Discrete Cosine Transfrom to MFCC's Place First 9 Coefficients in MFCC Bank Move to Next 512 Data Points from Call Once through all 220,500 Data Points of Call, Send MFCC Bank to Correlation Algorithm for Matching

“Scale of pitches judged by listeners to be equal in distance from one another”[1] Converts Hertz scale to Mel scale.

Cepstral: Fourier Transform of the log spectrum. [1] Mel Frequency Cepstrum (MFC): Representation of the short term power spectrum of sound, based on a linear cosine transform of a log power spectrum on a nonlinear Mel scale of frequency.[1] Mel Frequency Cepstral Coefficients (MFCC): Coefficients that collectively make up an MFC[1]

Incoming Unknown MFCC Bank Calculate Pearson's Correlation Coefficient with first database call Calculate Next Pearson's Correlation Coefficient for next known call Repeat calculating correlation coefficient for all database calls Output call with highest correlation as best match If all calculated correlation coefficients below threshold then unknown call

Measures the correlation between two linear dependent variables. Signified by r (rho) and can take on values between -1 to 1. Where -1 signifies perfect negative correlation, 0 is no correlation, and 1 is perfect positive correlation. The previous example of American Crow and American Coot yield an r value of

With our system we are able to successfully identify an ‘unknown’ bird call that has been recorded through the UEM-88 mini-shotgun microphones and interfaced to the PC with the MAYA-44 USB device.

Correlation Coefficient of.6411

Together the AR125 and FR125-III record the ultrasonic bat calls from the field. They then work with the SPECT’R, SCAN’R, and SonoBat software to determine the species of which the call came from. The system has been tested and has successfully identified species of bats in the field.

There are three software programs that we purchased for bat call analysis SPECT’R SCAN’R SonoBat. SPECT’R is used to perform spectral analysis, digital tuning, and hard-disk recording of echolocation calls. SCAN’R is a snapshot characterization and analysis tool which is used to distinguish between bat calls and unwanted noise. SonoBat can be used to analyze and compare high- resolution full-spectrum sonograms of echolocation calls.

Because of the Microcontroller, that we purchased, memory limitations, we are unable to successfully record an audio call with enough data points to be ran through our algorithm. In the future we recommend purchasing a microcontroller with at least 512kB SRAM to get a correctly sampled call plus room to store the code. We also recommend making the system wireless so that the system could be placed in the field and left to record bird calls overnight or for long durations.