Beehive Audio Source Separation

Slides:



Advertisements
Similar presentations
Change-Point Detection Techniques for Piecewise Locally Stationary Time Series Michael Last National Institute of Statistical Sciences Talk for Midyear.
Advertisements

Online PLCA for Real-Time Semi-supervised Source Separation Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis 1. EECS Department, Northwestern University.
Digital Coding of Analog Signal Prepared By: Amit Degada Teaching Assistant Electronics Engineering Department, Sardar Vallabhbhai National Institute of.
Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,
Patch-based Image Deconvolution via Joint Modeling of Sparse Priors Chao Jia and Brian L. Evans The University of Texas at Austin 12 Sep
Hidden Markov Models Reading: Russell and Norvig, Chapter 15, Sections
1 Applications on Signal Recovering Miguel Argáez Carlos A. Quintero Computational Science Program El Paso, Texas, USA April 16, 2009.
G. Valenzise *, L. Gerosa, M. Tagliasacchi *, F. Antonacci *, A. Sarti * IEEE Int. Conf. On Advanced Video and Signal-based Surveillance, 2007 * Dipartimento.
Bayesian Nonparametric Matrix Factorization for Recorded Music Reading Group Presenter: Shujie Hou Cognitive Radio Institute Friday, October 15, 2010 Authors:
Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis 1. EECS Department, Northwestern University 2. Advanced Technology Labs, Adobe Systems Inc. 3. University.
A Computationally Efficient Approach for 2D-3D Image Registration Juri Minxha Medical Image Analysis Professor Benjamin Kimia Spring 2011 Brown University.
Volkan Cevher, Marco F. Duarte, and Richard G. Baraniuk European Signal Processing Conference 2008.
Independent Component Analysis (ICA)
CS 589 Information Risk Management 6 February 2007.
Dimensional reduction, PCA
Speech Enhancement Based on a Combination of Spectral Subtraction and MMSE Log-STSA Estimator in Wavelet Domain LATSI laboratory, Department of Electronic,
COGNITIVE AND NEURAL SYSTEMS, BOSTON UNIVERSITY Boston, Massachusetts Madhusudana Shashanka 17 August 2007 Dissertation Defense Latent.
Sparsity-Aware Adaptive Algorithms Based on Alternating Optimization and Shrinkage Rodrigo C. de Lamare* + and Raimundo Sampaio-Neto * + Communications.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Machine Learning for Signal Processing Latent Variable Models and Signal Separation Class Oct Oct /18797.
Compressed Sensing Based UWB System Peng Zhang Wireless Networking System Lab WiNSys.
Automatic detection of microchiroptera echolocation calls from field recordings using machine learning algorithms Mark D. Skowronski and John G. Harris.
Brian King, Advised by Les Atlas Electrical Engineering, University of Washington This research was funded by Air Force Office.
Computer Science, Software Engineering & Robotics Workshop, FGCU, April 27-28, 2012 Fault Prediction with Particle Filters by David Hatfield mentors: Dr.
Basics of Neural Networks Neural Network Topologies.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Forward-Scan Sonar Tomographic Reconstruction PHD Filter Multiple Target Tracking Bayesian Multiple Target Tracking in Forward Scan Sonar.
1 11th DSP Workshop Taos Ski Valley, NM 2004 Centered Discrete Fractional Fourier Transform & Linear Chirp Signals Balu Santhanam & Juan G. Vargas- Rubio.
School of Electrical & Computer Engineering Image Denoising Using Steerable Pyramids Alex Cunningham Ben Clarke Dy narath Eang ECE November 2008.
A Sparse Non-Parametric Approach for Single Channel Separation of Known Sounds Paris Smaragdis, Madhusudana Shashanka, Bhiksha Raj NIPS 2009.
Learning to Sense Sparse Signals: Simultaneous Sensing Matrix and Sparsifying Dictionary Optimization Julio Martin Duarte-Carvajalino, and Guillermo Sapiro.
MACHINE LEARNING 8. Clustering. Motivation Based on E ALPAYDIN 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Classification problem:
Basic Time Series Analyzing variable star data for the amateur astronomer.
Non-negative Matrix Factor Deconvolution; Extracation of Multiple Sound Sources from Monophonic Inputs International Symposium on Independent Component.
Sequential Monte-Carlo Method -Introduction, implementation and application Fan, Xin
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 3) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Doc.: IEEE /0018r0 Submission March 2006 Steve Berger, TEM ConsultingSlide 1 Exploration of Structure for IEEE P Recommended Practice for.
Doc.: IEEE /0125r0 Submission July 2006 Slide 1 Huawei Interference Detection for Sensing IEEE P Wireless RANs Date: Authors:
1 Electrical and Computer Engineering Binghamton University, State University of New York Electrical and Computer Engineering Binghamton University, State.
Flexible Speaker Adaptation using Maximum Likelihood Linear Regression Authors: C. J. Leggetter P. C. Woodland Presenter: 陳亮宇 Proc. ARPA Spoken Language.
Random signals Honza Černocký, ÚPGM.
Mohsen Riahi Manesh and Dr. Naima Kaabouch
Paradoxes on Instantaneous Frequency
Aleksey S. Rubel1, Vladimir V. Lukin1,
Compressive Coded Aperture Video Reconstruction
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
A presentation on Basics of Speech Recognition Systems
Machine Learning Basics
Detecting the Learning Value of Items In a Randomized Problem Set
Signals and Systems Networks and Communication Department Chapter (1)
Submission Title: [Robust Ranging Algorithm for UWB radio]
The Sound of the Original Sentences
Analysis of Audio Using PCA
Submission Title: [Robust Ranging Algorithm for UWB radio]
Fixed, Random and Mixed effects
Interference Detection for Sensing
EE 492 ENGINEERING PROJECT
Ian C. Wong and Brian L. Evans ICASSP 2007 Honolulu, Hawaii
Human Speech Communication
Submission Title: [Robust Ranging Algorithm for UWB radio]
A Summary of Inter-WRAN Coexistence Schemes
Comparison of CBP PHY Modes
Clustering Gene Expression Data Using Independent Component Analysis
Deep neural networks for spike sorting: exploring options
Wavelet Analysis Objectives: To Review Fourier Transform and Analysis
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Presenter: Shih-Hsiang(士翔)
Clustering methods: Part 10
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Beehive Audio Source Separation Dakota Murray

What is Audio? Probably zoom in on the frequencies less than 2500 Hz. Just a bunch of numbers! Time domain Frequency Domain Spectrogram Probably zoom in on the frequencies less than 2500 Hz.

The Spectrogram Each column is a frequency spectrum at a point in time Discretized. Tradeoff between number of time frames, and number of frequency bins Discretized? Tradeoff between time resolution and frequency resolution.

Signal Separation Separate sources from a mixture Paris Smaragdis Signal Processing researcher Approached signal separation probabilistically He models each source as a mixture of distributions over the frequency spectrum. The resulting spectrogram is a mixture of these sources and noise.

Smaragdis’s Approach 1 A source can be represented as a stochastic “picker” randomly selecting from a set of jars Each jar is a component of the source and each has different distributions of marbles inside A frequency is written on each marble Given a component z, what is the probability of the picker pulling frequency f

Smaragdis’s Approach 2 Mixtures of sources add another variable, s, to the model. If there are two sources, our picker is now selecting from two sets of jars, each set with their own distributions Given a source s, and component z, what is the probability of the picker pulling frequency f

Original Signals Voice Chimes Separated Signals

Training Target   x

Training Interference   x

Step 2: Separation Decompose the mixture of our known sources in terms of the contributions of the basis vectors for each source In other words, find out how each source contributes to the mixture.

Step 3: Reconstruction * = * =

Variation on the Process Supervised We have an example of target and interference Semi-Supervised We only know the target. Interference could be anything Train the target as normal. Learn Interference from our mixture. Interference represents everything that isnt the target. Fixed Target: Target Vectors can not change Target As Prior: Target Vectors are allowed to change

The Importance of Honey Bees Honeybees are dying from Colony Collapse Disorder Honeybee pollination results in ⅓ of the food humans consume. An agricultural, ecological, and economic issue. Data is useful. Good data is scarce. Often full of interference from traffic, weather, and electronics Cite all images and equations from somewhere else.

Bee Hive Audio Considerations Can’t be certain what type of interference to expect. Should use a Semi-Supervised method of training the interference basis vectors Don’t want to assume we know exactly what the bees will sound like. Use a Target as Prior method of training the interference, thus allowing the target to change. Might be getting a lot of data. Must optimize parameters for best performance to execution time ratio Testing should involve more real world scenarios

Problems To get good results from a complicated signal, the algorithm is time consuming The training process only find local minimum, may not produce idea set of basis vectors. Has difficulty separating sources with occupy similar frequency ranges.

Direction for Future Testing, testing, testing, and more testing (optimizing parameters, trying out different testing scenarios) Gathering more bee hive data. App State students currently working on Low Cost hive monitoring hardware More real-world scenarios Work with other types of data, not just bees.

Acknowledgements Paris Smaragdis for paving the way Dr. Parry for his mentoring, assistance, and wealth of knowledge Dr. T and the Rest of the App State Department of Computer Science for providing the encouragement and opportunity NC State for hosting this event!

References Bhiksha Raj and Paris Smaragdis, "Latent variable decomposition of spectrograms for single channel speaker separation," in Proc. of IEEE Workshop on Applications of Signal Proc. to Audio and Acoustics, New Paltz, USA, 2005. P. Smaragdis , B. Raj and M. V. Shashanka "Supervised and semi-supervised separation of sounds from single-channel mixtures", Proc. 7th Int. Conf. Ind. Compon. Anal. Signal Separation (ICA\'07), pp.414 -421 2007 M. V. S. Shashanka , B. Raj and P. Smaragdis "Sparse overcomplete decomposition for single channel speaker separation", Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP\'07), vol. 2, pp.II-641 -II-644 2007