Realtime Recognition of Orchestral Instruments Ichiro Fujinaga McGill University.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Data Mining Classification: Alternative Techniques
Data Mining Classification: Alternative Techniques
Timbre perception. Objective Timbre perception and the physical properties of the sound on which it depends Formal definition: ‘that attribute of auditory.
1 CS 391L: Machine Learning: Instance Based Learning Raymond J. Mooney University of Texas at Austin.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
Instance Based Learning
Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.
Lecture 6 K-Nearest Neighbor Classifier
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
Navneet Goyal. Instance Based Learning  Rote Classifier  K- nearest neighbors (K-NN)  Case Based Resoning (CBR)
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
K nearest neighbor and Rocchio algorithm
MACHINE LEARNING 9. Nonparametric Methods. Introduction Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
Mouse Movement Biometrics, Pace University, Fall'20071 Mouse Movement Biometrics Fall 2007 Capstone -Team Members Rafael Diaz Michael Lampe Nkem Ajufor.
Data Mining Techniques Outline
Data classification based on tolerant rough set reporter: yanan yean.
INSTANCE-BASE LEARNING
Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples Committee: Eugene Fink Lihua Li Dmitry B. Goldgof Hong Tang.
Neural Optimization of Evolutionary Algorithm Strategy Parameters Hiral Patel.
CS Instance Based Learning1 Instance Based Learning.
Spam? Not any more !! Detecting spam s using neural networks ECE/CS/ME 539 Project presentation Submitted by Sivanadyan, Thiagarajan.
DSP-FPGA Based Image Processing System Final Presentation Jessica Baxter  Sam Clanton Simon Fung-Kee-Fung Almaaz Karachi  Doug Keen Computer Integrated.
Optical Music Recognition Ichiro Fujinaga McGill University 2003.
Comparison of machine and human recognition of isolated instrument tones Ichiro Fujinaga McGill University.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
TEMPLATE DESIGN © Detecting User Activities Using the Accelerometer on Android Smartphones Sauvik Das, Supervisor: Adrian.
K Nearest Neighborhood (KNNs)
Cristian Urs and Ben Riveira. Introduction The article we chose focuses on improving the performance of Genetic Algorithms by: Use of predictive models.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Comparison of machine and human recognition of isolated instrument tones Ichiro Fujinaga McGill University.
Machine Learning in Computer Games Marc Ponsen 11/29/04.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Performance Comparison of Speaker and Emotion Recognition
Chapter1: Introduction Chapter2: Overview of Supervised Learning
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.
Digitization of the Lester S. Levy Collection of Sheet Music Ichiro Fujinaga McGill University with Michael Droettboom, Karl MacMillan, G. Sayeed Choudhury,
Recognition of Isolated Instrument Tones by Conservatory Students Asha Srinivasan, David Sullivan, and Ichiro Fujinaga Peabody Conservatory of Music Johns.
Chenn-Jung Huang a*, Yi-Ju Yang b, Dian-Xiu Yang a, You-Jia Chen a a Department of Computer and Information Science b Institute of Ecology and Environmental.
Eick: kNN kNN: A Non-parametric Classification and Prediction Technique Goals of this set of transparencies: 1.Introduce kNN---a popular non-parameric.
Fuzzy Pattern Recognition. Overview of Pattern Recognition Pattern Recognition Procedure Feature Extraction Feature Reduction Classification (supervised)
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
ISA Kim Hye mi. Introduction Input Spectrum data (Protein database) Peptide assignment Peptide validation manual validation PeptideProphet.
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Classification of tissues and samples 指導老師:藍清隆 演講者:張許恩、王人禾.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
Evolutionary Design of the Closed Loop Control on the Basis of NN-ANARX Model Using Genetic Algoritm.
March 1, 2016Introduction to Artificial Intelligence Lecture 11: Machine Evolution 1 Let’s look at… Machine Evolution.
DOWeR Detecting Outliers in Web Service Requests Master’s Presentation of Christian Blass.
Automatic Classification of Audio Data by Carlos H. L. Costa, Jaime D. Valle, Ro L. Koerich IEEE International Conference on Systems, Man, and Cybernetics.
HyperNetworks Engın denız usta
Instance Based Learning
Ichiro Fujinaga McGill University
Instance Based Learning
An Improved Neural Network Algorithm for Classifying the Transmission Line Faults Slavko Vasilic Dr Mladen Kezunovic Texas A&M University.
Department of Electrical Engineering
X.4 Genetic Algorithms Understand the basic design architecture underpinning genetic algorithms Role of the Fitness function Repopulation based on genetic.
Boltzmann Machine (BM) (§6.4)
Ichiro Fujinaga Peabody Conservatory of Music Johns Hopkins University
Realtime Recognition of Orchestral Instruments
Realtime Recognition of Orchestral Instruments
Ichiro Fujinaga McGill University
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Presentation transcript:

Realtime Recognition of Orchestral Instruments Ichiro Fujinaga McGill University

Overview  Introduction  Lazy learning (exemplar-based learning) k-NN classifier Genetic algorithm Features  Results  Conclusions

Introduction  Realtime recognition of isolated monophonic orchestral instruments  Spectrum analysis by Miller Puckette’s fiddle~  Adaptive system based on a exemplar- based classifier and a genetic algorithm

Overall Architecture Data Acquisition & Data Analysis (fiddle) Recognition K-NN Classifier Output Instrument Name Knowledge Base Feature Vectors Genetic Algorithm K-NN Classifier Best Weight Vector Live mic Input Sound file Input Off-line

Exemplar-based learning The exemplar-based learning model is based on the idea that objects are categorized by their similarity to one or more stored examples There is much evidence from psychological studies to support exemplar-based categorization by humans This model differs both from rule-based or prototype- based (neural nets) models of concept formation in that it assumes no abstraction or generalizations of concepts This model can be implemented using k-nearest neighbor classifier and is further enhanced by application of a genetic algorithm

Exemplar-based categorization  Objects are categorized by their similarity to one or more stored examples  No abstraction or generalizations, unlike rule-based or prototype-based models of concept formation  Can be implemented using k-nearest neighbor classifier  Slow and large storage requirements?

Exemplar-based learning The exemplar-based learning model is based on the idea that objects are categorized by their similarity to one or more stored examples There is much evidence from psychological studies to support exemplar-based categorization by humans This model differs both from rule-based or prototype- based (neural nets) models of concept formation in that it assumes no abstraction or generalizations of concepts This model can be implemented using k-nearest neighbor classifier and is further enhanced by application of a genetic algorithm

K-nearest-neighbor classifier  Determine the class of a given sample by its feature vector: Distances between feature vectors of an unclassified sample and previously classified samples are calculated The class represented by the majority of k- nearest neighbors is then assigned to the unclassified sample

Example of k-NN classifier

Distance measures  The distance in a N-dimensional feature space between two vectors X and Y can be defined as:  A weighted distance can be defined as:

Genetic algorithms  Optimization based on biological evolution  Maintenance of population using selection, crossover, and mutation  Chromosomes = weight vectors  Fitness function = recognition rate  Leave-one-out cross validation

Features  Static features (per window) pitch mass or the integral of the curve (zeroth-order moment) centroid (first-order moment) variance (second-order central moment) skewness (third-order central moment) amplitudes of the harmonic partials number of strong harmonic partials spectral irregularity tristimulus  Dynamic features means and velocities of static features over time

Data  Original source: McGill Master Samples  Over 1300 notes from 39 different timbres (23 orchestral instruments)  Spectrum analysis by fiddle (2048 points)  First 46–232ms of attack (1–9 windows)  Each analysis window (46 ms) consists of a list of amplitudes and frequencies of the peaks in the spectra

Results  Experiment I SHARC data static features  Experiment II fiddle dynamic features  Experiment III more features redefinition of attack point

Conclusions  Realtime timbre recognition system  Analysis by Puckette’s fiddle  Recognition using dynamic features  Adaptive recognizer by k-NN classifier enhanced with genetic algorithm  A successful implementation of exemplar-based classifier in a time- critical environment

Future research  Performer identification  Speaker identification  Tone-quality analysis  Multi-instrument recognition  Expert recognition of timbre

Recognition rate for different lengths of analysis window

Comparison with Human Performance