Realtime Recognition of Orchestral Instruments

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Timbre perception. Objective Timbre perception and the physical properties of the sound on which it depends Formal definition: ‘that attribute of auditory.
1 CS 391L: Machine Learning: Instance Based Learning Raymond J. Mooney University of Texas at Austin.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
MACHINE LEARNING 9. Nonparametric Methods. Introduction Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
1 Abstract This paper presents a novel modification to the classical Competitive Learning (CL) by adding a dynamic branching mechanism to neural networks.
An Exploration of timbre: its perception, analysis and representation Dr. Deirdre Bolger CNRS-LMS,Paris Invited lecture, Institut für Musikwissenschaft,
Neural Optimization of Evolutionary Algorithm Strategy Parameters Hiral Patel.
DSP-FPGA Based Image Processing System Final Presentation Jessica Baxter  Sam Clanton Simon Fung-Kee-Fung Almaaz Karachi  Doug Keen Computer Integrated.
Harmonically Informed Multi-pitch Tracking Zhiyao Duan, Jinyu Han and Bryan Pardo EECS Dept., Northwestern Univ. Interactive Audio Lab,
Instrument Recognition in Polyphonic Music Jana Eggink Supervisor: Guy J. Brown University of Sheffield
Comparison of machine and human recognition of isolated instrument tones Ichiro Fujinaga McGill University.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
Kumar Srijan ( ) Syed Ahsan( ). Problem Statement To create a Neural Networks based multiclass object classifier which can do rotation,
Cristian Urs and Ben Riveira. Introduction The article we chose focuses on improving the performance of Genetic Algorithms by: Use of predictive models.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
K Nearest Neighbors Classifier & Decision Trees
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Line detection Assume there is a binary image, we use F(ά,X)=0 as the parametric equation of a curve with a vector of parameters ά=[α 1, …, α m ] and X=[x.
Comparison of machine and human recognition of isolated instrument tones Ichiro Fujinaga McGill University.
1 “Genetic Algorithms are good at taking large, potentially huge search spaces and navigating them, looking for optimal combinations of things, solutions.
Pattern Recognition April 19, 2007 Suggested Reading: Horn Chapter 14.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
Genetic Algorithms Abhishek Sharma Piyush Gupta Department of Instrumentation & Control.
Unsupervised Learning Networks 主講人 : 虞台文. Content Introduction Important Unsupervised Learning NNs – Hamming Networks – Kohonen’s Self-Organizing Feature.
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Realtime Recognition of Orchestral Instruments Ichiro Fujinaga McGill University.
Digitization of the Lester S. Levy Collection of Sheet Music Ichiro Fujinaga McGill University with Michael Droettboom, Karl MacMillan, G. Sayeed Choudhury,
Automatic Transcription System of Kashino et al. MUMT 611 Doug Van Nort.
Recognition of Isolated Instrument Tones by Conservatory Students Asha Srinivasan, David Sullivan, and Ichiro Fujinaga Peabody Conservatory of Music Johns.
Debrup Chakraborty Non Parametric Methods Pattern Recognition and Machine Learning.
Instance-Based Learning Evgueni Smirnov. Overview Instance-Based Learning Comparison of Eager and Instance-Based Learning Instance Distances for Instance-Based.
And application to estimating the left-hand fingering (automatic tabulature generation) Caroline Traube Center for Computer Research in Music and Acoustics.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
DOWeR Detecting Outliers in Web Service Requests Master’s Presentation of Christian Blass.
Automatic Classification of Audio Data by Carlos H. L. Costa, Jaime D. Valle, Ro L. Koerich IEEE International Conference on Systems, Man, and Cybernetics.
Action-Grounded Push Affordance Bootstrapping of Unknown Objects
Genetic-Algorithm-Based Instance and Feature Selection
School of Computer Science & Engineering
HyperNetworks Engın denız usta
ARTIFICIAL NEURAL NETWORKS
Instance Based Learning
Ichiro Fujinaga McGill University
Ch8: Nonparametric Methods
Unsupervised Learning Networks
Some Extra Contents Lecturer : Bijay Mishra. What kind of data preprocessing do we need before applying data mining algorithm to any dataset? Data preprocessing.
Gamera A Toolkit for Structured Document Recognition including Music
Speech Recognition Christian Schulze
Modified Crossover Operator Approach for Evolutionary Optimization
Nearest-Neighbor Classifiers
Optical Music Recognition
Instance Based Learning
An Improved Neural Network Algorithm for Classifying the Transmission Line Faults Slavko Vasilic Dr Mladen Kezunovic Texas A&M University.
Department of Electrical Engineering
X.4 Genetic Algorithms Understand the basic design architecture underpinning genetic algorithms Role of the Fitness function Repopulation based on genetic.
Boltzmann Machine (BM) (§6.4)
Ichiro Fujinaga Peabody Conservatory of Music Johns Hopkins University
Review NNs Processing Principles in Neuron / Unit
Realtime Recognition of Orchestral Instruments
Traveling Salesman Problem by Genetic Algorithm
Measuring the Similarity of Rhythmic Patterns
Harmonically Informed Multi-pitch Tracking
Ichiro Fujinaga McGill University
Data Mining CSCI 307, Spring 2019 Lecture 11
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Presentation transcript:

Realtime Recognition of Orchestral Instruments Ichiro Fujinaga and Karl MacMillan Peabody Conservatory of Music Johns Hopkins University

Overview Introduction Lazy learning (exemplar-based learning) Results k-NN classifier Genetic algorithm Features Results Demonstration Conclusions

Introduction Realtime recognition of isolated monophonic orchestral instruments Spectrum analysis by Miller Puckette’s fiddle Adaptive system based on a exemplar-based classifier and a genetic algorithm

Overall Architecture Off-line Live mic Input Sound file Input Data Acquisition & Data Analysis (fiddle) Recognition K-NN Classifier Output Instrument Name Knowledge Base Feature Vectors Genetic Algorithm K-NN Classifier Best Weight Vector Off-line

Exemplar-based categorization Objects are categorized by their similarity to one or more stored examples No abstraction or generalizations, unlike rule-based or prototype-based models of concept formation Can be implemented using k-nearest neighbor classifier Slow and large storage requirements?

K-nearest-neighbor classifier Determine the class of a given sample by its feature vector: Distances between feature vectors of an unclassified sample and previously classified samples are calculated The class represented by the majority of k-nearest neighbors is then assigned to the unclassified sample

Example of k-NN classifier

Example of k-NN classifier

Example of k-NN classifier

Example of k-NN classifier

Distance measures The distance in a N-dimensional feature space between two vectors X and Y can be defined as: A weighted distance can be defined as:

Genetic algorithms Optimization based on biological evolution Maintenance of population using selection, crossover, and mutation Chromosomes = weight vector Fitness function = recognition rate Leave-one-out cross validation

Features Static features (per window) Dynamic features pitch mass or the integral of the curve (zeroth-order moment) centroid (first-order moment) variance (second-order central moment) skewness (third-order central moment) amplitudes of the harmonic partials number of strong harmonic partials spectral irregularity tristimulus Dynamic features means and velocities of static features over time

Data Original source: McGill Master Samples Over 1300 notes from 39 different timbres (23 orchestral instruments) Spectrum analysis by fiddle (2048 points) First 46–232ms of attack (1–9 windows) Each analysis window (46 ms) consists of a list of amplitudes and frequencies of the peaks in the spectra

Results Experiment I SHARC data static features Experiment II fiddle dynamic features Experiment III more features redefinition of attack point

Demonstration Using stored data Using recording Using audience

Conclusions Realtime timbre recognition system Analysis by Puckette’s fiddle Recognition using dynamic features Adaptive recognizer by k-NN classifier enhanced with genetic algorithm A successful implementation of exemplar-based classifier in a time-critical environment

Future research Performer identification Speaker identification Tone-quality analysis Multi-instrument recognition Expert recognition of timbre

Recognition rate for different lengths of analysis window

Comparison with Human Performance