State.of.the.art June 2008 Pedro J. Ponce de León.

Slides:



Advertisements
Similar presentations

Advertisements

Fuzzy Inference Systems. Review Fuzzy Models If then.

Fuzzy Inference and Defuzzification
Advancements in Genetic Programming for Data Classification Dr. Hajira Jabeen Iqra University Islamabad, Pakistan.
Genetic fuzzy controllers for uncertain systems Yonggon Lee and Stanislaw H. Żak Supported by National Science Foundation under grant ECS
Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.
Genetic algorithms applied to multi-class prediction for the analysis of gene expressions data C.H. Ooi & Patrick Tan Presentation by Tim Hamilton.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Fuzzy Expert System.
Feature Selection for Regression Problems
Supervised classification performance (prediction) assessment Dr. Huiru Zheng Dr. Franscisco Azuaje School of Computing and Mathematics Faculty of Engineering.
Data classification based on tolerant rough set reporter: yanan yean.
Neuro-Fuzzy Control Adriano Joaquim de Oliveira Cruz NCE/UFRJ
Selecting Informative Genes with Parallel Genetic Algorithms Deodatta Bhoite Prashant Jain.
Genetic Algorithm What is a genetic algorithm? “Genetic Algorithms are defined as global optimization procedures that use an analogy of genetic evolution.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
EVOLVING ANTS Enrique Areyan School of Informatics and Computing Indiana University January 24, 2012.
Revision Michael J. Watts
A Simple Method to Extract Fuzzy Rules by Measure of Fuzziness Jieh-Ren Chang Nai-Jian Wang.
Evolving a Sigma-Pi Network as a Network Simulator by Justin Basilico.
Fuzzy Rules 1965 paper: “Fuzzy Sets” (Lotfi Zadeh) Apply natural language terms to a formal system of mathematical logic
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Rhythmic Transcription of MIDI Signals Carmine Casciato MUMT 611 Thursday, February 10, 2005.
Fuzzy Inference (Expert) System
GENETIC ALGORITHMS FOR THE UNSUPERVISED CLASSIFICATION OF SATELLITE IMAGES Ankush Khandelwal( ) Vaibhav Kedia( )
Neural-Network-Based Fuzzy Logical Control and Decision System 主講人 虞台文.
1 Appendix D: Application of Genetic Algorithm in Classification Duong Tuan Anh 5/2014.
Fuzzy Inference Systems. Fuzzy inference (reasoning) is the actual process of mapping from a given input to an output using fuzzy logic. The process involves.
Protein motif extraction with neuro-fuzzy optimization Bill C. H. Chang and Author : Bill C. H. Chang and Saman K. Halgamuge Saman K. Halgamuge Adviser.
PART 9 Fuzzy Systems 1. Fuzzy controllers 2. Fuzzy systems and NNs 3. Fuzzy neural networks 4. Fuzzy Automata 5. Fuzzy dynamic systems FUZZY SETS AND FUZZY.
Project 2: Classification Using Genetic Programming Kim, MinHyeok Biointelligence laboratory Artificial.
Statistics 1: Introduction to Probability and Statistics Section 3-2.
Clustering Instructor: Max Welling ICS 178 Machine Learning & Data Mining.
Rotem Golan Department of Computer Science Ben-Gurion University of the Negev, Israel.
Discriminative Frequent Pattern Analysis for Effective Classification By Hong Cheng, Xifeng Yan, Jiawei Han, Chih- Wei Hsu Presented by Mary Biddle.
BASS TRACK SELECTION IN MIDI FILES AND MULTIMODAL IMPLICATIONS TO MELODY gPRAI Pattern Recognition and Artificial Intelligence Group Computer Music Laboratory.
Metamidi: a tool for automatic metadata extraction from MIDI files Tomás Pérez-García, Jose M. Iñesta, and David Rizo Computer Music Laboratory University.
Melody Recognition with Learned Edit Distances Amaury Habrard Laboratoire d’Informatique Fondamentale CNRS Université Aix-Marseille José Manuel Iñesta,
Melody Characterization by a Fuzzy Rule System Pedro J. Ponce de León, David Rizo, José M. Iñesta (DLSI, Univ. Alicante) Rafael Ramírez (MTG, Univ. Pompeu.
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
A shallow description framework for musical style recognition Pedro J. Ponce de León, Carlos Pérez-Sancho and José Manuel Iñesta Departamento de Lenguajes.
Prediction of Interconnect Net-Degree Distribution Based on Rent’s Rule Tao Wan and Malgorzata Chrzanowska- Jeske Department of Electrical and Computer.
Melody track identification in music symbolic files David Rizo Pedro J. Ponce de León Antonio Pertusa Carlos Pérez-Sancho José M. Iñesta The 19th International.
1 CS 391L: Machine Learning: Computational Learning Theory Raymond J. Mooney University of Texas at Austin.
Rhythmic Transcription of MIDI Signals
Artificial Intelligence CIS 342
FUZZY NEURAL NETWORKS TECHNIQUES AND THEIR APPLICATIONS
Stanisław H. Żak School of Electrical and Computer Engineering
Fuzzy logic Introduction 3 Fuzzy Inference Aleksandar Rakić
Chapter 12 Using Descriptive Analysis, Performing
Dr. Unnikrishnan P.C. Professor, EEE
Dr. Unnikrishnan P.C. Professor, EEE
FUZZIFICATION AND DEFUZZIFICATION
DataMining, Morgan Kaufmann, p Mining Lab. 김완섭 2004년 10월 27일
FCTA 2016 Porto, Portugal, 9-11 November 2016 Classification confusion within NEFCLASS caused by feature value skewness in multi-dimensional datasets Jamileh.
Generalizations of Markov model to characterize biological sequences
Statistics 1: Introduction to Probability and Statistics
Fuzzy Logic Colter McClure.
Dr. Unnikrishnan P.C. Professor, EEE
Parameter control Chapter 8.
Nearest Neighbors CSC 576: Data Mining.
Avoid Overfitting in Classification
Fuzzy Inference Systems
Using Bayesian Network in the Construction of a Bi-level Multi-classifier. A Case Study Using Intensive Care Unit Patients Data B. Sierra, N. Serrano,
Parameter control Chapter 8.
Modeling IDS using hybrid intelligent systems
Parameter control Chapter 8.
Presentation transcript:

State.of.the.art June 2008 Pedro J. Ponce de León

Index Melody Characterization by a Genetic Fuzzy System Doing it right Music genre dynamic classification Pre-filtering test data Classifier combination: (SSPR-2006): A Java implementation

Melody Characterization by a Genetic Fuzzy System Motivation To describe with readable linguistic labels what a melody is And learn it automatically from a dataset

Related previous works Ponce de León et. al. [ISMIR’07]: melody characterization by automatically induced rules AvgPitch>=65 and TrackOccupationRate>=0.51 and ….

Track description Set of statistics on properties of note streams: CategoryDescriptors Track info Normalized duration, Number of notes, Occupation rate, Polyphony rate Pitch Highest, Lowest, Mean, Standard deviation Pitch intervals Number of distinct intervals, Largest, Smallest, Mean, Mode, Standard deviation Note durations Longest, Shortest, Mean, Standard deviation Syncopation Number of syncopated notes Class Melody, accompaniment

From crisp rules to fuzzy rules Data representation (descriptors) Crisp descriptor (e.g. avgPitch)  fuzzy variable Fuzzy variable membership function parameters are optimized using a Genetic Fuzzy System Fuzzy Inference System (FIS) + Genetic Algorithm (GA) Rules Fuzzify antecedents: (x > a)  (x IS term 1 ) OR (x IS term 2 ) …

3.2 Fuzzy set parameter optimization: encoding r1r1 r2r2 r3r3 rnrn xyz p0p0 Gene = {p 0,r 1,r 2,r 3,…r n } Only fuzzy set support points considered for optimization the relative values of the vertices are encoded into genes

3.2 Fuzzy set parameter optimization: encoding Fuzzy variables Chromosome

3.2 GA fitness function Chromosome Build fuzzy sets from chromosome Fitness measure: test quality of fuzzy rules system with training corpus: Two possible quality measurements: a) Number of hits: num. of tracks correctly classified by the FIS b) F-measure for class ‘IsMelody=true’

3.3 Crisp Rule fuzzification Fuzzify antecedents: keep the sets whose support holds the inequality CrispFuzzy A B CDA B CD A BCD (x < v)  (x IS C) OR (x IS B) OR (x IS A) (x > v)  (x IS C) OR (x IS D) v x SMC08: ad-hoc rule fuzzification Now: Rule fuzzification based on evolved fuzzy sets (for fitness computation)

Experiments and results

FIS optimization parameters Experiment parameterValues GA population size100,500,1000 GA no. of generations100,500,1000 GA mutation rationone, 0.05, 0.1 GA selection strategyBest one, Best 10%, Best 20% GA fitness metricHit count, F-measure Defuzzification threshold0.5,0.6, 0.7

Fitness evolution (F-measure fitness)

Fitness evolution (F-measure fitness : 1000 generations)

Fitness evolution Similar results for Hit-based fitness… So reduce number of generations: max. 300

FIS optimization parameters Experiment parameterValues GA population size100,200,300 (updated) GA no. of generations100,200,300 (updated) GA mutation rationone, 0.05, 0.1 GA selection strategyBest one, Best 10%, Best 20% GA fitness metricHit count, F-measure Defuzzification threshold0.5,0.6, *2 = 486 experiments!

Ortogonal arrays for experiment design OA(rows,levels,strength) OA(18,3,2): population= evolutions= fitness=Hits Fmeasure Fmeasure mutation= selector=Best Best0.1 Best0.2 threshold= Distributed GA computation es.ua.dlsi.im.experiments.generation.ExperimentDesignGenerator es.ua.dlsi.im.experiments.generation.OrthogonalArrayCombinator

Ortogonal arrays for experiment design population= evolutions= fitness=Hits Fmeasure Fmeasure mutation= selector=Best Best0.1 Best0.2 threshold= es.ua.dlsi.im.experiments.generation.ExperimentDesignGenerator usage: ExperimentDesignGenerator -commandPrefix Prefix for command line in generated script -file Experiment definition file -help Help -latex Output LaTeX file name -latex_with_header Output LaTeX table header -outputDir Output directory -outputFilePrefix Prefix for configuration files -outputScript Output script file name

Results DatasetPrecisionRecallFError rate SMALL (crisp) SMALL (best fuzzy) SMALL (avg. fuzzy) (!) SMALL (avg. fuzzy fixed rules) System parameters learnt with SMALL dataset Large recall -> A lot of FP: Many non-melody tracks fire a rule TODO: Investigate why

Some hints for large recall Unrealistic rules: FUZZIFY AvgAbsInterval TERM high := (90.6, 0.0) (108.6, 1.0) (121.7, 1.0) ; TERM fourth := (65.4, 0.0) (80.3, 1.0) (107.1, 1.0) (111.7, 0.0) ; TERM third := (26.6, 0.0) (32.0, 1.0) (66.2, 1.0) (88.0, 0.0) ; TERM second := (3.2, 0.0) (3.8, 1.0) (29.0, 1.0) (41.6, 0.0) ; TERM unison := (0.0, 0.0) (0.4, 1.0) (4.5, 1.0) (4.5, 0.0) ; END_FUZZIFY Some fuzzy variables have 5 terms, other 3 Range: [0..127] not realistic

Best evolved Fuzzy rule set so far: 1.(((AvgPitch is high or AvgPitch is veryHigh) and ((TrackOccupationRate is average or TrackOccupationRate is high) or TrackOccupationRate is full)) and (AvgAbsInterval is unison)) and (TrackNumNotes is average or TrackNumNotes is high) 2.(((AvgPitch is high or AvgPitch is veryHigh) and ((TrackOccupationRate is average or TrackOccupationRate is high) or TrackOccupationRate is full)) and (TrackPolyphonyRate is low or TrackPolyphonyRate is none)) and (NormalizedDistinctIntervals is highest) 3.((((AvgPitch is high or AvgPitch is veryHigh) and (TrackNumNotes is high)) and (LowestNormalizedDuration is shortest)) and ((ShortestDuration is low or ShortestDuration is average) or ShortestDuration is high)) and (NormalizedDistinctIntervals is highest) 4.…

Melody description example (I) Air In F, Watermusic, Handel (Baroque) Melody (AvgPitch IS NOT veryLow) AND (AvgAbsInterval IS NOT high) AND (AvgAbsInterval IS NOT third) AND (AvgAbsInterval IS NOT fourth) AND (TrackSyncopation IS alot) AND (TrackOccupationRate IS NOT low) AND (LowestNormalizedPitch IS NOT low) AND (DistinctIntervals IS alot) AND (TrackNormalizedDuration IS largest) Most fired rule:

Melody description example (II) There Is No Greater Love, I. Jones (pre-Bop Jazz) Melody (AvgPitch IS NOT veryLow) AND (AvgAbsInterval IS NOT high) AND (AvgAbsInterval IS NOT third) AND (AvgAbsInterval IS NOT fourth) AND (TrackSyncopation IS alot) AND (TrackOccupationRate IS NOT low) AND (LowestNormalizedPitch IS NOT low) AND (DistinctIntervals IS alot) AND (TrackNormalizedDuration IS largest) Most fired rule:

Music genre dynamic classification Pre-filtering test data Based on number of notes: The fewer notes in a fragment, the less robust its statistical description Proof-of-concept: jvc melodies for training jvc2 for test NN classifier

Music genre dynamic classification Pre-filtering test data on the number of notes per window w=4,8,16,32,64,960 <= threshold < 30

Music genre dynamic classification Preliminary investigation on the number of notes vs. window length suggest a ratio between 1.5 and 2 w Non-filtered Filtered (min = 1.5*w) Filtered (min = 2*w) Accuracy Non-significant improvements

Classifier combination: a Java implementation es.ua.dlsi.im.pr.* Prediction prediction weight getPrediction() getWeight() isWeighted() PredictionCombiner PluralityVote() : Prediction SimpleWeightedVote() : Prediction RescaledWV() : Prediction BestWorstVW() : Prediction QuadraticBWVW() : Prediction WeightedMajority() : Prediction

Classifier combination: a Java implementation es.ua.dlsi.im.windows.* Instances DynamicCombiner getPredictions() : Prediction[] getPredictionByPluralityVote() : Prediction getPredictionBySimpleWeightedVote() : Prediction … Classifier