Presentation is loading. Please wait.

Presentation is loading. Please wait.

Are we still talking about diversity in classifier ensembles? Ludmila I Kuncheva School of Computer Science Bangor University, UK.

Similar presentations


Presentation on theme: "Are we still talking about diversity in classifier ensembles? Ludmila I Kuncheva School of Computer Science Bangor University, UK."— Presentation transcript:

1 Are we still talking about diversity in classifier ensembles? Ludmila I Kuncheva School of Computer Science Bangor University, UK

2 Publications (580) Citations (4594) “CLASSIFIER ENSEMBLE DIVERSITY” Search on 10 Sep 2014

3 MULTIPLE CLASSIFIER SYSTEMS 30 INT JOINT CONF ON NEURAL NETWORKS (IJCNN) 22 PATTERN RECOGNITION 17 NEUROCOMPUTING 14 EXPERT SYSTEMS WITH APPLICATIONS 13 INFORMATION SCIENCES 12 APPLIED SOFT COMPUTING 11 PATTERN RECOGNITION LETTERS 10 INFORMATION FUSION 9 IEEE INT JOINT CONF ON NEURAL NETWORKS 9 KNOWLEDGE-BASED SYSTEMS 7 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 7 INT J OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE 6 MACHINE LEARNING 5 IEEE TRANSACTIONS ON NEURAL NETWORKS 5 JOURNAL OF MACHINE LEARNING RESEARCH 5 APPLIED INTELLIGENCE 4 INTELLIGENT DATA ANALYSIS 4 IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION 4 ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING 4 NEURAL INFORMATION PROCESSING 4 580 papers, 335 journals / conferences

4 MULTIPLE CLASSIFIER SYSTEMS 30 INT JOINT CONF ON NEURAL NETWORKS (IJCNN) 22 PATTERN RECOGNITION 17 NEUROCOMPUTING 14 EXPERT SYSTEMS WITH APPLICATIONS 13 INFORMATION SCIENCES 12 APPLIED SOFT COMPUTING 11 PATTERN RECOGNITION LETTERS 10 INFORMATION FUSION 9 IEEE INT JOINT CONF ON NEURAL NETWORKS 9 KNOWLEDGE-BASED SYSTEMS 7 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 7 INT J OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE 6 MACHINE LEARNING 5 IEEE TRANSACTIONS ON NEURAL NETWORKS 5 JOURNAL OF MACHINE LEARNING RESEARCH 5 APPLIED INTELLIGENCE 4 INTELLIGENT DATA ANALYSIS 4 IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION 4 ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING 4 NEURAL INFORMATION PROCESSING 4 580 papers, 335 journals / conferences

5 Where in the world are we? China 140 UK 68 USA 63 Spain 55 Brazil 41 Canda 32 Poland 28 Iran 23 Italy 19... France 11

6 Where in the world are we? China 140 UK 68 USA 63 Spain 55 Brazil 41 Canda 32 Poland 28 Iran 23 Italy 19... France 11 Laurent HEUTTE Professor of Computer Science, University of Rouen, France

7 Are we still talking about diversity in classifier ensembles? Apparently yes...

8 That elusive diversity... classifier feature values (object description) classifier class label “combiner” Classifier ensemble

9 That elusive diversity... Classifier 2 Classifier 1 correct wrong correct wrong independent outputs ≠ independent errors hence, use ORACLE outputs Number of instances labelled correctly by classifier 1 and mislabelled by classifier 2

10 That elusive diversity... Classifier 2 Classifier 1 correct wrong correct wrong Q kappa correlation (rho) disagreement double fault...

11 That elusive diversity... SEVENTY SIX !!!

12 Do we need more “NEW” pairwise diversity measures? Looks like we don’t...

13 proposed by Margineantu and Dietterich in 1997 visualise individual accuracy and diversity in a 2-dimensional plot have been used to decide which ensemble members can be pruned without much harm to the overall performance Kappa-error diagrams

14 Adaboost 75.0% Bagging 77.0% Random subspace 80.9% Random oracle 83.3% Rotation Forest 84.7%  sonar data (UCI): 260 instances, 60 features, 2 classes, ensemble size L = 11 classifiers, base model – tree C4.5 Example Kuncheva L.I., A bound on kappa-error diagrams for analysis of classifier ensembles, IEEE Transactions on Knowledge and Data Engineering, 2013, 25 (3), 494-501 (DOI: 10.1109/TKDE.2011.234).

15 correctwrong C1 correct ab wrong cd C2 error kappa = (observed – chance)/(1-chance) Kappa-error diagrams

16 bound (tight) bound (tight) error kappa Kappa-error diagrams

17 error kappa Kappa-error diagrams – simulated ensembles L = 3

18 error kappa Kappa-error diagrams – real data L = 11

19 error kappa Real data: 77,422,500 pairs of classifiers room for improvement

20 Is there space for new classifier ensembles? Looks like yes...

21 Good and Bad diversity Diversity is not MONOTONICALLY related to ensemble accuracy

22 Good and Bad diversity 3 classifiers: A, B, C 15 objects, wrong vote, correct vote individual accuracy = 10/15 = 0.667 P = ensemble accuracy independent classifiers P = 11/15 = 0.733 identical classifiers P = 10/15 = 0.667 dependent classifiers 1 P = 7/15 = 0.467 dependent classifiers 2 P = 15/15 = 1.000 ABCABC ABCABC ABCABC ABCABC MAJORITY VOTE

23 Good and Bad diversity 3 classifiers: A, B, C 15 objects, wrong vote, correct vote individual accuracy = 10/15 = 0.667 P = ensemble accuracy independent classifiers P = 11/15 = 0.733 identical classifiers P = 10/15 = 0.667 dependent classifiers 1 P = 7/15 = 0.467 dependent classifiers 2 P = 15/15 = 1.000 ABCABC ABCABC ABCABC ABCABC MAJORITY VOTE Good diversity Bad diversity

24 Good and Bad diversity Data set Z Ensemble, L = 7 classifiers   Are these outputs diverse?

25 Good and Bad diversity Data set Z Ensemble, L = 7 classifiers    How about these?

26 Good and Bad diversity Data set Z Ensemble, L = 7 classifiers    3 vs 4... Can’t be more diverse, really...

27 Good and Bad diversity Data set Z Ensemble, L = 7 classifiers    MAJORITY VOTE  Good diversity

28 Good and Bad diversity Data set Z Ensemble, L = 7 classifiers    MAJORITY VOTE Bad diversity  

29 Good and Bad diversity maj  maj  Decomposition of the Majority Vote Error Individual error Subtract GOOD diversity Add BAD diversity Brown G., L.I. Kuncheva, "Good" and "bad" diversity in majority vote ensembles, Proc. Multiple Classifier Systems (MCS'10), Cairo, Egypt, LNCS 5997, 2010, 124-133.

30 Good and Bad diversity     Note that diversity quantity is 3 in both cases

31 Ensemble Margin     POSITIVE NEGATIVE

32 Ensemble Margin Average margin However, nearly all diversity measures are functions of Average absolute margin or Average square margin Margin has no sign...

33 Ensemble Margin

34 Diversity is not MONOTONICALLY related to ensemble accuracy So, STOP LOOKING for a monotonic relationship!!!

35 Conclusions 1.Beware! Overflow of diversity measures! 2.In theory, there is some room for better classifier ensembles. 3.Diversity is not monotonically related to ensemble accuracy, hence larger diversity does not necessarily mean better accuracy. Directly engineered or heuristic? – up to you

36 36

37 37


Download ppt "Are we still talking about diversity in classifier ensembles? Ludmila I Kuncheva School of Computer Science Bangor University, UK."

Similar presentations


Ads by Google