Richard Jensen and Chris Cornelis Chris Cornelis Chris Cornelis Ghent University, Belgium Richard Jensen Richard Jensen Aberystwyth University, UK Fuzzy-Rough.

Slides:



Advertisements
Similar presentations
Fuzzy-Rough Feature Significance for Fuzzy Decision Trees Advanced Reasoning Group Department of Computer Science The University of Wales, Aberystwyth.
Advertisements

PHONE MODELING AND COMBINING DISCRIMINATIVE TRAINING FOR MANDARIN-ENGLISH BILINGUAL SPEECH RECOGNITION Yanmin Qian, Jia Liu ICASSP2010 Pei-Ning Chen CSIE.
Rough Sets in Data Mining CSE5610 Intelligent Software Systems Semester 1, 2006.
DECISION TREES. Decision trees  One possible representation for hypotheses.
_ Rough Sets. Basic Concepts of Rough Sets _ Information/Decision Systems (Tables) _ Indiscernibility _ Set Approximation _ Reducts and Core _ Rough Membership.
Feature Grouping-Based Fuzzy-Rough Feature Selection Richard Jensen Neil Mac Parthaláin Chris Cornelis.
Affinity Set and Its Applications Moussa Larbani and Yuh-Wen Chen.
Rulebase Expert System and Uncertainty. Rule-based ES Rules as a knowledge representation technique Type of rules :- relation, recommendation, directive,
Combining Classification and Model Trees for Handling Ordinal Problems D. Anyfantis, M. Karagiannopoulos S. B. Kotsiantis, P. E. Pintelas Educational Software.
Multiple Criteria Decision Analysis with Game-theoretic Rough Sets Nouman Azam and JingTao Yao Department of Computer Science University of Regina CANADA.
Huge Raw Data Cleaning Data Condensation Dimensionality Reduction Data Wrapping/ Description Machine Learning Classification Clustering Rule Generation.
A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts 04 10, 2014 Hyun Geun Soo Bo Pang and Lillian Lee (2004)
Rough Sets Theory Speaker:Kun Hsiang.
WRSTA, 13 August, 2006 Rough Sets in Hybrid Intelligent Systems For Breast Cancer Detection By Aboul Ella Hassanien Cairo University, Faculty of Computer.
Signal Processing Institute Swiss Federal Institute of Technology, Lausanne 1 Feature selection for audio-visual speech recognition Mihai Gurban.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
Lab4 CPIT 440 Data Mining and Warehouse.
Data classification based on tolerant rough set reporter: yanan yean.
Mining Hierarchical Decision Rules from Hybrid Data with Categorical and Continuous Valued Attributes Miao Duoqian, Qian Jin, Li Wen, Zhang Zehua.
ACM SAC’06, DM Track Dijon, France “The Impact of Sample Reduction on PCA-based Feature Extraction for Supervised Learning” by M. Pechenizkiy,
August 2005RSFDGrC 2005, Regina, Canada 1 Feature Selection Based on Relative Attribute Dependency: An Experimental Study Jianchao Han 1, Ricardo Sanchez.
Efficient Join Processing over Uncertain Data - By Reynold Cheng, et all. Presented By Lydia & Usha.
Support Vector Regression David R. Musicant and O.L. Mangasarian International Symposium on Mathematical Programming Thursday, August 10, 2000
+ Doing More with Less : Student Modeling and Performance Prediction with Reduced Content Models Yun Huang, University of Pittsburgh Yanbo Xu, Carnegie.
Richard Jensen and Qiang Shen Prof Qiang Shen Aberystwyth University, UK Dr. Richard Jensen Aberystwyth University, UK Interval-valued.
Richard Jensen, Chris Cornelis and Qiang Shen Dr. Chris Cornelis Ghent University, Belgium Dr. Richard Jensen Aberystwyth University,
ROUGH SET THEORY AND FUZZY LOGIC BASED WAREHOUSING OF HETEROGENEOUS CLINICAL DATABASES Yiwen Fan.
Selective Sampling on Probabilistic Labels Peng Peng, Raymond Chi-Wing Wong CSE, HKUST 1.
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
A Multivariate Biomarker for Parkinson’s Disease M. Coakley, G. Crocetti, P. Dressner, W. Kellum, T. Lamin The Michael L. Gargano 12 th Annual Research.
A Genetic Algorithms Approach to Feature Subset Selection Problem by Hasan Doğu TAŞKIRAN CS 550 – Machine Learning Workshop Department of Computer Engineering.
On Applications of Rough Sets theory to Knowledge Discovery Frida Coaquira UNIVERSITY OF PUERTO RICO MAYAGÜEZ CAMPUS
Fuzzy-rough data mining Richard Jensen Advanced Reasoning Group University of Aberystwyth
CSE & CSE6002E - Soft Computing Winter Semester, 2011 More Rough Sets.
Classification and Ranking Approaches to Discriminative Language Modeling for ASR Erinç Dikici, Murat Semerci, Murat Saraçlar, Ethem Alpaydın 報告者:郝柏翰 2013/01/28.
05/06/2007CRIS 2008 Personalizing Information Retrieval in CRISs with Fuzzy Sets and Rough Sets Germán Hurtado Martín 1,2 Chris Cornelis 2 Helga Naessens.
3. Rough set extensions  In the rough set literature, several extensions have been developed that attempt to handle better the uncertainty present in.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
CSC 196k Semester Project: Instance Based Learning
1 Chapter 6. Classification and Prediction Overview Classification algorithms and methods Decision tree induction Bayesian classification Lazy learning.
Data Reduction via Instance Selection Chapter 1. Background KDD  Nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable.
Richard Jensen, Andrew Tuson and Qiang Shen Qiang Shen Aberystwyth University, UK Richard Jensen Aberystwyth University, UK Andrew Tuson City University,
Is Sampling Useful in Data Mining? A Case in the Maintenance of Discovered Association Rules S.D. Lee, David W. Cheung, Ben Kao The University of Hong.
Intelligent Database Systems Lab Advisor : Dr.Hsu Graduate : Keng-Wei Chang Author : Lian Yan and David J. Miller 國立雲林科技大學 National Yunlin University of.
Consensus Group Stable Feature Selection
Outline K-Nearest Neighbor algorithm Fuzzy Set theory Classifier Accuracy Measures.
1 Fuzzy Versus Quantitative Association Rules: A Fair Data-Driven Comparison Shih-Ming Bai and Shyi-Ming Chen Department of Computer Science and Information.
Chaoyang University of Technology Clustering web transactions using rough approximation Source : Fuzzy Sets and Systems 148 (2004) 131–138 Author : Supriya.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Direct mining of discriminative patterns for classifying.
Data Mining with Big Data IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014 Xiangyu Cai ( )
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
Multi-objective evolutionary generation of mamdani fuzzy rule-based systems based on rule and condition selection International Workshop On Genetic And.
FUZZ-IEEE Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation Yixin Chen and James Z. Wang The Pennsylvania State.
Max-Confidence Boosting With Uncertainty for Visual tracking WEN GUO, LIANGLIANG CAO, TONY X. HAN, SHUICHENG YAN AND CHANGSHENG XU IEEE TRANSACTIONS ON.
Instance-Based Learning Evgueni Smirnov. Overview Instance-Based Learning Comparison of Eager and Instance-Based Learning Instance Distances for Instance-Based.
- Sachin Singh. Data Mining - Concepts Extracting meaningful knowledge from huge chunk of ‘raw’ data. Types –Association –Classification –Temporal.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
國立雲林科技大學 National Yunlin University of Science and Technology Semantics-preserving dimensionality reduction rough and fuzzy rough based approaches Author.
CLASS INHERITANCE TREE (CIT)
Queensland University of Technology
Data Driven Resource Allocation for Distributed Learning
Sahar Cherif, Nesrine Baklouti , Adel Alimi and Vaclav Snasel
An Enhanced Support Vector Machine Model for Intrusion Detection
Methods of modulation and demodulation in OFDM system: implement and analysis for BER performance, hardware performance, and hardware complexity Hsin-Yu.
Introduction to Data Mining, 2nd Edition by
A Modified Naïve Possibilistic Classifier for Numerical Data
A Unifying View on Instance Selection
Dependencies in Structures of Decision Tables
Color image noise removal algorithm utilizing hybrid vector filtering
Presentation transcript:

Richard Jensen and Chris Cornelis Chris Cornelis Chris Cornelis Ghent University, Belgium Richard Jensen Richard Jensen Aberystwyth University, UK Fuzzy-Rough Instance Selection

Richard Jensen and Chris Cornelis Outline The importance of instance selection Rough set theory Fuzzy-rough sets Fuzzy-rough instance selection Experimentation Conclusion

Richard Jensen and Chris Cornelis Knowledge discovery The problem of too much data Requires storage Intractable for data mining algorithms Removing data that is noisy or irrelevant Instance selection

Richard Jensen and Chris Cornelis Rough set theory Rx is the set of all points that are indiscernible with point x Upper Approximation Set A Lower Approximation Equivalence class Rx

Richard Jensen and Chris Cornelis Fuzzy-rough sets Approximate equality Handle real-valued features via fuzzy tolerance relations instead of crisp equivalence Better noise and uncertainty handling Focus has been on feature selection, not instance selection

Richard Jensen and Chris Cornelis Fuzzy-rough sets Parameterized relation Fuzzy-rough definitions:

Richard Jensen and Chris Cornelis Instance selection: basic idea Not needed Remove objects to keep the underlying approximations unchanged

Richard Jensen and Chris Cornelis Instance selection: basic idea Remove objects to keep the underlying approximations unchanged

Richard Jensen and Chris Cornelis FRIS-I

Richard Jensen and Chris Cornelis FRIS-II

Richard Jensen and Chris Cornelis FRIS-III

Richard Jensen and Chris Cornelis Experimentation: setup

Richard Jensen and Chris Cornelis Results: FRIS-I (heart) (214 objects, 9 features)

Richard Jensen and Chris Cornelis Results: FRIS-II (heart)

Richard Jensen and Chris Cornelis Results: FRIS-III (heart)

Richard Jensen and Chris Cornelis Conclusion Proposed new techniques for instance selection based on fuzzy-rough sets Managed to reduce the number of instances significantly, retaining classification accuracy Future work Many possibilities for novel fuzzy-rough instance selection methods Comparisons with non-rough techniques Improving the complexity of FRIS-III Combined instance/feature selection

Richard Jensen and Chris Cornelis WEKA implementations of all fuzzy-rough methods can be downloaded from: