Mining in Anticipation for Concept Change: Proactive-Reactive Prediction in Data Streams YING YANG, XINDONG WU, XINGQUAN ZHU Data Mining and Knowledge.

Slides:



Advertisements
Similar presentations
A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions Jing Gao Wei Fan Jiawei Han Philip S. Yu University of Illinois.
Advertisements

Learning under concept drift: an overview Zhimin He iTechs – ISCAS
Introduction to Support Vector Machines (SVM)
Classification, Regression and Other Learning Methods CS240B Presentation Peter Huang June 4, 2014.
An Adaptive Learning Method for Target Tracking across Multiple Cameras Kuan-Wen Chen, Chih-Chuan Lai, Yi-Ping Hung, Chu-Song Chen National Taiwan University.
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
Decision Tree Approach in Data Mining
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.
Ensemble Methods An ensemble method constructs a set of base classifiers from the training data Ensemble or Classifier Combination Predict class label.
Data Mining Classification: Alternative Techniques
The Problem of Concept Drift: Definitions and Related Work Alexev Tsymbalo paper. (April 29, 2004)
Korea Univ. Division Information Management Engineering UI Lab. Korea Univ. Division Information Management Engineering UI Lab. S E M I N A R Predictive.
Second order cone programming approaches for handing missing and uncertain data P. K. Shivaswamy, C. Bhattacharyya and A. J. Smola Discussion led by Qi.
Date : 21 st of May, Shri Ramdeo Baba College of Engineering and Management Presentation By : Rimjhim Singh Under the Guidance of: Dr. M.B. Chandak.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
On Appropriate Assumptions to Mine Data Streams: Analyses and Solutions Jing Gao† Wei Fan‡ Jiawei Han† †University of Illinois at Urbana-Champaign ‡IBM.
1 An Adaptive Nearest Neighbor Classification Algorithm for Data Streams Yan-Nei Law & Carlo Zaniolo University of California, Los Angeles PKDD, Porto,
A Brief Introduction to Adaboost
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Ensemble Learning: An Introduction
Adaboost and its application
Efficient Estimation of Emission Probabilities in profile HMM By Virpi Ahola et al Reviewed By Alok Datar.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
Boosting Main idea: train classifiers (e.g. decision trees) in a sequence. a new classifier should focus on those cases which were incorrectly classified.
CSCI 347 / CS 4206: Data Mining Module 04: Algorithms Topic 06: Regression.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Machine Learning CS 165B Spring 2012
Where Are the Nuggets in System Audit Data? Wenke Lee College of Computing Georgia Institute of Technology.
Semi-Supervised Learning with Concept Drift using Particle Dynamics applied to Network Intrusion Detection Data Fabricio Breve Institute of Geosciences.
Event Metadata Records as a Testbed for Scalable Data Mining David Malon, Peter van Gemmeren (Argonne National Laboratory) At a data rate of 200 hertz,
1 Verifying and Mining Frequent Patterns from Large Windows ICDE2008 Barzan Mozafari, Hetal Thakkar, Carlo Zaniolo Date: 2008/9/25 Speaker: Li, HueiJyun.
1 ENTROPY-BASED CONCEPT SHIFT DETECTION PETER VORBURGER, ABRAHAM BERNSTEIN IEEE ICDM 2006 Speaker: Li HueiJyun Advisor: Koh JiaLing Date:2007/11/6 1.
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
Boosting of classifiers Ata Kaban. Motivation & beginnings Suppose we have a learning algorithm that is guaranteed with high probability to be slightly.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.6: Linear Models Rodney Nielsen Many of.
Ensembles. Ensemble Methods l Construct a set of classifiers from training data l Predict class label of previously unseen records by aggregating predictions.
Particle Competition and Cooperation in Networks for Semi-Supervised Learning with Concept Drift Fabricio Breve 1,2 Liang Zhao 2
Powerpoint Templates 1 Mining High-Speed Data Streams Pedro Domingos Geoff Hulten Sixth ACM SIGKDD International Confrence Presented by: Afsoon.
ISQS 6347, Data & Text Mining1 Ensemble Methods. ISQS 6347, Data & Text Mining 2 Ensemble Methods Construct a set of classifiers from the training data.
Classification and Novel Class Detection in Data Streams Classification and Novel Class Detection in Data Streams Mehedy Masud 1, Latifur Khan 1, Jing.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
Decision Mining in Prom A. Rozinat and W.M.P. van der Aalst Joosung, Ko.
Active learning Haidong Shi, Nanyi Zeng Nov,12,2008.
One-class Classification of Text Streams with Concept Drift
Text Document Categorization by Term Association Maria-luiza Antonie Osmar R. Zaiane University of Alberta, Canada 2002 IEEE International Conference on.
Random Forests Ujjwol Subedi. Introduction What is Random Tree? ◦ Is a tree constructed randomly from a set of possible trees having K random features.
Classification Ensemble Methods 1
Xiangnan Kong,Philip S. Yu An Ensemble-based Approach to Fast Classification of Multi-label Data Streams Dept. of Computer Science University of Illinois.
Mining Progressive Confident Rules M. Zhang, W. Hsu and M.L. Lee Int'l Conf on Data Mining (ICDM),2006 IEEE Advisor : Jia-Ling Koh Speaker : Tsui-Feng.
1 Systematic Data Selection to Mine Concept-Drifting Data Streams Wei Fan Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery.
Reinforcement Learning for Mapping Instructions to Actions S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay Computer Science and Artificial.
On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams Peng Wang, H. Wang, X. Wu, W. Wang, and B. Shi Proc. of the Fifth IEEE International.
Mismatch String Kernals for SVM Protein Classification Christina Leslie, Eleazar Eskin, Jason Weston, William Stafford Noble Presented by Pradeep Anand.
Max-Confidence Boosting With Uncertainty for Visual tracking WEN GUO, LIANGLIANG CAO, TONY X. HAN, SHUICHENG YAN AND CHANGSHENG XU IEEE TRANSACTIONS ON.
Mining Concept-Drifting Data Streams Using Ensemble Classifiers Haixun Wang Wei Fan Philip S. YU Jiawei Han Proc. 9 th ACM SIGKDD Internal Conf. Knowledge.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Introduction to Machine Learning, its potential usage in network area,
Ensemble Classifiers.
Experience Report: System Log Analysis for Anomaly Detection
Mining Data Streams with Periodically changing Distributions Yingying Tao, Tamer Ozsu CIKM’09 Supervisor Dr Koh Speaker Nonhlanhla Shongwe April 26,
New ideas on FP-Growth and batch incremental mining with FP-Tree
Supervised Time Series Pattern Discovery through Local Importance
Cost-Sensitive Learning
Cost-Sensitive Learning
Introduction to Data Mining, 2nd Edition
An Adaptive Nearest Neighbor Classification Algorithm for Data Streams
Decision Trees for Mining Data Streams
Learning from Data Streams
Modeling IDS using hybrid intelligent systems
Presentation transcript:

Mining in Anticipation for Concept Change: Proactive-Reactive Prediction in Data Streams YING YANG, XINDONG WU, XINGQUAN ZHU Data Mining and Knowledge Discovery (DMKD),2006 IEEE Advisor : Jia-Ling Koh Speaker : Tsui-Feng Yen

Outline Introduction Background knowledge Building concept history Choosing prediction models Related work Experiments

Introduction Two major challenges posed by data streams : - the data may grow without limit so that it is difficult to retain a long history of raw data -the underlying concept of the data may change over time

Introduction Some problems of previous work remain open : - the history of data streams is not well organized nor made good use of -A second open problem is that existing approaches are mainly interested in predicting the class of each specific instance

Introduction The novelties of this paper : - uses a measure of conceptual equivalence to organize the data history into a history of concepts -it learns concept-transition patterns from the concept history and anticipates what the concept will be in the case of a concept change

Introduction The novelties of this paper : - it incorporates proactive and reactive predictions - an efficient and effective system RePro is proposed to implement these new ideas

Background knowledge Terminology *A data stream is a sequence of instances *Each instance is a vector of attribute values *Each instance has a class label *Predicting for an instance is to decide the class of an unlabeled instance by evidence from labeled instances

Background knowledge Modes of concept change Concept change refers to the change of the underlying concept over time Concept drift describes a gradual change of the concept Concept shift happens when a change between two concepts is more abrupt Sampling change refers to the change in the data distribution.

Background knowledge Taxonomy of prediction approaches (a) Trigger-sensitive - Once a trigger is detected, a new model is constructed for data coming after the trigger Trigger-insensitive - they continuously adapt the current model to newly coming instances (b) Incremental - process coming instances one by one Batch- exam a batch of instances at once

Background knowledge Taxonomy of prediction approaches (c) Historical- After a trigger is detected, historical approaches consult the history to construct a new model Contemporary- only resort to data in hand that have just triggered the concept change (d) Proactive- Proactive approaches foresee what the forthcoming concept is given the current concept Reactive- do not predict what concept is coming. A new prediction model is constructed upon a trigger is detected

Building concept history A classification algorithm -This algorithm is used to abstract a concept from the raw data. -such as decision rules, decision trees, C4.5rules A trigger detection algorithm -It is especially important when concept shift happens -A sliding-window methodology is used here, two important parameters are the window size and the error threshold - the window is full and its error rate exceeds the error threshold, the beginning instance is taken as a trigger

Building concept history A measure of conceptual equivalence - checks whether a concept is brand new or reappearing

Building concept history The building process -the window size is 10 -the stable learning size is 30 -the error threshold is 55%. - A ♠ represents an instance where a stable trigger is detected -A ♣ represents an instance where a temporary trigger is detected -A √ represents a correctly classified instance -A X represents a misclassified instance

Choosing prediction models Proactive (trigger-sensitive, incremental, historical and proactive) - A proactive approach predicts the oncoming concept given the current concept (s) by evidence from the concept history -Once a new trigger is detected that indicates the concept has changed, the predicted concept immediately takes over the classification task - In the proactive style, the history of concepts is treated like a Markov Chain -A transition matrix can be constructed from the concept history and dynamically updated upon each future occurrence of concept change.

Choosing prediction models Proactive (example) Suppose the concept is : spring, summer, autumn, winter, spring, summer, hurricane, autumn, winter, spring, flood, summer, autumn, winter, spring, summer, autumn, winter, spring, summer, hurricane, autumn,…etc

Choosing prediction models Contemporary- Reactive (trigger-sensitive, incremental, contemporary and reactive.) -Upon detecting a new trigger, contemporary- reactive prediction does not consult the concept history. - it uses this model to classify oncoming instances. - this approach risks high classification variance especially when the sliding window is small

Choosing prediction models Historical- Reactive (trigger-sensitive, incremental, historical and reactive.) -Upon detecting a new trigger, historical reactive prediction retrieves a concept from the history that is most appropriate for the trigger instances - One problem happens when this new concept is very different from every existing concept. - Another potential concern is the efficiency issue

Choosing prediction models RePro, a coalition of strength

Related work WCE (weighed classifier ensemble) -divides its previous data into sequential chunks of fixed size, builds a classifier from each chunk, and composes a classifier ensemble where each classifier is weighed proportional -no trigger detection nor conceptual equivalence -more striking when concept shifts than when concept drifts

Related work CVFDT (concept-adapting very fast decision tree) -maintains a window of training instances and keeps its learned tree up-to-date with this window -it is trigger-sensitive -CVFDT builds a new prediction model from scratch -CVFDT can not take advantage of previous experience may be less efficient

Experiments Data (a) stagger simulate the scenario of concept shift (b) Hyperplane simulate the scenario of concept drift (c) Network intrusion. simulate the scenario of sampling change

Experiments