1 ENTROPY-BASED CONCEPT SHIFT DETECTION PETER VORBURGER, ABRAHAM BERNSTEIN IEEE ICDM 2006 Speaker: Li HueiJyun Advisor: Koh JiaLing Date:2007/11/6 1.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Systematic Data Selection to Mine Concept Drifting Data Streams Wei Fan IBM T.J.Watson.
A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions Jing Gao Wei Fan Jiawei Han Philip S. Yu University of Illinois.
Learning under concept drift: an overview Zhimin He iTechs – ISCAS
Classification, Regression and Other Learning Methods CS240B Presentation Peter Huang June 4, 2014.
Di Yang, Elke A. Rundensteiner and Matthew O. Ward Worcester Polytechnic Institute VLDB 2009, Lyon, France 1 A Shared Execution Strategy for Multiple Pattern.
Visual Data Mining: Concepts, Frameworks and Algorithm Development Student: Fasheng Qiu Instructor: Dr. Yingshu Li.
The Problem of Concept Drift: Definitions and Related Work Alexev Tsymbalo paper. (April 29, 2004)
Mining in Anticipation for Concept Change: Proactive-Reactive Prediction in Data Streams YING YANG, XINDONG WU, XINGQUAN ZHU Data Mining and Knowledge.
Data Stream Classification: Training with Limited Amount of Labeled Data Mohammad Mehedy Masud Latifur Khan Bhavani Thuraisingham University of Texas at.
Bring Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Date: 2011/11/21 Source: Claudiu S. Firan (CIKM’10)
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
Date : 21 st of May, Shri Ramdeo Baba College of Engineering and Management Presentation By : Rimjhim Singh Under the Guidance of: Dr. M.B. Chandak.
Introduction to Automatic Classification Shih-Wen (George) Ke 7 th Dec 2005.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
On Appropriate Assumptions to Mine Data Streams: Analyses and Solutions Jing Gao† Wei Fan‡ Jiawei Han† †University of Illinois at Urbana-Champaign ‡IBM.
TRADING OFF PREDICTION ACCURACY AND POWER CONSUMPTION FOR CONTEXT- AWARE WEARABLE COMPUTING Presented By: Jeff Khoshgozaran.
1 An Adaptive Nearest Neighbor Classification Algorithm for Data Streams Yan-Nei Law & Carlo Zaniolo University of California, Los Angeles PKDD, Porto,
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Bing LiuCS Department, UIC1 Learning from Positive and Unlabeled Examples Bing Liu Department of Computer Science University of Illinois at Chicago Joint.
1 Dot Plots For Time Series Analysis Dragomir Yankov, Eamonn Keogh, Stefano Lonardi Dept. of Computer Science & Eng. University of California Riverside.
Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.
Detecting Distance-Based Outliers in Streams of Data Fabrizio Angiulli and Fabio Fassetti DEIS, Universit `a della Calabria CIKM 07.
(ACM KDD 09’) Prem Melville, Wojciech Gryc, Richard D. Lawrence
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
HMM-BASED PSEUDO-CLEAN SPEECH SYNTHESIS FOR SPLICE ALGORITHM Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang Wen-Yi Chu Department of Computer Science & Information.
Semi-Supervised Learning with Concept Drift using Particle Dynamics applied to Network Intrusion Detection Data Fabricio Breve Institute of Geosciences.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
by B. Zadrozny and C. Elkan
1 Active learning based survival regression for censored data Bhanukiran Vinzamuri Yan Li Chandan K.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
Data Stream Mining and Incremental Discretization John Russo CS561 Final Project April 26, 2007.
1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Bayesian Classification. Bayesian Classification: Why? A statistical classifier: performs probabilistic prediction, i.e., predicts class membership probabilities.
A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.
Anomaly Detection in Data Mining. Hybrid Approach between Filtering- and-refinement and DBSCAN Eng. Ştefan-Iulian Handra Prof. Dr. Eng. Horia Cioc ârlie.
A new Ad Hoc Positioning System 컴퓨터 공학과 오영준.
Classification and Novel Class Detection in Data Streams Classification and Novel Class Detection in Data Streams Mehedy Masud 1, Latifur Khan 1, Jing.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
MINING COLOSSAL FREQUENT PATTERNS BY CORE PATTERN FUSION FEIDA ZHU, XIFENG YAN, JIAWEI HAN, PHILIP S. YU, HONG CHENG ICDE07 Advisor: Koh JiaLing Speaker:
Bayesian Classification Using P-tree  Classification –Classification is a process of predicting an – unknown attribute-value in a relation –Given a relation,
1 AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Hong.
Bing LiuCS Department, UIC1 Chapter 8: Semi-supervised learning.
Intelligent Database Systems Lab Advisor : Dr.Hsu Graduate : Keng-Wei Chang Author : Lian Yan and David J. Miller 國立雲林科技大學 National Yunlin University of.
Date: 2015/11/19 Author: Reza Zafarani, Huan Liu Source: CIKM '15
Detecting New a Priori Probabilities of Data Using Supervised Learning Karpov Nikolay Associate professor NRU Higher School of Economics.
Iowa State University Department of Computer Science Center for Computational Intelligence, Learning, and Discovery Harris T. Lin, Sanghack Lee, Ngot Bui.
D-skyline and T-skyline Methods for Similarity Search Query in Streaming Environment Ling Wang 1, Tie Hua Zhou 1, Kyung Ah Kim 2, Eun Jong Cha 2, and Keun.
Feature Extraction Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and.
LINDEN : Linking Named Entities with Knowledge Base via Semantic Knowledge Date : 2013/03/25 Resource : WWW 2012 Advisor : Dr. Jia-Ling Koh Speaker : Wei.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Jessica K. Ting Michael K. Ng Hongqiang Rong Joshua Z. Huang 國立雲林科技大學.
Supervised Machine Learning: Classification Techniques Chaleece Sandberg Chris Bradley Kyle Walsh.
Speaker Min-Koo Kang March 26, 2013 Depth Enhancement Technique by Sensor Fusion: MRF-based approach.
1 Systematic Data Selection to Mine Concept-Drifting Data Streams Wei Fan Proceedings of the 2004 ACM SIGKDD international conference on Knowledge discovery.
On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams Peng Wang, H. Wang, X. Wu, W. Wang, and B. Shi Proc. of the Fifth IEEE International.
1 Mining the Smallest Association Rule Set for Predictions Jiuyong Li, Hong Shen, and Rodney Topor Proceedings of the 2001 IEEE International Conference.
Mining Concept-Drifting Data Streams Using Ensemble Classifiers Haixun Wang Wei Fan Philip S. YU Jiawei Han Proc. 9 th ACM SIGKDD Internal Conf. Knowledge.
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Bias Management in Time Changing Data Streams We assume data is generated randomly according to a stationary distribution. Data comes in the form of streams.
Posture Monitoring System for Context Awareness in Mobile Computing
KDD 2004: Adversarial Classification
Query in Streaming Environment
An Enhanced Support Vector Machine Model for Intrusion Detection
When Security Games Go Green
Intent-Aware Semantic Query Annotation
Kostas Kolomvatsos, Christos Anagnostopoulos
Presentation transcript:

1 ENTROPY-BASED CONCEPT SHIFT DETECTION PETER VORBURGER, ABRAHAM BERNSTEIN IEEE ICDM 2006 Speaker: Li HueiJyun Advisor: Koh JiaLing Date:2007/11/6 1

2 OUTLINE Introduction Entropy and Concept Shift Adaption Calculating Entropy on Data Streams Algorithm Control Strategy using Entropy Measure Experimental setup Experimental Results Discussion of the Experiments Application to a Real-World Problem: Context Switches in Sensor Data Limitations, Future Work, and Conclusion 2

3 INTRODUCTION Problem: In many applications data is gathered over time, which raises the problem that the concepts to be learned may drift (i.e., change) over time. The increasing amount of data (e.g., multimedia content, data warehouse ) and limitation of computing power due to miniaturization (e.g., wearable computing) call for faster and more resource friendly algorithms. Motivation: the analysis of sensor data on wearable devices. 3

4 INTRODUCTION Context-awareness: A Scenario-based Approach for Direct Interruptablity Prediction on Wearable Devices Classifiers predict peoples’ anticipated behavior based on sensory input Contexts (or contextual situations) switch rather than gradually change Contextual information could be reused, even for new, not yet encountered situations An ongoing monitoring of the sensor stream is needed 4

5 INTRODUCTION Problem: online pattern matching mechanism comparing the sensor stream to the entire library of already known contexts is computational complex and not yet suitable for today’s wearable devices. Solution: indicate possible candidates (or hot spots) for context changes limiting the computationally intensive context (re-)determination on those candidates. 5

6 ENTROPY AND CONCEPT SHIFT ADAPTION Assumptions: As long as the distribution of older instances (features and target values) is similar to the distribution of new instances no concept drift occurred A distribution difference between older and more recent instances indicates a change in the target concept Measure the distribution inequality: If two distributions are equal, the entropy measure results in a value of 1 If they are absolutely different the measure will result in a value of 0 6

7 CALCULATING ENTROPY ON DATA STREAMS Sliding window technique: compares two windows, one presenting older and the other representing more recent instances in the stream Compare the two windows by counting and comparing all instances with respect to their class and stream membership Discretize the range of instance values to a fixed number of bins to take the approximate value distribution into account 7

8 CALCULATING ENTROPY ON DATA STREAMS A data stream: a sequence consisting of sequentially ordered tuples in time t i i (1, 2, 3, …) := (, l i ) where is the vector of all feature stream instances s n i at time t i The domain of the label stream l is discrete and contains all class values c C 8

9 CALCULATING ENTROPY ON DATA STREAMS H i : the resulting entropy at time t i and is defined as the mean of all data stream entropies H is at time t i where S : the number of feature-streams H is is calculated from the entropies H iscb

10 CALCULATING ENTROPY ON DATA STREAMS H iscb : represent the entropy of each class ( c C ) and bin ( b B ) given the stream s at time t i Bins: discrete aggregation of the values of each feature stream s : the probability that an instance occurs in the old window at time t i, belong to class c, with feature domain of stream s in bin b w iscb : depend on i, s, c, b

ALGORITHM CONTROL STRATEGY USING ENTROPY MEASURE Instance selection style algorithm 11

EXPERIMENTAL SETUP Real concept drifts: changes in the actual target concepts Virtual concept drifts: changes in the distribution Generate synthetic data set: H. Wang, W. Fan, P. S. Yu, and J. Han. Mining concept-drifting data streams using ensemble classifiers. In KDD ‘03: proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining Real drift data set Virtual drift data set Mixed data set 12

EXPERIMENTAL SETUP Performance measure: Accuracy The area under the ROC-curve A representative set of benchmarks: Perfect benchmark: assumes an oracle-given ideal window size ξ for any point in time A selection of ensemble classifiers: the literature so far showed to have the highest accuracy and robustness against noise P. Vorburger and A. Bernstein. Entropy-based detection of real and virtual concept shifts. Working Paper – University of Zurich, Department of Informatics,

EXPERIMENTAL RESULTS 14

EXPERIMENTAL RESULTS The prediction quality against increasing noise levels 15

EXPERIMENTAL RESULTS Computational complexity Compare ensemble classifiers and the entropy measure based algorithm Measure the elapsed time: three committee classifiers: ±15s Entropy based algorithm: 148.6s Entropy calculation without Naïve Bayes model building: 1.1±0.1s 16

DISCUSSION OF THE EXPERIMENT Entropy measure outperforms the ensemble benchmark algorithm on real concept shifts Exhibit a greater predictive power while requiring less computational resources The entropy measure based algorithm showed the nearly the same robustness towards noise as the perfect benchmark and the committee classifiers 17

APPLICATION TO A REAL-WORLD PROBLEM: CONTEXT SWITCHES IN SENSOR DATA Data set: Audio: decomposed into 10 features accelerometer data recorded over a time of 15381s: merged in one single feature The wearable data acquisition set up: a microphone and three three-dimentional accelerometers attached on the subject’s shoulder, wrist, and leg 18

APPLICATION TO A REAL-WORLD PROBLEM: CONTEXT SWITCHES IN SENSOR DATA (A)walking (B)streetcar (C)office work (D)lecture (E)cafeteria (F)meeting 19

LIMITATIONS, FUTURE WORK, AND CONCLUSION Gradual concept drifts Find boundary conditions Recognize recurring concepts and exploit this information Generalizability The choice of the suitable parameters could be optimized 20

LIMITATIONS, FUTURE WORK, AND CONCLUSION Find a measure for detecting and measuring concept shifts as an analogue for context switches Formulation of entropy on data streams is capable to detect and measure concept shifts Algorithm with an entropy based instance selection strategy outperformed ensemble based algorithms on real concept shift data sets Given algorithm robustness towards noise, its sensitivity towards concept shifts, its computational efficiency, and predictive power on real concept shift data sets 21