Mining Dynamics of Data Streams in Multi-Dimensional Space

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

Wincite Knowledge Warehousing and Networking Sophisticated Simplicity.
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
University of Minnesota
Data Mining By Archana Ketkar.
Data Mining – Intro.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Client-Side Validation with Javascript by Daniel Yee.
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
1 Of Crawlers, Portals, Mice and Men: Is there more to Mining the Web? Jiawei Han Simon Fraser University, Canada ACM-SIGMOD’99 Web Mining Panel Presentation.
Last Words DM 1. Mining Data Steams / Incremental Data Mining / Mining sensor data (e.g. modify a decision tree assuming that new examples arrive continuously,
Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.
Data Warehousing Mining & BI Data Streams Mining DWMBI1.
Data Mining: Concepts and Techniques Mining data streams
Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.
Xiangnan Kong,Philip S. Yu An Ensemble-based Approach to Fast Classification of Multi-label Data Streams Dept. of Computer Science University of Illinois.
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Data Mining & OLAP What is Data Mining? Data Mining is the set of activities used to find new, hidden, or unexpected patterns in data.
Welcome.
Data Mining Functionalities
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining – Intro.
Information Systems in Organizations
Decision Support Systems
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
MIS2502: Data Analytics Advanced Analytics - Introduction
Managing the Privacy of Incidental Information During Collaboration
What’s next for search engine
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Databases and Information Management
Data Streaming in Computer Networking
Advanced Scientific Visualization
Chapter 13 The Data Warehouse
Latest Updates on BlackHawk Mines Music : Privacy Policy
Data warehouse & Data Mining: Concepts and Techniques
Introduction C.Eng 714 Spring 2010.
Introductory Seminar on Research: Fall 2017
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Introduction Position your online or offline business
©Jiawei Han and Micheline Kamber
Jiawei Han Department of Computer Science
©Jiawei Han and Micheline Kamber
Client-Side Validation with Javascript
Chapter 12: Automated data collection methods
Sangeeta Devadiga CS 157B, Spring 2007
Mining Unusual Patterns in Data Streams in Multi-Dimensional Space
Data Warehousing and Data Mining
Data Mining: Concepts and Techniques
A Framework for Clustering Evolving Data Streams
Data Mining: Concepts and Techniques
Data Warehousing Data Model –Part 1
Jiawei Han Department of Computer Science
©Jiawei Han and Micheline Kamber
Data Mining: Introduction
MIS2502: Data Analytics Introduction to Advanced Analytics
Data Mining: Concepts and Techniques
Online Analytical Processing Stream Data: Is It Feasible?
Kenneth C. Laudon & Jane P. Laudon
Mining Data Streams Many slides are borrowed from Stanford Data Mining Class and Prof. Jiawei Han’s lecture slides.
CSE591: Data Mining by H. Liu
Presentation transcript:

Mining Dynamics of Data Streams in Multi-Dimensional Space 11/9/2018 Mining Dynamics of Data Streams in Multi-Dimensional Space Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign www.cs.uiuc.edu/~hanj

Challenges of Stream Data Mining 11/9/2018 Challenges of Stream Data Mining Mining query mode: continuous, ad-hoc, progressive? Mining mode: batched vs. interactive vs. lazy mining? Time constraints: real-time? What patterns to be mined? Finding patterns, anomaly, differences, …in multiple streams Mining dynamics (changes, trends and evolutions) of data streams Multi-level/multi-dimensional processing and data mining Most stream data are at pretty low-level or multi-dimensional in nature November 9, 2018 Mining Dynamics of Data Streams

Why Mining Dynamics of Data Streams in Multi-Dimensional Space? 11/9/2018 Why Mining Dynamics of Data Streams in Multi-Dimensional Space? Dynamics (changes, trends and evolutions) of data streams Perhaps the most interesting thing in streams Cannot just look at the current data? Save something! Multi-dimensional stream mining Most real stream data are at low-level or multi-dimensional in nature How to examine dynamically at multi-dimensions? Finding dynamics: patterns and outliers in certain dimensional space November 9, 2018 Mining Dynamics of Data Streams

Stream Data Mining Tasks Multi-dimensional (on-line) analysis of streams Clustering data streams Classification of data streams Mining frequent patterns in data streams Mining sequential patterns in data streams Mining partial periodicity in data streams Mining notable gradients in data streams Mining outliers and unusual patterns in data streams ……, more? November 9, 2018 Mining Dynamics of Data Streams

Example 1: Multi-Dimensional (OLAP) Analysis Analysis of Web click streams Raw data at low levels: seconds, web page addresses, user IP addresses, … Analysts want: changes, trends, unusual patterns, at reasonable levels of details E.g., Average clicking traffic in North America on sports in the last 15 minutes is 40% higher than that in the last 24 hours.” Analysis of power consumption streams Raw data: power consumption flow for every household, every minute Patterns one may find: average hourly power consumption surges up 30% for manufacturing companies in Chicago in the last 2 hours today than that of the same day a week ago November 9, 2018 Mining Dynamics of Data Streams

Example 2: Multi-Dimensional Classification Dynamic model update for loan or investment Huge amount of incoming flow of changing information with multiple dimensional space (factors) E.g., Should we invest this company based on the situation of the current market? Classification in dynamic (volatile) stock market Classification of stocks based on their current streams E.g., Is Lucent going to be up in the next little while? November 9, 2018 Mining Dynamics of Data Streams

Example 3: Hi-Dimensional Clustering Network intrusion detection Huge amount of incoming flow of network traffic information, multiple dimensional features in nature Find burst of activities/traffic in real time On-line clustering to detect abrupt changes What are the changes of e-mail or text information Clustering based on frequent terms Can we perform such clustering in real-time? November 9, 2018 Mining Dynamics of Data Streams

Methodology in Stream Data Mining Multi-dimensional (on-line) analysis Mining dynamics of data streams Time is a special dimension Tilted time frame (multiple time granularity) Stream data reduction and pre-computation What kind of multi-dimensional data to be pre-computed and stored for OLAP analysis? What kind of data to be pre-computed/stored for classification? For clustering? For mining frequent patterns? For mining sequential patterns? partial periodic patterns? …… How to do incremental updates? How to find changes? November 9, 2018 Mining Dynamics of Data Streams

?- Questions in Stream Data Mining Will stream data mining be real in practice? Should we develop general stream data mining principles, or ad-doc application-oriented methods? How are stream data mining methods different from incremental mining? How are stream data mining linked with stream data management system? With continuous query processing? Can we do privacy-preserving mining with stream data? November 9, 2018 Mining Dynamics of Data Streams

Mining Dynamics of Data Streams www.cs.uiuc.edu/~hanj Thank you !!! November 9, 2018 Mining Dynamics of Data Streams