Progress Report on Continuous Data Stream Management  Mining Frequent Itemsets over Data Streams  Music Virtual Channel Presented by: Dr. Yi-Hung Wu.

Slides:



Advertisements
Similar presentations
Raghavendra Madala. Introduction Icicles Icicle Maintenance Icicle-Based Estimators Quality Guarantee Performance Evaluation Conclusion 2 ICICLES: Self-tuning.
Advertisements

指導教授:陳良弼 老師 報告者:鄧雅文  Introduction  Related Work  Problem Formulation  Future Work.
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
Nadia Andreani Dwiyono DESIGN AND MAKE OF DATA MINING MARKET BASKET ANALYSIS APLICATION AT DE JOGLO RESTAURANT.
Continuous Data Stream Processing  Music Virtual Channel – extensions  Data Stream Monitoring – tree pattern mining  Continuous Query Processing – sequence.
SAK 5609 DATA MINING Prof. Madya Dr. Md. Nasir bin Sulaiman
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Continuous Data Stream Processing
Continuous Data Stream Processing MAKE Lab Date: 2006/03/07 Post-Excellence Project Subproject 6.
Clementine Server Clementine Server A data mining software for business solution.
Continuous Data Stream Processing MAKE Lab Date: 2006/03/07 Post-Excellence Project Subproject 6.
Data Mining By Archana Ketkar.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Mining Association Rules
02 -1 Lecture 02 Agent Technology Topics –Introduction –Agent Reasoning –Agent Learning –Ontology Engineering –User Modeling –Mobile Agents –Multi-Agent.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
Committed to Deliver….  We are Leaders in Hadoop Ecosystem.  We support, maintain, monitor and provide services over Hadoop whether you run apache Hadoop,
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
NGDM’02 1 Efficient Data-Reduction Methods for On-line Association Rule Mining H. Bronnimann B. ChenM. Dash, Y. Qiao, P. ScheuermannP. Haas Polytechnic.
Project 1 Online multi-user video monitoring system.
DEPARTMENT OF REGIONAL DEVELOPMENT, PRIMARY INDUSTRY, FISHERIES AND RESOURCES Mine Site Water Quality Monitoring Michael.
Mining Optimal Decision Trees from Itemset Lattices Dr, Siegfried Nijssen Dr. Elisa Fromont KDD 2007.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Software Design Patterns for Information Visualization 薛乃榮 Q NCBCI LAB.
Chapter 1 Introduction to Data Mining
Page 1 Semantic Data Compression Techniques for NASA and Mobile Computing Databases Principal Investigators: G. Ozsoyoglu, Z.M. Ozsoyoglu Case Western.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
Approximate Frequency Counts over Data Streams Loo Kin Kong 4 th Oct., 2002.
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Computer Software Chapter 4.
Fine-grained Partitioning for Aggressive Data Skipping Calvin SIGMOD 2014 UC Berkeley.
A Test Paradigm for Detecting Changes in Transactional Data Streams Willie Ng and Manoranjan Dash DASFAA 2008.
Outline Knowledge discovery in databases. Data warehousing. Data mining. Different types of data mining. The Apriori algorithm for generating association.
Fast Algorithms for Mining Association Rules Rakesh Agrawal and Ramakrishnan Srikant VLDB '94 presented by kurt partridge cse 590db oct 4, 1999.
Research issues on association rule mining Loo Kin Kong 26 th February, 2003.
The Internet of Things with Live Data Cloud by Open Automation Software.
MICROSOFT SEMANTIC ENGINE Unified Search, Discovery and Insight.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Data Mining: Concepts and Techniques Mining data streams
Security in Outsourced Association Rule Mining. Agenda  Introduction  Approximate randomized technique  Encryption  Summary and future work.
Data Management Support for Life Sciences or What can we do for the Life Sciences? Mourad Ouzzani
Students: Aiman Md Uslim, Jin Bai, Sam Yellin, Laolu Peters Professors: Dr. Yung-Hsiang Lu CAM 2 Continuous Analysis of Many CAMeras The Problem Currently.
Fundamentals, Design, and Implementation, 9/e KDD and Data Mining Instructor: Dragomir R. Radev Winter 2005.
Streaming Semantic Data COMP6215 Semantic Web Technologies Dr Nicholas Gibbins –
Recommendation Systems ARGEDOR. Introduction Sample Data Tools Cases.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Christoph F. Eick: Final Words COSC Topics Covered in COSC 3480  Data models (ER, Relational, XML)  Using data models; learning how to store real.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Mining Data Streams with Periodically changing Distributions Yingying Tao, Tamer Ozsu CIKM’09 Supervisor Dr Koh Speaker Nonhlanhla Shongwe April 26,
NASA Space Communications Symposium
DATA MINING © Prentice Hall.
INTRODUCTION TO CAPSTONE PROJECT
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Text Classification CS5604 Information Retrieval and Storage – Spring 2016 Virginia Polytechnic Institute and State University Blacksburg, VA Professor:
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Jiawei Han Department of Computer Science
Targeted Association Mining in Time-Varying Domains
I don’t need a title slide for a lecture
Analytics: Its More than Just Modeling
CLUSTER BY: A NEW SQL EXTENSION FOR SPATIAL DATA AGGREGATION
GATES: A Grid-Based Middleware for Processing Distributed Data Streams
Data Warehousing Data Mining Privacy
Presentation transcript:

Progress Report on Continuous Data Stream Management  Mining Frequent Itemsets over Data Streams  Music Virtual Channel Presented by: Dr. Yi-Hung Wu Date: 2004/11/3 Post-Excellence Project Subproject 6

Continuous Data Stream Management /9 A data stream is a continuous, massive, and rapid data flow… 1 Data Stream Data Stream Processor Synopsis Report Query Archive Buffer Time Storage Accuracy Sampling

Continuous Data Stream Management /9 Mining Data Streams  Various data formats  Relational tuples, transactions, sequences, trees, graphs, …  Various interesting patterns  Association, trends, clustering, outliers, classification, …  Various interestingness measures  Support, confidence, lift, chi-square value, entropy, … 2

Continuous Data Stream Management /9 Mining Data Streams  Landmark model  Time-decaying model  Sliding-window model 3

Continuous Data Stream Management /9 Mining Frequent Itemsets from Data Streams with a Time-Sensitive Sliding-Window 4 Discounting table Potential count

Continuous Data Stream Management /9 Mining Frequent Itemsets from Data Streams with a Time-Sensitive Sliding-Window 5

Continuous Data Stream Management /9 Mining Frequent Itemsets from Data Streams with a Time-Sensitive Sliding-Window 6  Our contributions  The time-sensitive sliding-window model  Fast mining and discounting method mining time > maintenance time  Self-adjusting discounting table selective adjustment vs. naïve adjustment  Accuracy guarantees No false dismissal  false alarm rate No false alarm  false dismissal rate

Continuous Data Stream Management /9 Mining Frequent Itemsets from Data Streams with a Time-Sensitive Sliding-Window 7

Continuous Data Stream Management /9 Music Virtual Channel: Framework 8 … 1 1 N N 2 2 … Data sources Internet Overlay network V.C. player V.C. player Clustering engine Clustering engine Filtering engine Filtering engine Interface Profile monitor Profile monitor Cluster monitor Cluster monitor Channel monitor Channel monitor

Continuous Data Stream Management /9 Music Virtual Channel: Implementation 9 … 1 1 N N 2 2 … Music Databases Internet V.C. player V.C. player Clustering engine Clustering engine Filtering engine Filtering engine Music stream simulator Internet Music feature database