Visual Discovery Management: Divide and Conquer Abhishek Mukherji, Professor Elke A. Rundensteiner, Professor Matthew O. Ward XMDVTool, Department of Computer.

Slides:



Advertisements
Similar presentations
An Introduction to Data Mining
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Mapping Nominal Values to Numbers for Effective Visualization Presented by Matthew O. Ward Geraldine Rosario, Elke Rundensteiner, David Brown, Matthew.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
Data Mining Techniques Cluster Analysis Induction Neural Networks OLAP Data Visualization.
Information Retrieval in Practice
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Chapter 3 Database Management
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
/department of mathematics and computer science Visualization of Transition Systems Hannes Pretorius Visualization Group
A Strategy Selection Framework for Adaptive Prefetching in Visual Exploration Punit R. Doshi, Geraldine E. Rosario, Elke A. Rundensteiner, and Matthew.
Research Project Mining Negative Rules in Large Databases using GRD.
Probabilistic Databases Amol Deshpande, University of Maryland.
SESSION 7 MANAGING DATA DATARESOURCES. File Organization Terms and Concepts Field: Group of words or a complete number Record: Group of related fields.
Data Mining – Intro.
12 -1 Lecture 12 User Modeling Topics –Basics –Example User Model –Construction of User Models –Updating of User Models –Applications.
Data mining By Aung Oo.
Prefetching for Visual Data Exploration Punit R. Doshi, Elke A. Rundensteiner, Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Beyond Opportunity; Enterprise Miner Ronalda Koster, Data Analyst.
Basic Concepts in Big Data
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Sensor Data Management: Challenges and (some) Solutions Amol Deshpande, University of Maryland.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Chapter 1 Introduction to Data Mining
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Detecting, Assessing and Monitoring Relevant Topics in Virtual.
Knowledge Discovery and Data Mining Evgueni Smirnov.
David S. Ebert David S. Ebert Visual Analytics to Enable Discovery and Decision Making: Potential, Challenges, and.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Outline Knowledge discovery in databases. Data warehousing. Data mining. Different types of data mining. The Apriori algorithm for generating association.
What is Data Mining? process of finding correlations or patterns among dozens of fields in large relational databases process of finding correlations or.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Foundations of Business Intelligence: Databases and Information Management.
Image Classification for Automatic Annotation
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Di Yang, Zhengyu Guo, Elke A. Rundensteiner and Matthew O. Ward Worcester Polytechnic Institute EDBT 2010, Submitted 1 A Unified Framework Supporting Interactive.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
Data Mining and Decision Support
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining – Intro.
SNS COLLEGE OF TECHNOLOGY
DATA MINING © Prentice Hall.
Introduction to Data Mining
Introduction C.Eng 714 Spring 2010.
Data Warehousing and Data Mining
Grant Number: IIS Institution of PI: WPI PIs: Matthew O
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Web Mining Department of Computer Science and Engg.
Data Mining: Concepts and Techniques
Presentation transcript:

Visual Discovery Management: Divide and Conquer Abhishek Mukherji, Professor Elke A. Rundensteiner, Professor Matthew O. Ward XMDVTool, Department of Computer Science MODELING NUGGETS MOTIVATION This project is supported by NSF under grants IIS and CCF What analysts work with 1.Huge datasets 2.Primarily data views 3.Cluttered displays 4.Limited sharing S MORE RELEVANT TOPICS HANDLING USER UPDATESRELATIONSHIPS  Providing analysts the capability of managing their discoveries online,  Enhanced visualization using the hierarchical views  Superior evidence management supporting reasoning and decision making,  Knowledge sharing between groups of analysts. PROJECT IMPACT WHAT WE AIM TO GIVE THEM DATA INFORMATION Context KNOWLEDGE Meaning WISDOM Insight Hypothesis view Nugget view Data view PROPOSED TASKS  Nugget definition, modeling and storage  Classes of nuggets and their inter-relationships  Provenance links to data  Nugget discovery and capture  Explicit, implicit and automated generation  Nugget lifespan management  Validation & refinement (meaning & quality)  Visually examine the extracted nuggets and derivation traces  Annotate and classify nuggets  Associate confidence to a nugget  Employ computational techniques (nearness measures)  Eliminate redundant nuggets  Structuring  Clusters or hierarchy of nugget subsets  Ordering / sequencing  Correlations or causal relationships  Nugget-supported Visual Exploration  Interactive visual analytics Target Scenarios  Terrorist attacks  Flu pandemic  Tornado touch-down  Electric grid overload  Between data and nugget  is-valid-for, forms-support-for, is-member-of.  Between two or more nuggets  is-similar-to, is-derived-from, is-evidence-for acct-nobalancezipcode 101a b User avg-balances select zipcode, avg(balance) from accounts group by zipcode A traditional database view (defined using an SQL query) accounts timeidtemp 10am120 10am221.. … 10am729 temperatures Use Regression to predict missing values and to remove spatial bias A model-based database view* (defined using a statistical model ) raw-temp-data User CREATE VIEW RegView(time [0::1], x [0:100:10], y[0:100:10], temp) AS FIT temp USING time, x, y BASES 1, x, x 2, y, y 2 FOR EACH time T TRAINING DATA SELECT temp, time, x, y FROM raw-temp-data WHERE raw-temp-data.time = T 1.New arriving tuples. 2.Update to existing tuples. UPDATE WEATHER_INFO SET RESULT = “No” WHERE WEATHER = “overcast” NO  Keep track of data and nuggets prone to change.  Incremental updates. ASSOCIATION RULES VIEWS CREATE ASSOCIATION RULES VIEW Rules ({antecedent itemset}--> {consequent itemset}) -- [Label, Supp, Conf, DSubset] SELECT * FROM transactions WHERE ATTRIB_k BETWEEN K_min AND K_max INTERESTINGNESS MEASURE minSupport = S and minConfidence = C  {R11(x1:x6), R12(x3:x20)}, {R21 (x3:x5), R22(x10:x32)} => {(R11, R21), (R12, R21)}  {R11(XY->Z), R12(ABC->D)}, {R21 (DE->FG), R22(Y->ZW)} => {(R12, R21)} SELECT RV1.label, RV2.label FROM RULES_VIEW1, RULES_VIEW2 WHERE RULES_VIEW1.DSubset CONTAINS RULES_VIEW2.DSubset SELECT RV1.label, RV2.label FROM RULES_VIEW1, RULES_VIEW2 WHERE RULES_VIEW1.consequent CONTAINS RULES_VIEW2.antecedent  Relationships across nugget types  Cascading changes data-> nuggets -> relationships-> meta-nuggets -> hypothesis *MauveDB: Supporting Model-based User Views in Database Systems; Amol Deshpande, Sam Madden; SIGMOD 2006.