A Strategy Selection Framework for Adaptive Prefetching in Visual Exploration Punit R. Doshi, Geraldine E. Rosario, Elke A. Rundensteiner, and Matthew.

Slides:



Advertisements
Similar presentations
Recuperação de Informação B Cap. 10: User Interfaces and Visualization 10.1,10.2,10.3 November 17, 1999.
Advertisements

Chapter 12 The Stages of Learning
1 VLDB 2006, Seoul Mapping a Moving Landscape by Mining Mountains of Logs Automated Generation of a Dependency Model for HUG’s Clinical System Mirko Steinle,
Di Yang, Elke A. Rundensteiner and Matthew O. Ward Worcester Polytechnic Institute VLDB 2009, Lyon, France 1 A Shared Execution Strategy for Multiple Pattern.
Towards Self-Testing in Autonomic Computing Systems Tariq M. King, Djuradj Babich, Jonatan Alava, and Peter J. Clarke Software Testing Research Group Florida.
TorusVis ND : Unraveling High- Dimensional Torus Networks for Network Traffic Visualizations Shenghui Cheng, Pradipta De, Shaofeng H.-C. Jiang* and Klaus.
Mapping Nominal Values to Numbers for Effective Visualization Presented by Matthew O. Ward Geraldine Rosario, Elke Rundensteiner, David Brown, Matthew.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Project 4 U-Pick – A Project of Your Own Design Proposal Due: April 14 th (earlier ok) Project Due: April 25 th.
Civil and Environmental Engineering Carnegie Mellon University Sensors & Knowledge Discovery (a.k.a. Data Mining) H. Scott Matthews April 14, 2003.
1 This work partially funded by NSF Grants IIS , IRIS and IIS Matthew O. Ward, Elke A. Rundensteiner, Jing Yang, Punit Doshi, Geraldine.
An Adaptable Benchmark for MPFS Performance Testing A Master Thesis Presentation Yubing Wang Advisor: Prof. Mark Claypool.
HCI Methods for Pathway Visualization Tools Purvi Saraiya, Chris North, Karen Duca* Virginia Tech Dept. of Computer Science, Center for Human-Computer.
/ department of mathematics and computer science TU/e eindhoven university of technology CEDEFOP workshop: Policy, Practice, Partnership: Getting to Work.
Service Differentiated Peer Selection An Incentive Mechanism for Peer-to-Peer Media Streaming Ahsan Habib, Member, IEEE, and John Chuang, Member, IEEE.
WPI Center for Research in Exploratory Data and Information Analysis CREDIA SC4DEVO-1, July 12-15, 2004 Interactive Visual Exploration of Multivariate.
Chapter 6: Database Evolution Title: AutoAdmin “What-if” Index Analysis Utility Authors: Surajit Chaudhuri, Vivek Narasayya ACM SIGMOD 1998.
Selecting Distinctive 3D Shape Descriptors for Similarity Retrieval Philip Shilane and Thomas Funkhouser.
Evaluating the Quality of Image Synthesis and Analysis Techniques Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
An Adaptive Multi-Objective Scheduling Selection Framework For Continuous Query Processing Timothy M. Sutherland Bradford Pielech Yali Zhu Luping Ding.
Introduction : ‘Skoll: Distributed Continuous Quality Assurance’ Morimichi Nishigaki.
Energy-efficient Self-adapting Online Linear Forecasting for Wireless Sensor Network Applications Jai-Jin Lim and Kang G. Shin Real-Time Computing Laboratory,
Prefetching for Visual Data Exploration Punit R. Doshi, Elke A. Rundensteiner, Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
WPI Center for Research in Exploratory Data and Information Analysis From Data to Knowledge: Exploring Industrial, Scientific, and Commercial Databases.
23 September 2004 Evaluating Adaptive Middleware Load Balancing Strategies for Middleware Systems Department of Electrical Engineering & Computer Science.
Query Planning for Searching Inter- Dependent Deep-Web Databases Fan Wang 1, Gagan Agrawal 1, Ruoming Jin 2 1 Department of Computer.
Determining the Significance of Item Order In Randomized Problem Sets Zachary A. Pardos, Neil T. Heffernan Worcester Polytechnic Institute Department of.
MASTERS THESIS DEFENSE QBANK A Web-Based Dynamic Problem Authoring Tool BY ANN PAUL ADVISOR: PROFESSOR CLIFF SHAFFER JUNE 2013 Computer Science Department.
Analytics Map Reduce Query Insight Hive Pig Hadoop SQL Map Reduce Business Intelligence Predictive Operational Interactive Visualization Exploratory.
Managing Service Metadata as Context The 2005 Istanbul International Computational Science & Engineering Conference (ICCSE2005) Mehmet S. Aktas
Visual Discovery Management: Divide and Conquer Abhishek Mukherji, Professor Elke A. Rundensteiner, Professor Matthew O. Ward XMDVTool, Department of Computer.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.
Searching for Extremes Among Distributed Data Sources with Optimal Probing Zhenyu (Victor) Liu Computer Science Department, UCLA.
Opinion to ponder… “ Since we are a visual species (especially the American culture), because of our educational system. Many of the tools currently used.
ECO-DNS: Expected Consistency Optimization for DNS Chen Stephanos Matsumoto Adrian Perrig © 2013 Stephanos Matsumoto1.
Performance Prediction for Random Write Reductions: A Case Study in Modelling Shared Memory Programs Ruoming Jin Gagan Agrawal Department of Computer and.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
StoryFlow: Tracking the Evolution of Stories IEEE INFOVIS 2013 Shixia Liu, Senior Member, IEEE, Microsoft Research Asia Yingcai Wu, Member, IEEE, Microsoft.
Investigating Adaptive Compilation using the MIPSpro Compiler Keith D. Cooper Todd Waterman Department of Computer Science Rice University Houston, TX.
Algorithms & FlowchartsLecture 10. Algorithm’s CONCEPT.
Visual Analytics of User Behavior Project Description: Analyze and predict user behavior in a virtual world to inform dynamic modifications to the environment.
Allen D. Malony Department of Computer and Information Science TAU Performance Research Laboratory University of Oregon Discussion:
1 Elke. A. Rundensteiner Worcester Polytechnic Institute Elisa Bertino Purdue University 1 Rimma V. Nehme Microsoft.
Memory Hierarchy Adaptivity An Architectural Perspective Alex Veidenbaum AMRM Project sponsored by DARPA/ITO.
Geographic Visualization to Support Epidemiology in Bulgaria Anthony C. Robinson GeoVISTA Center Department of Geography The Pennsylvania State University.
Image Classification for Automatic Annotation
Data mining, interactive semantic structuring, and collaboration: A diversity-aware method for sense-making in search Mathias Verbeke, Bettina Berendt,
Di Yang, Zhengyu Guo, Elke A. Rundensteiner and Matthew O. Ward Worcester Polytechnic Institute EDBT 2010, Submitted 1 A Unified Framework Supporting Interactive.
Performance and Energy Efficiency Evaluation of Big Data Systems Presented by Yingjie Shi Institute of Computing Technology, CAS
CS851 – Biological Computing February 6, 2003 Nathanael Paul Randomness in Cellular Automata.
Fault Isolation in Multicast Trees Anoop Reddy, Ramesh Govindan, Deborah Estrin. USC/Information Sciences Institute Presented by: Srikrishna Gurugubelli.
The Concept of the “CLIOS Process”: Integrating the study of physical and policy systems using Mexico City as an example Presentation to the Engineering.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology ACM SIGMOD1 Subsequence Matching on Structured Time Series.
Transforming Policies into Mechanisms with Infokernel Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Nathan C. Burnett, Timothy E. Denehy, Thomas J.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
1 Cache-Oblivious Query Processing Bingsheng He, Qiong Luo {saven, Department of Computer Science & Engineering Hong Kong University of.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Best detection scheme achieves 100% hit detection with
Accurate WiFi Packet Delivery Rate Estimation and Applications Owais Khan and Lili Qiu. The University of Texas at Austin 1 Infocom 2016, San Francisco.
Maestro AI Vision and Design Overview Definitions Maestro: A naïve Sensorimotor Engine prototype. Sensorimotor Engine: Combining sensory and motor functions.
Why Intelligent Data Analysis? Joost N. Kok Leiden Institute of Advanced Computer Science Universiteit Leiden.
Recommender Systems Session I
Grant Number: IIS Institution of PI: WPI PIs: Matthew O
Presentation transcript:

A Strategy Selection Framework for Adaptive Prefetching in Visual Exploration Punit R. Doshi, Geraldine E. Rosario, Elke A. Rundensteiner, and Matthew O. Ward Computer Science Department Worcester Polytechnic Institute Supported by NSF grant IIS Presented at SSDBM2003, July 10, 2003.

2 Motivation Why visually explore data? –Ever increasing data set sizes make data exploration infeasible –Possible solution: Interactive Data Visualization -- humans can detect certain patterns better and faster than data mining tools Why cache and prefetch? –Interactive visualization tools do not scale well, yet we need real-time response

3 Data Hierarchy Flat Display Hierarchical Display Example Visual Exploration Tool: XmdvTool

4 Structure-Based Brush2Parallel Coordinates (Linked with Brush2) Roll-Up: Structure-Based Brush1Parallel Coordinates (Linked with Brush1) Drill Down:

5 Characteristics of a Visualization Environment Exploited for Prefetching Locality of exploration Contiguity of user movements Idle time due to user viewing display Move left/right Move up/down

6 Overview of Prefetching Locality of exploration Contiguity of user movements Idle time due to user viewing display New user query Idle time Prefetchin g Cache DB User’s next request can be predicted with high accuracy Time to prefetch Fetchin g

7 (m-1)m(m+1) Direction Strategy Random Strategy 1/4 Static Prefetching Strategies

8 Drawbacks of Static Prefetching Lacks a feedback mechanism Different users have different exploration patterns A user’s pattern may be changing within same session  Generates predictions independent of past performance.  No single strategy will work best for all users.  A single strategy may not be sufficient within one user session. This calls for Adaptive Prefetching – changing prediction behavior in response to changing data access patterns.

9 Types of Adaptive Prefetching Fine tuning one strategy: –Change parameter values of one strategy over time depending on past performance Strategy selection among several strategies: –Given a set of strategies, allow the choice of strategy to change over time within same session, depending on past performance

10 Strategy Selection Requirements for strategy selection: 1.Set of strategies to select from 2.Performance measures 3.Fitness function 4.Strategy selection policy

11 Set of Strategies & Performance Measures Strategy #Correctly Predicted #Not Predicted #Mis- Predicted No Prefetch Random Direction Performance measures Strategies YesNo Yes Correctly predicted Mis-predicted No Not predicted Required by user Predicted by prefetcher

12 Fitness Function Strategy#Correctly Predicted #Not Predicted #Mis- Predicted Local Avg. Mis- Classification Cost No Prefetch Random Direction Other fitness functions: global average misclass. cost local average response time global average response time Fitness function Cost of No prediction Cost of Mis-prediction

13 Fitness Function Definitions Global Average: Local Average (using exponential smoothing):

14 Strategy Selection Policy Strategy selection policies: 1.Best 2.Proportionate Strategy#Correctly Predicted #Not Predicted #Mis- Predicted Local Avg. Mis- Classification Cost No Prefetch Random Direction Overall

15 Performance Evaluation Setup – XmdvTool as testbed 14 real user traces analyzed User traces were analyzed for: Tendency to move in the same direction Frequency of movement Size of sample focused on 3 user types: random-starers, indeterminates, directional- movers We will show: Detailed analysis and results for 2 user traces Summary results for all user types

16 Directional User: Navigation Patterns Over Time Ave 73% directional Ave 70 queries/min Navigation pattern changes over time

17 Directional User: Navigation Patterns Over Time Move up or down then move left to right to left

18 Directional User: Directional prefetcher is best Selection matched more directional navigation pattern. Any kind of prefetching is better than none.

19 … but SelectBest is even better SelectBest chose Directional & No-Prefetching No-Prefetching selected when #queries/min is high & %dir is low.

20 Directional User: Other performance measures Misclassification cost = trade-off between %NP & %MP. SelectBest gave low %NP and high %MP.

21 Directional User: Other performance measures SelectBest gave best %CP & response time but this will not always be the case. Choice of fitness function is important.

22 Ave 50% directional Ave 40 queries/min Pattern changes over time Move left then perturb up & down. Move right then perturb up & down. Indeterminate User: Navigation Patterns Over Time

23 Indeterminate User: SelectBest is better SelectBest chose Random & No-Prefetching No-Prefetching selected when #queries/min is high & %dir is low.

24 Summary Across All User Types Experiments repeated 3x and averaged. Reduced prediction error for random- starters and directional-movers. No improvement in response time.

25 Related Work Adaptive Prefetching – Strategy Refinement - Davidson98, Tcheun97, Curewitz93, Kroeger96, Palpanas99 Learning - Agrawal95, Swaminathan00 Adaptation Concepts – Mitchell99, Waldspurger94, Avnur00 Performance Measures – Joseph97,Weiss25, Mitchell99 Database support for Interactive Applications – Stolte02, Tioga96

26 Observations Prefetching is better than no prefetching Different users have different navigation patterns, same user has varying navigation patterns within same session No single prefetcher works best in all cases Strategy selection allows prefetcher to adapt Performance of strategy selection depends on fitness function being optimized

27 Contributions The first to study adaptive prefetching in the context of visual data exploration A proposed framework for adaptive prefetching via strategy selection, as opposed to common approach of strategy refinement Empirical results showing benefits of strategy selection over a wide range of user navigation traces

28 That’s all folks XmdvTool Homepage: Code is free for research and education. Contact author: