1 This work partially funded by NSF Grants IIS-9732897, IRIS-9729878 and IIS-0119276 Matthew O. Ward, Elke A. Rundensteiner, Jing Yang, Punit Doshi, Geraldine.

Slides:



Advertisements
Similar presentations
Di Yang, Elke A. Rundensteiner and Matthew O. Ward Worcester Polytechnic Institute VLDB 2009, Lyon, France 1 A Shared Execution Strategy for Multiple Pattern.
Advertisements

TorusVis ND : Unraveling High- Dimensional Torus Networks for Network Traffic Visualizations Shenghui Cheng, Pradipta De, Shaofeng H.-C. Jiang* and Klaus.
VALTChessVA IntroAppsWrap-up 1/25 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science.
Matthew O. Ward Professor Computer Science WPI data and information visualization, visual analytics Selected Publications:  Matthew O. Ward, "XmdvTool:
Rolling the Dice: Multidimensional Visual Exploration using Scatterplot Matrix Navigation 1 Niklas Elmqvist | Purdue University Pierre Dragicevic | INRIA.
Mapping Nominal Values to Numbers for Effective Visualization Presented by Matthew O. Ward Geraldine Rosario, Elke Rundensteiner, David Brown, Matthew.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
Two Technique Papers on High Dimensionality Allan Rempel December 5, 2005.
Performance Analysis And Visualization By:Mehdi Semsarzadeh Chapter 15.
Interactive Pattern Search in Time Series (Using TimeSearcher 2) Paolo Buono, Aleks Aris, Catherine Plaisant, Amir Khella, and Ben Shneiderman Proceedings,
Dimensionality Reduction with Linear Transformations project update by Mingyue Tan March 17, 2004.
1 Murali Mani Topics projects in databases and web applications and XML Database Systems Research Lab @cs.wpi.eduWebpages:
Thank you for coming here!. Purpose of Experiment Compare two visualization systems. You will play with one of them.
Visualization of Multidimensional Multivariate Large Dataset Presented by: Zhijian Pan University of Maryland.
WPI Center for Research in Exploratory Data and Information Analysis CREDIA SC4DEVO-1, July 12-15, 2004 Interactive Visual Exploration of Multivariate.
HDDVis: An Interactive Tool for High Dimensional Data Visualization by Mingyue Tan April 21st, 2004.
ShadyStats Project Update Mike Cora November 16, C: Information Visualization.
Multivariate Data Visualization Adapted from Slides by: Matthew O. Ward Computer Science Department Worcester Polytechnic Institute This work was supported.
“Exploring High-D Spaces with Multiform Matrices and Small Multiples” Mudit Agrawal Nathaniel Ayewah MacEachren, A., Dai, X., Hardisty, F., Guo, D., and.
Scalability for High Cardinality in Steerable MDS CPSC 533C Project Presentation Allan Rempel December 19, 2005.
The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus + Context Visualization for Tabular Information R. Rao and S. K.
Glyphs Presented by Bertrand Low. Presentation Overview A Taxonomy of Glyph Placement Strategies for Multidimensional Data VisualizationA Taxonomy of.
Evaluating the Quality of Image Synthesis and Analysis Techniques Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
1 Murali Mani Topics projects in databases and web applications and XML Database Systems Research Lab @cs.wpi.eduWebpages:
Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer) Jinwook Seo and Ben Shneiderman Human-Computer Interaction.
ShadyStats Final Report Mike Cora December 19, C: Information Visualization.
A Strategy Selection Framework for Adaptive Prefetching in Visual Exploration Punit R. Doshi, Geraldine E. Rosario, Elke A. Rundensteiner, and Matthew.
Project Update: Law Enforcement Resource Allocation (LERA) Visualization System Michael Welsman-Dinelle April Webster.
UC Berkeley, 09/19/00 An Introduction to Multivariate Data Visualization and XmdvTool Matthew O. Ward Computer Science Department Worcester Polytechnic.
Prefetching for Visual Data Exploration Punit R. Doshi, Elke A. Rundensteiner, Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
WPI Center for Research in Exploratory Data and Information Analysis From Data to Knowledge: Exploring Industrial, Scientific, and Commercial Databases.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
NERCOMP Workshop, Dec. 2, 2008 Information Visualization: the Other Half of Data Analysis Dr. Matthew Ward Computer Science Department Worcester Polytechnic.
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
Theory and practice of computer technologies used for creating DEA software Eugene P. MORGUNOV Siberian State Aerospace University Krasnoyarsk, Russia.
Space/Order Quanzhen Geng (Master of Software Systems Program) January 27, 2003 CS-533C Reading Presentation.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Glyph Visualization and Yet Another Tree Visualization Matt Williams InfoVis 533c April 3, 2003.
A Statistical Approach to Speed Up Ranking/Re-Ranking Hong-Ming Chen Advisor: Professor Shih-Fu Chang.
Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia.
Visual Perspectives iPLANT Visual Analytics Workshop November 5-6, 2009 ;lk Visual Analytics Bernice Rogowitz Greg Abram.
Opinion to ponder… “ Since we are a visual species (especially the American culture), because of our educational system. Many of the tools currently used.
Intuitive Database Query System, Zooming Query Results Previews Drawing upon existing literature on zooming interface technology, intuitive navigation.
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus+Context Visualization for Tabular Information Ramana Rao and Stuart.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
Cluster Sorting, Analysis and 3D-Display For the Mapping of the Dragonfly’s Neuro- Network Advisor: Prof. Hannay Client: Prof. Olberg (Biology )
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
Pad++: A Zooming Graphical Interface for Exploring Alternate Interface Physics Presented By: Daniel Loewus-Deitch.
16.1 Vis_2002 Data Visualization Lecture 14 Information Visualization Part 1.
Visualizing Large Dynamic Digraphs Michael Burch.
Visualization in Problem Solving Environments Amit Goel Department of Computer Science Virginia Tech June 14, 1999.
Di Yang, Zhengyu Guo, Elke A. Rundensteiner and Matthew O. Ward Worcester Polytechnic Institute EDBT 2010, Submitted 1 A Unified Framework Supporting Interactive.
Uncovering Clusters in Crowded Parallel Coordinates Visualizations Alimir Olivettr Artero, Maria Cristina Ferreiara de Oliveira, Haim levkowitz Information.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 Mining Images of Material Nanostructure Data Aparna S. Varde, Jianyu Liang, Elke A. Rundensteiner and Richard D. Sisson Jr. ICDCIT December 2006 Bhubaneswar,
Storage Systems CSE 598d, Spring 2007 OS Support for DB Management DB File System April 3, 2007 Mark Johnson.
InfoVis Cyberinfrastructure Shashikant Penumarthy, Bruce Herr & Katy Börner School of Library and Information Science sprao | bherr
Geant4 User Workshop 15, 2002 Lassi A. Tuura, Northeastern University IGUANA Overview Lassi A. Tuura Northeastern University,
Mulidimensional Detective “Multidimensional” : multivariate, many parameters “Detective” : focus is on the “discovery process”, finding patterns and trends.
Database management system Data analytics system:
CSC420 Showing Complex Data.
Visualization of Web Search Results in 3D
CSc4730/6730 Scientific Visualization
Grant Number: IIS Institution of PI: WPI PIs: Matthew O
Information Design and Visualization
An Introduction to Multivariate Data Visualization and XmdvTool
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

1 This work partially funded by NSF Grants IIS , IRIS and IIS Matthew O. Ward, Elke A. Rundensteiner, Jing Yang, Punit Doshi, Geraldine Rosario, Allen R. Martin, Ying-Huey Fua, Daniel Stroe XmdvTool Interactive Visual Data Exploration System for High-dimensional Data Sets Worcester Polytechnic Institute

2 XmdvTool Features Hierarchical visualization and interaction tools for exploring very large high-dimensional data sets to discover patterns, trends and outliers Applications: Bioterrorism Detection Bioinformatics and Drug Discovery Space Science Geology and Geochemistry Systems Monitoring and Performance Evaluation Economics and Business Simulation Design and Analysis Multi-platform support (Unix, Linux, Windows) Public domain software:

3 Scale-up to High Dimensions: Visual Hierarchical Dimension Reduction Scale-up to Large Data Sets: Interactive Hierarchical Displays, Database Backend with Minmax Encoding, Semantic Caching and Adaptive Prefetching Interlinked Multi-Displays: Parallel Coordinates, Glyphs, Scatterplot Matrices, Dimensional Stacking Visual Interaction Tools: N-Dimensional Brushes, Structure-Based Brushing, InterRing Xmdv: Main Features

4 Scale-Up for Large Number of Dimensions Solution to High Dimensional Datasets: Group Similar Dimensions into Dimension Hierarchy Navigate Dimension Hierarchy by InterRing Form Lower Dimensional Spaces by Dimension Clusters Convey Dimension Cluster Information by Dissimilarity Display

5 Visual Hierarchical Dimension Reduction Process

6 A 42-dimensional Data Set Dimension Hierarchy Interaction Tool: InterRing A 4-Dimensional Subspace Visual Hierarchical Dimension Reduction Process

7 InterRing - Dimension Hierarchy Navigation and Manipulation Roll-up/Drill-down Rotate Zoom in/out Distort Modify

8 Dissimilarity Display Three Axes Method Mean-Band Method Diagonal Plot Method Axis Width Method

9 Scale-up for Large Number of Records Solution to Large Scale Datasets: Group Similar Records into Data Hierarchy Navigate Data Hierarchy by Structure-Based Brushing Represent Data Clusters by Mean-Band Method Provide Database Backend Support using MinMax Tree, Caching, Prefetching

10 2D example Interactive Hierarchical Display Hierarchical ClusteringStructure-Based Brushing

11 Flat Display Hierarchical Display Interactive Hierarchical Display Mean-Band Method in Parallel Coordinates

12 Flat Display Hierarchical Display Mean-Band Method in Parallel Coordinates Interactive Hierarchical Display

13 Scalability of Data Access Approach Attach database system to visualization front-end MinMax hierarchy encoding Key idea: avoid recursive processing Pre-computed Caching Key idea: reduce response time and network traffic Prefetching application hints and predict user patternsKey idea: use application hints and predict user patterns Performed during idle timePerformed during idle time

14 Pre-compute object positions –level-of-detail (L) –extent values (x,y) –preserve tree structure New query semantics –objects are now rectangles –select objects that touch L –select objects that touch (x, y) –structure-based brush = intersection of two selections Scalability of Data Access: MinMax Hierarchy Encoding level of detail extent values L xy query = (x, y, L) xy L

15 Purpose reduce response time and network traffic Issues visual query cannot directly translate into object IDs  high-level cache specification to avoid complete scans Semantic caching queries are cached rather than objects minimize cost of cache lookup dynamically adapt cached queries to patterns of queries Scalability of Data Access: Caching

16 Strategy –Speculative (no specific hints) local –navigation remains local user data set –both user and data set influence exploration –Adaptive (strategy changes over time) –Evolves as more knowledge becomes available –Non-pure (interruptible prefetching) consistent –leave buffer in consistent state Requirements –non-pure prefetching + large transactions & small object size + semantic caching  small granularity (object level) –speculative, non-pure prefetcher  cache replacement policy + guessing method Scalability of Data Access: Prefetching

17 Conclusions:  Caching reduces response time by 80%  Prefetching further reduces response time by 30%  Designing better prefetching strategies might help further reduce response time Scalability of Data Access: Experimental Evaluation

18 Random Random Strategy (m-1)m(m+1) Direction Direction Strategy Hot Regions Current Navigation Window Focus Focus Strategy m(n-2) m(n-1) m(n) m(n+1) Mean Mean Strategy m(n-2) m(n-1) m(n) m(n+1) Exponential Weight Average Exponential Weight Average Strategy Vector Vector Strategies Data Set Driven Data Set Driven Strategy Localized Speculative Localized Speculative Strategies Scalability of Data Access: Prefetching

19 Xmdv System Implementation Tools –C/C++ –TCL/TK –OpenGL –Oracle 8i –Pro*C User MinMax Labeling Schema Info Hierarchical Data Rewriter Translator Loader Buffer Queries GUI OFF-LINE PROCESS Estimator Exploration Variables DB ON-LINE PROCESS MEMORY Flat Data Prefetcher Library: Random Direction Focus EWA Mean DB Buffer

20 Publications (available at Jing Yang, Matthew O. Ward and Elke A. Rundensteiner, "InterRing: An Interactive Tool for Visually Navigating and Manipulating Hierarchical Structures", InfoVis 2002, to appear Punit R. Doshi, Elke A. Rundensteiner, Matthew O. Ward and Daniel Stroe, “Prefetching For Visual Data Exploration.” Technical Report #: WPI-CS-TR-02-07, 2002 Jing Yang, Matthew O. Ward and Elke A. Rundensteiner, “Interactive Hierarchical Displays: A General Framework for Visualization and Exploration of Large Multivariate Data Sets”, Computers and Graphics Journal, 2002, to appear Daniel Stroe, Elke A. Rundensteiner and Matthew O. Ward, “Scalable Visual Hierarchy Exploration”, Database and Expert Systems Applications, pages , Sept Ying-Huey Fua, Matthew O. Ward and Elke A. Rundensteiner, “Hierarchical Parallel Coordinates for Exploration of LargeDatasets”, IEEE Proc. of Visualization, pages 43-50, Oct Ying-Huey Fua, Matthew O. Ward and Elke A. Rundensteiner, “Navigating Hierarchies with Structure-Based Brushes”, IEEE Proceedings of Visualization, pages 43-50, Oct. 1999