“Exploring High-D Spaces with Multiform Matrices and Small Multiples” Mudit Agrawal Nathaniel Ayewah MacEachren, A., Dai, X., Hardisty, F., Guo, D., and.

Presentation transcript:

“Exploring High-D Spaces with Multiform Matrices and Small Multiples”
Mudit Agrawal, Nathaniel Ayewah
MacEachren, A., Dai, X., Hardisty, F., Guo, D., and Lengerich, G. Proc. IEEE Symposium on Information Visualization (2003), 31–38.

The Plan
- Motivation
- Contribution
- Analysis Methods
- GeoVISTA Studio
- Conclusions

Motivation
- Discover multivariate relationships
- Examine data from multiple perspectives
DATA → INFORMATION

Contribution
Visual analysis of multivariate data:
- Combinations of scatterplots, bivariate maps and space-filling displays
- Conditional entropy to identify interesting variables in a data set, and to order the variables to show more information
- Dynamic query/filtering called Conditioning

GeoVISTA Studio
- Back-end: Design Box. Building of applications using visual programming tools.
- Front-end: GUI Box. Visualizing data using the developed designs.
Source: GeoVISTA Studio

Analysis Methods

Analysis Methods: Sorting
Nested sorting: sort a table on selected attributes, to understand the relationships between the sorted variables and the rest.
Permutation matrix:
- Cell values are replaced by a graphical depiction of the value.
- Rows/columns can be sorted to search for related entities.
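
A minimal sketch of nested sorting, assuming the table is a list of tuples. The column meanings, the sample rows and the helper name `nested_sort` are hypothetical and are not taken from the paper or from GeoVISTA Studio.

```python
# Hypothetical rows: (region, income_level, rate). Values are made up for illustration.
rows = [
    ("PA", "high", 3.1),
    ("OH", "low", 4.0),
    ("PA", "low", 2.5),
    ("OH", "high", 1.8),
]


def nested_sort(table, key_indices):
    """Sort by the first selected column, break ties with the next one, and so on."""
    return sorted(table, key=lambda row: tuple(row[i] for i in key_indices))


# Sort on column 0 (region), then on column 2 (rate) within each region.
by_region_then_rate = nested_sort(rows, [0, 2])
for row in by_region_then_rate:
    print(row)
```

Reading the unsorted column (here `income_level`) down the sorted table is what lets a viewer look for a relationship between the sorted variables and the rest.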

Analysis Methods: Sorting
Augmented seriation:
- Organizing a set of objects along a single dimension using multimodal multimedia
Correlation matrices
Reorderable matrices:
- Simple interactive visualization artifact for tabular data
Source: (Siirtola, 1999)

Analysis Methods: Space-filling visualization
- Sunburst methods
- Mosaic plot
- Pixel-oriented methods
Sources: (Keim, 1996), (Schedl, 2006), (Young, 1999)

Analysis Methods: Small Multiples
A set of juxtaposed data representations that together support understanding of multivariate information.
Figure: Multiform Bivariate Small Multiple. Source: (MacEachren, 2003)

Analysis Methods
Figure: Multiform Bivariate Matrix. Source: (MacEachren, 2003)

GeoVISTA Studio

Demonstration: Basic Demo
- Application construction
- Scatterplot, Geomap
- Dynamic linking, eccentric labeling etc.

Dealing with High Dimensionality

High Dimensionality: Interactive Feature Selection
- Guo, D., Coordinating Computational and Visualization Approaches for Interactive Feature Selection and Multivariate Clustering. Information Visualization 2(4):

High Dimensionality: “Goodness of Clustering”
- high coverage
- high density
- high dependence
E.g.
- Correlation
- Chi-squared
- Conditional Entropy

Conditional Entropy
Discretize two dimensions into intervals
- Nested Means
Source: (Guo, 2003)
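
A sketch of nested-means discretization as it is commonly defined: split the values at their mean, then split each half at its own mean, and repeat. The slide only names the technique, so the recursion depth, the tie handling (values equal to a break go to the upper interval) and the function names are assumptions.

```python
from bisect import bisect_right
from statistics import mean


def nested_means_breaks(values, depth):
    """Return sorted break points: the mean, then the mean of each half, recursively."""
    if depth == 0 or len(values) < 2:
        return []
    m = mean(values)
    lower = [v for v in values if v < m]
    upper = [v for v in values if v >= m]
    if not lower or not upper:  # all values are equal, nothing left to split
        return []
    return (
        nested_means_breaks(lower, depth - 1)
        + [m]
        + nested_means_breaks(upper, depth - 1)
    )


def discretize(values, depth=2):
    """Map each value to an interval index (up to 2**depth intervals)."""
    breaks = nested_means_breaks(values, depth)
    return [bisect_right(breaks, v) for v in values]


sample = [1, 2, 3, 4, 10, 20, 30, 40]  # made-up values for illustration
print(nested_means_breaks(sample, 2))
print(discretize(sample, 2))
```

Applying `discretize` to each of two dimensions gives every record a pair of interval labels, which is the cell grid a conditional entropy can then be computed over.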

Conditional Entropy Source: (Guo, 2003)
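
The slides do not give a formula, so the sketch below uses the textbook conditional entropy H(Y | X) = -Σ p(x, y) log2(p(x, y) / p(x)) over interval labels. Guo (2003) may define a variant of this measure; the function name and the sample labels are assumptions for illustration only.

```python
from collections import Counter
from math import log2


def conditional_entropy(xs, ys):
    """H(Y | X) in bits for two equally long sequences of interval labels."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts per (x interval, y interval) cell
    x_counts = Counter(xs)        # counts per x interval
    return -sum((c / n) * log2(c / x_counts[x]) for (x, _), c in joint.items())


# Y fully determined by X: no remaining uncertainty.
dependent = conditional_entropy([0, 0, 1, 1], [0, 0, 1, 1])
# Y independent of X with two equally likely values: one bit of uncertainty.
independent = conditional_entropy([0, 0, 1, 1], [0, 1, 0, 1])
print(dependent, independent)
```

A low value means that knowing the interval of one dimension leaves little uncertainty about the other, which is why it can serve as a measure of how strongly two dimensions are related.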

Ordering Dimensions
Related dimensions should be close together
Sort By: Conditional Entropy
Sort Method: Minimum Spanning Tree

Unsorted matrix:
     A    B    C    D
A    -    5   16    9
B    5    -   15   21
C   16   15    -    4
D    9   21    4    -

Ordering: B A D C
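
A sketch of how the ordering on this slide can be reproduced from its pairwise values (A-B 5, A-C 16, A-D 9, B-C 15, B-D 21, C-D 4). The slide names only "Minimum Spanning Tree" as the sort method. Building the tree with Kruskal's algorithm and reading the order with a depth-first walk from a leaf are assumptions, as is treating smaller values as "more related".

```python
def mst_ordering(names, dist):
    """Order dimensions by walking a minimum spanning tree of their pairwise values.

    Assumes at least two dimensions and a value for every pair.
    """
    parent = {n: n for n in names}

    def find(n):
        # Union-find root lookup with path halving.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    # Kruskal: take edges from smallest to largest, skipping any that close a cycle.
    adjacency = {n: [] for n in names}
    for (a, b), w in sorted(dist.items(), key=lambda kv: kv[1]):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb
            adjacency[a].append((w, b))
            adjacency[b].append((w, a))

    # Depth-first walk from a leaf, following the lightest edge first.
    start = min(n for n in names if len(adjacency[n]) == 1)
    order, stack, seen = [], [start], set()
    while stack:
        node = stack.pop()
        if node in seen:
            continue
        seen.add(node)
        order.append(node)
        for _, nxt in sorted(adjacency[node], reverse=True):
            if nxt not in seen:
                stack.append(nxt)
    return order


names = ["A", "B", "C", "D"]
dist = {
    ("A", "B"): 5, ("A", "C"): 16, ("A", "D"): 9,
    ("B", "C"): 15, ("B", "D"): 21, ("C", "D"): 4,
}
order = mst_ordering(names, dist)
print(" ".join(order))
```

For these values the tree keeps the edges C-D (4), A-B (5) and A-D (9), which form a single path, so the walk yields the same sequence as the slide. When the tree branches, the walk still returns an ordering, but adjacent entries are no longer guaranteed to share a tree edge.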

Demonstration: Advanced Demo
- Interactive Feature Selection
- PCP, SOM, Matrix
- Conditioning

Conclusions
Strengths
- Dynamic linking of different representations
- Visualizing clusters of dimensions
- Rich and extensible toolbox
Weaknesses
- Usability
- Arrangement of windows

References
Guo, D. (2003). Coordinating Computational and Visualization Approaches for Interactive Feature Selection and Multivariate Clustering. Information Visualization 2(4):
Keim, D. (1996). Pixel-oriented Visualization Techniques for Exploring Very Large Databases. Journal of Computational and Graphical Statistics.
Schedl, M. (2006). CoMIRVA: Collection of Music Information Retrieval and Visualization Applications. Website.
Siirtola, H. (1999). Interaction with the Reorderable Matrix. In E. Banissi, F. Khosrowshahi, M. Sarfraz, E. Tatham, and A. Ursyn, editors, Information Visualization IV '99, pages Proceedings International Conference on Information Visualization.
Young, F. (1999). Frequency Distribution Graphs (Visualizations) for Category Variables. Unpublished.