Multivariate Data Visualization Adapted from Slides by: Matthew O. Ward Computer Science Department Worcester Polytechnic Institute This work was supported.

Slides:



Advertisements
Similar presentations
TorusVis ND : Unraveling High- Dimensional Torus Networks for Network Traffic Visualizations Shenghui Cheng, Pradipta De, Shaofeng H.-C. Jiang* and Klaus.
Advertisements

Visualization and Cluster
Mapping Nominal Values to Numbers for Effective Visualization Presented by Matthew O. Ward Geraldine Rosario, Elke Rundensteiner, David Brown, Matthew.
Visualizing and Exploring Data Summary statistics for data (mean, median, mode, quartile, variance, skewnes) Distribution of values for single variables.
Geog 409: Advanced Spatial Analysis & Modelling © J.M. Piwowar1Modelling Essentials Model Parsimony vs. Model Simplicity.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
Lecture Notes for Chapter 2 Introduction to Data Mining
Multivariate Methods Pattern Recognition and Hypothesis Testing.
1 This work partially funded by NSF Grants IIS , IRIS and IIS Matthew O. Ward, Elke A. Rundensteiner, Jing Yang, Punit Doshi, Geraldine.
WPI Center for Research in Exploratory Data and Information Analysis CREDIA SC4DEVO-1, July 12-15, 2004 Interactive Visual Exploration of Multivariate.
Visualization and Data Mining. 2 Outline  Graphical excellence and lie factor  Representing data in 1,2, and 3-D  Representing data in 4+ dimensions.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
Evaluating the Quality of Image Synthesis and Analysis Techniques Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
Multivariate and High Dimensional Visualizations Robert Herring.
Information Visualization Chapter 1 - Continued. Reference Model Visualization: Mapping from data to visual form Raw DataData Tables Visual Structures.
UC Berkeley, 09/19/00 An Introduction to Multivariate Data Visualization and XmdvTool Matthew O. Ward Computer Science Department Worcester Polytechnic.
Visual Analytics and the Geometry of Thought— Spatial Intelligence through Sapient Interfaces Alexander Klippel & Frank Hardisty Department of Geography,
1 A Rank-by-Feature Framework for Interactive Exploration of Multidimensional Data Jinwook Seo, Ben Shneiderman University of Maryland Hyun Young Song.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
WPI Center for Research in Exploratory Data and Information Analysis From Data to Knowledge: Exploring Industrial, Scientific, and Commercial Databases.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
NERCOMP Workshop, Dec. 2, 2008 Information Visualization: the Other Half of Data Analysis Dr. Matthew Ward Computer Science Department Worcester Polytechnic.
By LaBRI – INRIA Information Visualization Team. Tulip 2010 – version Tulip is an information visualization framework dedicated to the analysis.
Information Design and Visualization
DATA MINING from data to information Ronald Westra Dep. Mathematics Knowledge Engineering Maastricht University.
Basic concepts in ordination
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
Digital Image Fundamentals II 1.Image modeling and representations 2.Pixels and Pixel relations 3.Arithmetic operations of images 4.Image geometry operation.
Visual Perspectives iPLANT Visual Analytics Workshop November 5-6, 2009 ;lk Visual Analytics Bernice Rogowitz Greg Abram.
Enhancing Interactive Visual Data Analysis by Statistical Functionality Jürgen Platzer VRVis Research Center Vienna, Austria.
Multidimensional Scaling Vuokko Vuori Based on: Data Exploration Using Self-Organizing Maps, Samuel Kaski, Ph.D. Thesis, 1997 Multivariate Statistical.
Visualizing Tabular Data CS 4390/5390 Data Visualization Shirley Moore, Instructor September 29,
Copyright © 2005, Pearson Education, Inc. Slides from resources for: Designing the User Interface 4th Edition by Ben Shneiderman & Catherine Plaisant Slides.
The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus+Context Visualization for Tabular Information Ramana Rao and Stuart.
1 Data Mining: Data Lecture Notes for Chapter 2. 2 What is Data? l Collection of data objects and their attributes l An attribute is a property or characteristic.
Graph Visualization and Beyond … Anne Denton, April 4, 2003 Including material from a paper by Ivan Herman, Guy Melançon, and M. Scott Marshall.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001.
Visualization Techniques for Multivariate Discrete and Continuous Data March 4, 2005 Rachael Brady.
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
Polaris: A System for Query, Analysis and Visualization of Multi- dimensional Relational Database by Chris Stolte & Pat Hanrahan presenter Andrew Trieu.
16.1 Vis_2002 Data Visualization Lecture 14 Information Visualization Part 1.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Uncovering Clusters in Crowded Parallel Coordinates Visualizations Alimir Olivettr Artero, Maria Cristina Ferreiara de Oliveira, Haim levkowitz Information.
Lucent Technologies - Proprietary 1 Interactive Pattern Discovery with Mirage Mirage uses exploratory visualization, intuitive graphical operations to.
Visual Correlation Analysis of Numerical and Categorical Data on the Correlation Map Zhiyuan Zhang, Kevin T. McDonnell, Erez Zadok, Klaus Mueller.
1 Mining Images of Material Nanostructure Data Aparna S. Varde, Jianyu Liang, Elke A. Rundensteiner and Richard D. Sisson Jr. ICDCIT December 2006 Bhubaneswar,
Data Visualization.
Affine Registration in R m 5. The matching function allows to define tentative correspondences and a RANSAC-like algorithm can be used to estimate the.
3/13/2016 Data Mining 1 Lecture 2-1 Data Exploration: Understanding Data Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB)
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Visualization Design Principles cs5984: Information Visualization Chris North.
Applied Cartography and Introduction to GIS GEOG 2017 EL Lecture-5 Chapters 9 and 10.
Cluster Analysis This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul, Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed under a Creative Commons.
Data Mining – Intro.
Mining Dynamics of Data Streams in Multi-Dimensional Space
Overview Identify similarities present in biological sequences and present them in a comprehensible manner to the biologists Objective Capturing Similarity.
Data Mining: Exploring Data
Grant Number: IIS Institution of PI: WPI PIs: Matthew O
Information Design and Visualization
cs5984: Information Visualization Chris North
Dimension reduction : PCA and Clustering
Information Visualization (Part 1)
Multidimensional Scaling
Multidimensional Space,
An Introduction to Multivariate Data Visualization and XmdvTool
Group 9 – Data Mining: Data
Data Pre-processing Lecture Notes for Chapter 2
Data exploration and visualization
Comp 15 - Usability & Human Factors
Presentation transcript:

Multivariate Data Visualization Adapted from Slides by: Matthew O. Ward Computer Science Department Worcester Polytechnic Institute This work was supported under NSF Grant IIS

What is Multivariate Data? zEach data point has N variables or observations zEach observation can be: y nominal or ordinal ydiscrete or continuous yscalar, vector, or tensor zMay or may not have spatial, temporal, or other connectivity attribute

Characteristics of a Variable zOrder: grades have an order, brand names do not. zDistance metric: for income, distance equals difference. For rankings, difference is not a distance metric. zA variable can be classified by these three attributes, called Scale. zEffective visualizations attempt to match the scale of the data dimension with the graphical attribute conveying it.

Sources of Multivariate Data zSensors (e.g., images, gauges) zSimulations zCensus or other surveys zCommerce (e.g., stock market) zCommunication systems zSpreadsheets and databases

Issues in Visualizing Multivariate Data zHow many variables? zHow many records? zTypes of variables? zUser task (exploration, confirmation, presentation) zData feature of interest (clusters, anomalies, trends, patterns, ….) zBackground of user (domain expert, visualization specialist, decision-maker, ….)

Methods for Visualizing Multivariate Data zDimensional Subsetting zDimensional Reorganization zDimensional Reduction

Dimensional Subsetting zScatterplot matrix displays all pairwise plots zSelection allows linkage between views zClusters, trends, and correlations readily discerned between pairs of dimensions

Dimensional Reorganization zParallel Coordinates creates parallel, rather than orthogonal, dimensions. zData point corresponds to polyline across axes zClusters, trends, and anomalies discernable as groupings or outliers, based on intercepts and slopes

Dimensional Reorganization zGlyphs map data dimensions to graphical attributes zSize, color, shape, and orientation are commonly used zSimilarities/differences in features give insights into relations

Dimensional Reduction zMap N-D locations to M-D display space while best preserving N-D relations zApproaches include MDS, PCA, and Kohonen Self Organizing Maps zRelationships conveyed by position, links, color, shape, size, etc.

The Role of Selection zUser needs to interact with display, examine interesting patterns or anomalies, validate hypotheses zSelection allows isolation of subset of data for highlighting, deleting, focussed analysis zDirect (clicking on displayed items ) vs. indirect (range sliders) zScreen space (2-D) vs. data space (N-D)

Auxiliary Tools zExtent scaling to reduce occlusion of bands zDimensional zooming - fill display with selected subspace (N-D distortion) zDynamic masking to fade out selected or unselected data zSaving selected subsets zEnabling/disabling dimensions zUnivariate displays (Tukey box plots, tree maps)