16.1 Vis_2002 Data Visualization Lecture 14 Information Visualization Part 1.

Slides:



Advertisements
Similar presentations
The Robert Gordon University School of Engineering Dr. Mohamed Amish
Advertisements

1 SDMIV Data Visualization - A Very Rough Guide Ken Brodlie University of Leeds.
1GR2-00 GR2 Advanced Computer Graphics AGR Lecture 9 Adding Realism Through Texture.
13.1 Vis_04 Data Visualization Lecture 13 Information Visualization Part 1.
ENV Envisioning Information Lecture 3 – Multivariate Data Exploration Scatter plots and parallel coordinates Ken Brodlie.
ENV Envisioning Information Lecture 4 – Multivariate Data Exploration Glyphs and other methods Hierarchical approaches Ken Brodlie.
ENV Envisioning Information Lecture 6 – Document Visualization Ken Brodlie
9.1si31_2001 SI31 Advanced Computer Graphics AGR Lecture 9 Adding Realism Through Texture.
Multi-Dimensional Data Visualization
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Multidimensional data processing. Multivariate data consist of several variables for each observation. Actually, serious data is always multivariate.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
LISA Short Course Series Multivariate Analysis in R Liang (Sally) Shan March 3, 2015 LISA: Multivariate Analysis in RMar. 3, 2015.
1 This work partially funded by NSF Grants IIS , IRIS and IIS Matthew O. Ward, Elke A. Rundensteiner, Jing Yang, Punit Doshi, Geraldine.
Thank you for coming here!. Purpose of Experiment Compare two visualization systems. You will play with one of them.
Thank you for coming here!. Purpose of Experiment Compare two visualization systems. You will play with one of them.
Multivariate Data Visualization Adapted from Slides by: Matthew O. Ward Computer Science Department Worcester Polytechnic Institute This work was supported.
Visualization and Data Mining. 2 Outline  Graphical excellence and lie factor  Representing data in 1,2, and 3-D  Representing data in 4+ dimensions.
2.1 Vis_04 Data Visualization Lecture 2 Fundamental Concepts - Reference Model Visualization Techniques – Overview Visualization Systems - Overview.
ENV Envisioning Information Lecture 11 – Scientific Visualization Introduction Scalar 1D Data Ken Brodlie
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
1 18 April 2007 vizNET-LEEDS-PRES A Rough Guide to Data Visualization – Part 2 VizNET 2007 Annual Event Ken Brodlie School of Computing University.
Multivariate and High Dimensional Visualizations Robert Herring.
10/17/071 Read: Ch. 15, GSF Comparing Ecological Communities Part Two: Ordination.
14.1 Vis_04 Data Visualization Lecture 14 Information Visualization : Part 2.
Data Tutorial Tutorial on Types of Graphs Used for Data Analysis, Along with How to Enter Them in MS Excel Carryn Bellomo University of Nevada, Las Vegas.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Lecture 3: Bivariate Data & Linear Regression 1.Introduction 2.Bivariate Data 3.Linear Analysis of Data a)Freehand Linear Fit b)Least Squares Fit c)Interpolation/Extrapolation.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Basic concepts in ordination
Graphite 2004 Statistical Synthesis of Facial Expressions for the Portrayal of Emotion Lisa Gralewski Bristol University United Kingdom
Data Presentation & Graphing Introduction to Mechanical Engineering The University of Texas-Pan American College of Science and Engineering.
8 th Grade Math Common Core Standards. The Number System 8.NS Know that there are numbers that are not rational, and approximate them by rational numbers.
Space/Order Quanzhen Geng (Master of Software Systems Program) January 27, 2003 CS-533C Reading Presentation.
A Scalable Self-organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation Dmitri G. Roussinov Department of.
Robert Kosara, Helwig Hauser 1InfoVis STAR The State of the Art in Information Visualization Robert Kosara, Helwig Hauser.
Language Objective: Students will be able to practice agreeing and disagreeing with partner or small group, interpret and discuss illustrations, identify.
Tan,Steinbach, Kumar: Exploratory Data Analysis (with modifications by Ch. Eick) Data Mining: “New” Teaching Road Map 1. Introduction to Data Mining and.
Introduction to science
Summer Student Program 15 August 2007 Cluster visualization using parallel coordinates representation Bastien Dalla Piazza Supervisor: Olivier Couet.
Carrying out a statistics investigation. A process.
Visualization Techniques for Multivariate Discrete and Continuous Data March 4, 2005 Rachael Brady.
CATA 2010 March 2010 Jewels, Himalayas and Fireworks, Extending Methods for Visualizing N Dimensional Clustering W. Jockheck Dept. of Computer Science.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Scatter Plots Scatter plots are a graphic representation of collated biviariate data via a mathematical diagram using Cartesian coordinates. The data.
Data Mining: Exploring Data
Visual Correlation Analysis of Numerical and Categorical Data on the Correlation Map Zhiyuan Zhang, Kevin T. McDonnell, Erez Zadok, Klaus Mueller.
Section 2.4 Representing Data.
Data Visualization.
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
Principal Component Analysis
Strategies for Success GOOD LUCK!! Strategy 1 Can I plug it in? Can I plug it in?
1 Statistics & R, TiP, 2011/12 Multivariate Methods  Multivariate data  Data display  Principal component analysis Unsupervised learning technique 
3/13/2016 Data Mining 1 Lecture 2-1 Data Exploration: Understanding Data Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB)
Data Mining: Exploring Data
Lindita Camaj Associate professor
Jewels, Himalayas and Fireworks, Extending Methods for
Data Mining: Exploring Data
Data Mining: “New” Teaching Road Map
Principal Component Analysis
Jewels, Himalayas and Fireworks, Extending Methods for
Range, Width, min-max Values and Graphs
Multidimensional Space,
Data Mining: Exploring Data
Data exploration and visualization
Comp 15 - Usability & Human Factors
Presentation transcript:

16.1 Vis_2002 Data Visualization Lecture 14 Information Visualization Part 1

16.2 Vis_2002 What is Visualization? n Generally: – The use of computer-supported, interactive, visual representations of data to amplify cognition Card, McKinlay and Schneiderman n Two ‘branches’: – Scientific Visualization – Information Visualization.. But first… an experiment

16.3 Vis_2002 The Experiment n You need a watch with a second-hand n Without using pencil and paper (or a calculator!!), multiply 72 by 34 n How long did it take? n Now you need pencil and paper as well as watch n Multiply 47 by 54 n How long did it take? n Conclusion?

16.4 Vis_2002 Visualization – Twin Subjects n Scientific Visualization – Visualization of physical data n Information Visualization – Visualization of abstract data Ozone layer around earth Automobile web site - visualizing links … but this is only one characterisation

16.5 Vis_2002 Scientific Visualization – Another Characterisation n Focus is on visualizing an entity measured in a multi-dimensional space – 1D – 2D – 3D – Occasionally nD n Underlying field is recreated from the sampled data n Relationship between variables well understood – some independent, some dependent Image from D. Bartz and M. Meissner

16.6 Vis_2002 Scientific Visualization Model n Visualization represented as pipeline: – Read in data – Build model of underlying entity – Construct a visualization in terms of geometry – Render geometry as image n Realised as modular visualization environment – IRIS Explorer – IBM Open Visualization Data Explorer (DX) – AVS visualizemodeldatarender

16.7 Vis_2002 Information Visualization n Focus is on visualizing set of observations that are multi-variate n Example of iris data set – 150 observations of 4 variables (length, width of petal and sepal) – Techniques aim to display relationships between variables

16.8 Vis_2002 Dataflow for Information Visualization n Again we can express as a dataflow – but emphasis now is on data itself rather than underlying entity n First step is to form the data into a table of observations, each observation being a set of values of the variables n Then we apply a visualization technique as before visualize data table datarender ABC variables observations

16.9 Vis_2002 Applications of Information Visualization n Data Collections – Census data – Astronomical Data – Bioinformatics Data – Supermarket checkout data – and so on – Can relationships be discovered amongst the variables? n Networks of Information – traffic - Web documents – Hierarchies of information (eg filestores) We shall see that all can be described as data tables

16.10 Vis_2002 Multivariate Visualization n Software: – Xmdvtool Matthew Ward n Techniques designed for any number of variables – Scatter plot matrices – Parallel co-ordinates – Glyph techniques Acknowledgement: Many of images in following slides taken from Ward’s work..and also IRIS Explorer!

16.11 Vis_2002 Scatter Plot n Simple technique for 2 variables is the scatter plot This example from NIST shows linear correlation between the variables handbook/eda/section3/ scatterp.htm

16.12 Vis_2002 3D Scatter Plots n There has been some success at extending concept to 3D for visualizing 3 variables XRT/3d

16.13 Vis_2002 Extending to Higher Numbers of Variables n Additional variables can be visualized by colour and shape coding n IRIS Explorer used to visualize data from BMW – Five variables displayed using spatial arrangement for three, colour and object type for others – Notice the clusters… Kraus & Ertl

16.14 Vis_2002 IRIS Explorer 3D Scatter Plots n Try this…. Thanks to:

16.15 Vis_2002 Scatter Plots for M variables n For table data of M variables, we can look at pairs in 2D scatter plots n The pairs can be juxtaposed: A B C C B A With luck, you may spot correlations between pairs as linear structures

16.16 Vis_2002 Scatter Plot Data represents 7 aspects of cars: what relationships can we notice? For example, what correlates with high MPG? Pictures from Xmdv tool developed by Matthew Ward: davis.wpi.edu/~xmdv

16.17 Vis_2002 Parallel Coordinates: Visualizing M variables on one chart ABCDEF - create M equidistant vertical axes, each corresponding to a variable - each axis scaled to [min, max] range of the variable - each observation corresponds to a line drawn through point on each axis corresponding to value of the variable

16.18 Vis_2002 Parallel Coordinates ABCDEF - correlations may start to appear as the observations are plotted on the chart - here there appears to be negative correlation between values of A and B for example - this has been used for applications with thousands of data items

16.19 Vis_2002 Parallel Coordinates Example Detroit homicide data 7 variables 13 observations

16.20 Vis_2002 The Screen Space Problem n All techniques, sooner or later, run out of screen space n Parallel co- ordinates – Usable for up to 150 variates – Unworkable greater than 250 variates Remote sensing: 5 variates, 16,384 observations)

16.21 Vis_2002 Brushing as a Solution n Brushing selects a restricted range of one or more variables n Selection then highlighted

16.22 Vis_2002 Scatter Plot Use of a ‘brushing’ tool can highlight subsets of data..now we can see what correlates with high MPG

16.23 Vis_2002 Parallel Coordinates Brushing picks out the high MPG data Can you observe the same relations as with scatter plots? More or less easy?

16.24 Vis_2002 Parallel Coordinates Here we highlight high MPG and not 4 cylinders

16.25 Vis_2002 Clustering as a Solution n Success has been achieved through clustering of observations n Hierarchical parallel co-ordinates – Cluster by similarity – Display using translucency and proximity-based colour

16.26 Vis_2002 Hierarchical Parallel Co- ordinates

16.27 Vis_2002 Reduction of Dimensionality of Variable Space n Reduce number of variables, preserve information n Principal Component Analysis – Transform to new co- ordinate system – Hard to interpret n Hierarchical reduction of variable space – Cluster variables where distance between observations is typically small – Choose representative for each cluster

16.28 Vis_2002 Glyph Techniques – Star Plots n Star plots – Each observation represented as a ‘star’ – Each spike represents a variable – Length of spike indicates the value Crime in Detroit

16.29 Vis_2002 Chernoff Faces n Chernoff suggested use of faces to encode a variety of variables - can map to size, shape, colour of facial features - human brain rapidly recognises faces

16.30 Vis_2002 Chernoff Faces n Here are some of the facial features you can use

16.31 Vis_2002 Chernoff Faces n Demonstration applet at: – ects/Faces/

16.32 Vis_2002 Chernoff’s Face n.. And here is Chernoff’s face Chernoff/Hcindex.htm

16.33 Vis_2002 Daisy Charts Dry Wet Showery Saturday Sunday Leeds Sahara Amazon variables and their values placed around circle lines connect the values for one observation This item is { wet, Saturday, Amazon }

16.34 Vis_2002 Daisy Charts - Underground Problems

16.35 Vis_2002 Scientific Visualization – Information Visualization n Focus is on visualizing set of observations that are multi-variate n There is no underlying field – it is the data itself we want to visualize n The relationship between variables is not well understood n Focus is on visualizing an entity measured in a multi-dimensional space n Underlying field is recreated from the sampled data n Relationship between variables well understood Scientific VisualizationInformation Visualization

16.36 Vis_2002 Further Reading n Information Visualization – Robert Spence – published 2000 by Addison Wesley n See also resources section of the module web site