IAT 355 Data + Multivariate Visualization

Slides:



Advertisements
Similar presentations
Multi-Dimensional Data Visualization
Advertisements

Sep 23, 2013 IAT Data ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY.
E FFECTIVE V ISUALS Tables Graphs Charts Illustrations.
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 5 Analyzing.
Visualization and Data Mining. 2 Outline  Graphical excellence and lie factor  Representing data in 1,2, and 3-D  Representing data in 4+ dimensions.
cs5764: Information Visualization Chris North
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
Information Visualization Chapter 1 - Continued. Reference Model Visualization: Mapping from data to visual form Raw DataData Tables Visual Structures.
1 i247: Information Visualization and Presentation Marti Hearst Data Types and Graph Types.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
Charts and Graphs V
NERCOMP Workshop, Dec. 2, 2008 Information Visualization: the Other Half of Data Analysis Dr. Matthew Ward Computer Science Department Worcester Polytechnic.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Effective Graphing for Cartographers SP 240 Cartography Alex Chaucer.
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
Class Meeting #11 Data Analysis. Types of Statistics Descriptive Statistics used to describe things, frequently groups of people.  Central Tendency 
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Basic concepts in ordination
IAT 814 Trees Chapter 3.2 of Spence ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS +
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
EXCEL CHARTS. CHARTS Charts provide a way of presenting and comparing data in graphical format. Embedded charts or chart sheets Embedded charts are objects.
CONFIDENTIAL Data Visualization Katelina Boykova 15 October 2015.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
Effective Visuals Tables Graphs Charts Illustrations.
Data Visualization.
1 Visual Encoding Andrew Chan CPSC 533C January 20, 2003.
Between which two points is there a constant speed?
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
84 Interpreting Graphs 83 10/30/15 Starter: Quiz Application Notes Glue here when done Connection : Complete Ws Exit: A plane traveled North 1500 miles.
Applied Cartography and Introduction to GIS GEOG 2017 EL Lecture-5 Chapters 9 and 10.
170 Interpreting Graphs 169 2/10/2016 Starter: What is the difference between speed and velocity? Application Notes Glue here when done Connection : Complete.
Multivariate Display From tables, charts, graphs to more complicated methods.
DISPLAYING DATA DIAGRAMMATICALLY. The Aim By the end of this lecture, the students will be aware of graphical representation of data and by using SPSS.
Data organization and Presentation. Data Organization Making it easy for comparison and analysis of data Arranging data in an orderly sequence or into.
Descriptive Statistics: Tabular and Graphical Methods
DISPLAYING DATA.
Data Visualization basics Petar Horozov Nikolay Nedyalkov
Visualizing your data effectively
IAT 355 Basic Tasks with Interactive Example
Visualizing Data and Communicating Information
Charts & Graphs CTEC V
Guilford County SciVis V105.01
Tennessee Adult Education 2011 Curriculum Math Level 3
Fundamentals of Statistical Process Control
Preparing and Interpreting Tables, Graphs and Figures
Charts and Graphs V
SPREADSHEETS Parts of a graph Data Range X and Y axes
Mean Shift Segmentation
Statistics for the Social Sciences
Bar Graph A bar graph uses vertical or horizontal bars to display numerical information.
Ms jorgensen Unit 1: Statistics and Graphical Representations
Module 6: Presenting Data: Graphs and Charts
Chart and Graphs used in Business CHART COMPONENTS
Collecting & Displaying Data
Building Worksheet Charts
Chart and Graphs used in Business CHART COMPONENTS
Graphs with SPSS.
Descriptive statistics
Chart and Graphs used in Business CHART COMPONENTS
Analyzing Bivariate Data
Descriptive Analysis and Presentation of Bivariate Data
Topic 7: Visualization Lesson 1 – Creating Charts in Excel
Statistical Reasoning
Chart and Graphs used in Business CHART COMPONENTS
Chart and Graphs used in Business CHART COMPONENTS
Scatter Diagrams Slide 1 of 4
Chart and Graphs used in Business CHART COMPONENTS
Data exploration and visualization
Comp 15 - Usability & Human Factors
Presentation transcript:

IAT 355 Data + Multivariate Visualization ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT] | WWW.SIAT.SFU.CA Jan 6, 2017 IAT 355

Agenda Data forms and representations Basic representation techniques Multivariate (>3) techniques Jan 6, 2017 IAT 355

Data Sets Data comes in many different forms Typically, not in the way you want it How is stored (in the raw)? Jan 6, 2017 IAT 355

Example Cars make model year miles per gallon cost number of cylinders weights ... Jan 6, 2017 IAT 355

Data Tables Often, we take raw data and transform it into a form that is more workable Main idea: Individual items are called cases Cases have variables(attributes) Jan 6, 2017 IAT 355

Data Table Format Think of it as a function Dimensions Variable1 Variable2 Variable3 Case1 Value11 Value12 Value13 Case2 Value21 Value22 Value23 Case3 Value31 Value32 Value33 Think of it as a function F(Case1) = <Val11, Val12,…> Jan 6, 2017 IAT 355

Example student data Name Student Num Age Entered SFU GPA Mary Tom 65432101 20 Sep 2010 4.0 Tom 98765651 22 Sep 2013 2.3 Louise 89846251 19 Jan 2012 3.1 Jan 6, 2017 IAT 355

Variable Types Three main types of variables N - Nominal (equal or not equal to other values) Example: gender O - Ordinal (obeys < relation, ordered set) Example: mild, medium, hot, suicide Q - Quantitative (can do math on them) Example: age Jan 6, 2017 IAT 355

Metadata Descriptive information about the data Might be something as simple as the type of a variable, or could be more complex For times when the table itself just isn’t enough Example: if variable1 is “l”, then variable3 can only be 3, 7 or 16 Mary 65432101 20 Sep 2010 4.0 Tom 98765651 22 Sep 2013 2.3 Louise 89846251 19 Jan 2012 3.1 Jan 6, 2017 IAT 355

How Many Variables? Data sets of dimensions 1, 2, 3 are common Number of variables per class 1 - Univariate data 2 - Bivariate data 3 - Trivariate data >3 - Hypervariate data Jan 6, 2017 IAT 355

Representation What’s a common way of visually representing multivariate data sets? Graphs! (not the vertex-edge ones) Jan 6, 2017 IAT 355

Basic Symbolic Displays Graphs Charts Maps Diagrams Jan 6, 2017 IAT 355

Graphs Jan 6, 2017 IAT 355

Graphs Visual display that illustrates one or more relationships among entities Shorthand way to present information Allows a trend, pattern or comparison to be easily comprehended Jan 6, 2017 IAT 355

Issues Critical to focus on task Why do you need a graph? What questions are being answered? What data is needed to answer those questions? Who is the audience? Jan 6, 2017 IAT 355

Graph Components Framework Content Labels Measurement types, scale Geometric Metadata Content Marks, lines, points Data Labels Title, axes, ticks Nominal Metadata Jan 6, 2017 IAT 355

Chart Structure is important, relates entities to each other Primarily uses lines, enclosure, position to link entities Flow charts, family tree, organization chart B A C Jan 6, 2017 IAT 355

Map Represents spatial relations Locations identified by labels Nominal metadata Jan 6, 2017 IAT 355

Choropleth Map Areas are filled and colored differently to indicate some attribute of that region Jan 6, 2017 IAT 355

Cartography Cartographers and map-makers have a wealth of knowledge about the design and creation of visual information artifacts Labeling, color, layout, … Jan 6, 2017 IAT 355

Diagram Schematic picture of object or entity Parts are symbolic Examples: figures, steps in a manual, illustrations … Jan 6, 2017 IAT 355

Details What are the constituent pieces of these four symbolic displays? What are the building blocks? Jan 6, 2017 IAT 355

Visual Structures Composed of Spatial substrate Marks Graphical properties of marks Jan 6, 2017 IAT 355

Space Often put axes on space to assist perception of space Use techniques of composition, alignment, folding, recursion, overloading to 1) increase use of space 2) do data encodings Jan 6, 2017 IAT 355

Marks Things that occur in space Points Lines Areas Volumes Jan 6, 2017 IAT 355

Graphical Properties Size, shape, color, orientation... Spatial Properties Object Properties Expressing Extent Position, Size Greyscale Differentiating Marks Orientation Color, Shape, Texture Jan 6, 2017 IAT 355

Data Number of variables per class 1 - Univariate data 2 - Bivariate data 3 - Trivariate data >3 - Hypervariate data Jan 6, 2017 IAT 355

Univariate Data Jan 6, 2017 IAT 355

What goes where In univariate representations, we often think of the data case as being shown along one dimension, and the value in another Y Axis is quantitative Graph shows change in Y over continuous range X Y Axis is quantitative Graph shows value of Y for 4 cases Jan 6, 2017 IAT 355

Or… We may think of graph as representing independent (data case) and dependent (value) variables Guideline: Independent vs. dependent variables Put independent on x-axis See resultant dependent variables along y-axis Jan 6, 2017 IAT 355

Bivariate Data Representations Scatter plot Each mark is a data case Price Representations Scatter plot Each mark is a data case Want to see relationship between two variables What is the pattern? Mileage Jan 6, 2017 IAT 355

Trivariate Data 3D scatter plot may work Must have 3D cues Horsepower Price 3D scatter plot may work Must have 3D cues 3D blobs motion parallax stereoscopy Mileage Jan 6, 2017 IAT 355

Scatter Plot Use mark attribute for another variable Price Price Mileage Mileage Jan 6, 2017 IAT 355

Alternative 3D Represent each variable on its own line Jan 6, 2017 IAT 355

Hypervariate Data Number of well-known visualization techniques exist for data sets of 1-3 dimensions line graphs, bar graphs, scatter plots OK We see a 3-D world (4-D with time) What about data sets with more than 3 variables? Often the interesting, challenging ones Jan 6, 2017 IAT 355

Multiple Views Each variable on its own line A B C D 1 6 7 9 2 12 3 8 4 2 3 4 A B C D Jan 6, 2017 IAT 355

Scatterplot Matrix aka Splom Represent each possible pair of variables in their own 2-D scatterplot Useful for what? Misses what? Jan 6, 2017 IAT 355

Much More than 3D Fundamentally, we have 2 display dimensions For data sets with >2 variables, we must project data down to 2D Come up with visual mapping that locates each dimension into 2D plane Computer graphics 3D->2D projections Jan 6, 2017 IAT 355

Spreadsheets Tables allocate a unique space per value Case + Variable 1 case per row 1 variable per column Jan 6, 2017 IAT 355

Parallel Coordinates Case 1 V1 V2 V3 V4 V5 V1 V2 V3 V4 V5 9 6 7 3 5 10 8 7 6 5 4 3 2 1 Case 1 V1 V2 V3 V4 V5 9 6 7 3 5 Jan 6, 2017 IAT 355

Parallel Coordinates Case 2 V1 V2 V3 V4 V5 V1 V2 V3 V4 V5 7 8 9 7 3 10 6 5 4 3 2 1 Case 2 V1 V2 V3 V4 V5 7 8 9 7 3 Jan 6, 2017 IAT 355

Parallel Coordinates Each column of space is assigned a variable Vertical Scale to left Each data case is a polyline that puts a vertex on each column at its corresponding data value Jan 6, 2017 IAT 355

Parallel Coords Example Jan 6, 2017 IAT 355

Issues The range of each variable can be different: Must rescale to the vertical space available Each variable rescales independently Hard to read parallel coord plot as a static picture Interaction required Jan 6, 2017 IAT 355

Example Problem VLSI chip manufacture Want high quality chips (high speed) and a high yield batch (% of useful chips) Able to track defects Hypothesis: No defects gives desired chip types 473 batches of data A. Inselberg, “Multidimensional Detective” InfoVis 1997. Jan 6, 2017 IAT 355

The Data 16 variables X1 - yield X2 - quality X3-X12 - # defects (inverted) X13-X16 - physical parameters Jan 6, 2017 IAT 355

Quality Defects Parameters Yield Quality Defects Parameters Distributions Yield: Normal Quality: Bimodal Jan 6, 2017 IAT 355

Top Yield & Quality Split in parameters Split Jan 6, 2017 IAT 355

Minimal Defects Not best yield & quality Jan 6, 2017 IAT 355

Best Yields Parameters that give best yields cause 2 types of defects Jan 6, 2017 IAT 355

XmdvTool Matt Ward, WPI Does Parallel Coords Jan 6, 2017 IAT 355

Dimensional Reordering Which Dimensions are most alike? Sort dimensions according to similarity Jan 6, 2017 IAT 355

Advanced Graphics Johanson et al InfoVis 2005 Jan 6, 2017 IAT 355

Use texture mapping Pre-process data into clusters Render each cluster to texture Blend textures Jan 6, 2017 IAT 355

Interaction with Parallel Coords Angular Query Query the difference Hauser et al InfoVis 2002 Jan 6, 2017 IAT 355

Further elaboration Brush individual ranges Display histogram per dimension Jan 6, 2017 IAT 355

Visualizing Categories Titanic Disaster Bendix et al InfoVis 2005 Jan 6, 2017 IAT 355

Parallel coordinates layout Uses frequency based representation Continuous axes replaced with boxes Uses frequency based representation Jan 6, 2017 IAT 355

Star Plots Space out the n variables at equal angles around a circle Each “spoke” encodes a variable’s value Data point is now a “shape” Jan 6, 2017 IAT 355

Star Plot Examples Jan 6, 2017 IAT 355