ENV 20063.1 Envisioning Information Lecture 3 – Multivariate Data Exploration Scatter plots and parallel coordinates Ken Brodlie.

Slides:



Advertisements
Similar presentations
You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advertisements

1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Finding The Unknown Number In A Number Sentence! NCSCOS 3 rd grade 5.04 By: Stephanie Irizarry Click arrow to go to next question.
Section 1.8 Homework questions?. Section Concepts 1.8 Linear Equations in Two Variables Slide 2 Copyright (c) The McGraw-Hill Companies, Inc. Permission.
Advanced Piloting Cruise Plot.
Chapter 16 Graphical User Interfaces
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Quickfire Mental Maths 25 questions More free powerpoints at
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Writing Pseudocode And Making a Flow Chart A Number Guessing Game
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Topic 1Topic Q 1Q 6Q 11Q 16Q 21 Q 2Q 7Q 12Q 17Q 22 Q 3Q 8Q 13Q 18Q 23 Q 4Q 9Q 14Q 19Q 24 Q 5Q 10Q 15Q 20Q 25.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Multiplying binomials You will have 20 seconds to answer each of the following multiplication problems. If you get hung up, go to the next problem when.
Lesson 3-3 Ideas/Vocabulary
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 5 second questions
Points, Vectors, Lines, Spheres and Matrices
1 SDMIV Data Visualization - A Very Rough Guide Ken Brodlie University of Leeds.
ENV Envisioning Information Lecture 9 – Time: Taxonomy & Techniques Ken Brodlie
ENV Envisioning Information Lecture 2 Simple Graphs and Charts Ken Brodlie School of Computing University of Leeds.
13.1 Vis_2003 Data Visualization Lecture 13 Visualization of Very Large Datasets.
ENV Envisioning Information Lecture 12 – Scientific Visualization Scalar 2D Data Ken Brodlie
13.1 Vis_04 Data Visualization Lecture 13 Information Visualization Part 1.
Data Visualization Lecture 4 Two Dimensional Scalar Visualization
GR2 Advanced Computer Graphics AGR
15.1 Vis_04 Data Visualization Lecture 15 Information Visualization : Part 3.
ENV Envisioning Information Lecture 4 – Multivariate Data Exploration Glyphs and other methods Hierarchical approaches Ken Brodlie.
ENV Envisioning Information Lecture 6 – Document Visualization Ken Brodlie
ENV 2006 CS4.1 Envisioning Information: Case Study 4 Focus and Context for Volume Visualization.
9.1si31_2001 SI31 Advanced Computer Graphics AGR Lecture 9 Adding Realism Through Texture.
ZMQS ZMQS
Copyright © 2010 Pearson Education, Inc. Systems of Linear Equations in Three Variables Learn basic concepts about systems in three variables Learn basic.
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
Box and Whisker Plots.
ABC Technology Project
Unit 8: Presenting Data in Charts, Graphs and Tables
CHAPTER 6 Introduction to Graphing and Statistics Slide 2Copyright 2012, 2008, 2004, 2000 Pearson Education, Inc. 6.1Tables and Pictographs 6.2Bar Graphs.
“Start-to-End” Simulations Imaging of Single Molecules at the European XFEL Igor Zagorodnov S2E Meeting DESY 10. February 2014.
Squares and Square Root WALK. Solve each problem REVIEW:
Do you have the Maths Factor?. Maths Can you beat this term’s Maths Challenge?
© 2012 National Heart Foundation of Australia. Slide 2.
The x- and y-Intercepts
Graphing Ax + By = C Topic
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Motion in Two and Three Dimensions
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Challenge the future Delft University of Technology Some Observations on Human Control of a Bicycle Jodi Kooijman.
Waller Creek Intake CFD Modeling and Simulation
Addition 1’s to 20.
25 seconds left…...
Copyright © Cengage Learning. All rights reserved.
Week 1.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
TASK: Skill Development A proportional relationship is a set of equivalent ratios. Equivalent ratios have equal values using different numbers. Creating.
1 Functions and Applications
Visualization and Data Mining. 2 Outline  Graphical excellence and lie factor  Representing data in 1,2, and 3-D  Representing data in 4+ dimensions.
1 18 April 2007 vizNET-LEEDS-PRES A Rough Guide to Data Visualization – Part 2 VizNET 2007 Annual Event Ken Brodlie School of Computing University.
16.1 Vis_2002 Data Visualization Lecture 14 Information Visualization Part 1.
Data Visualization.
Presentation transcript:

ENV Envisioning Information Lecture 3 – Multivariate Data Exploration Scatter plots and parallel coordinates Ken Brodlie

ENV Data Tables Multivariate datasets can be expressed as a data table –Each entry in table is an observation –An observation consists of values of a set of variables, or variates Exercise –Create a data table from the MSc class… ABC variables observations

ENV Scatter Plot For two variates, we have already met the scatter plot technique It is useful for showing what happens to one variable as another changes…

ENV Scatter Plot Visicube from Datamology is a useful free charting tool Here is an example scatter plot, visualizing the speed of the (receding) galaxy NGC7531 relative to the earth, measurements of speed being taken at different points on galaxy Circles represent measurements at 133 o to horizon; pluses at 43 o What can you observe?

ENV D Scatter Plot Visicube has a tool specifically for 3D scatter plots Third variate expressed as a vertical axis and widget lets you take slices at different heights Here we have same dataset but X and Y are positions, and Z axis is velocity … ie layered by velocity – here 3 rd layer (1482 – 1519 km/sec) Observations less than 1500 km/sec highlighted in yellow (almost allowing 4D) Conclusion?

ENV D Scatter Plots Here is an alternative approach, using 3D plotting… … does this work? XRT/3d

ENV Extending to Higher Numbers of Variables Additional variables can be visualized by colour and shape coding IRIS Explorer ( a scientific visualization system!) used to visualize data from BMW –Five variables displayed using spatial arrangement for three, colour and object type for others –Notice the clusters… But there are clearly limits to how much this will scale Kraus & Ertl, U Stuttgart

ENV Multivariate Visualization Techniques Software: –Xmdvtool Matthew Ward n Techniques designed for any number of variables –Scatter plot matrices –Parallel co-ordinates –Glyph techniques Acknowledgement: Many of images in following slides taken from Wards work

ENV What are these?

ENV Multivariate Visualization Example of iris data set –150 observations of 4 variables (length, width of petal and sepal) –Check wikipedia for explanations of petals & sepals –Techniques aim to display relationships between variables – the analytical task Challenge in visualization is to design the visualization to match the analytical task

ENV Scatter Plot Matrices

ENV For table data of M variables, we can look at pairs in 2D scatter plots The pairs can be juxtaposed: A B C C B A With luck, you may spot correlations between pairs as linear structures… or you may observe clusters Scatter Plot Matrices

ENV Scatter Plot Matrix – Iris Data Set

ENV Scatter Plot Matrix – Car Data Set Data represents 7 aspects of cars: what relationships can we notice? For example, what correlates with high MPG?

ENV ABCDEF - create M equidistant vertical axes, each corresponding to a variable - each axis scaled to [min, max] range of the variable - each observation corresponds to a line drawn through point on each axis corresponding to value of the variable Parallel Coordinates

ENV ABCDEF - correlations may start to appear as the observations are plotted on the chart - here there appears to be negative correlation between values of A and B for example - this has been used for applications with thousands of data items Parallel Coordinates

ENV Parallel Coordinates – Iris Data

ENV Parallel Coordinates Example Detroit homicide data 7 variables 13 observations

ENV Parallel Coordinates Concept due to Alfred Inselberg Conceived the idea as a research student in 1959… … idea gradually refined over next 40 years

ENV Parallel Coordinates Parallel coordinates is a clever mechanism for transforming geometry from one space to another To get a handle on the idea, consider two variables X,Y In parallel coordinates, a point (X,Y) becomes… what? A line becomes… what? Why is the ordering of the axes important? Use this space to sketch the answers…

ENV The Screen Space Problem All techniques, sooner or later, run out of screen space Parallel co-ordinates –Usable for up to 150 variates –Unworkable greater than 250 variates Remote sensing: 5 variates, 16,384 observations)

ENV Brushing as a Solution Brushing selects a restricted range of one or more variables Selection then highlighted

ENV Scatter Plot Use of a brushing tool can highlight subsets of data..now we can see what correlates with high MPG

ENV Parallel Coordinates Brushing picks out the high MPG data Can you observe the same relations as with scatter plots? More or less easy?

ENV Parallel Coordinates Here we highlight high MPG and not 4 cylinders