1/59 Lecture 02: Data Mapping September 15, 2015 COMP 150-2 Visualization.

Slides:



Advertisements
Similar presentations
Statistics for the Social Sciences Psychology 340 Fall 2006 Distributions.
Advertisements

The visual display of quantitative data Joyce Chapman, Consultant for Communications & Data Analysis State Library of North Carolina,
Welcome to EPS 525 Introduction to Statistics Dr. Robert Horn Summer 2008 Mondays – Thursdays 1:00 – 3:15 p.m.
TYPES OF DATA. Qualitative vs. Quantitative Data A qualitative variable is one in which the “true” or naturally occurring levels or categories taken by.
Graphing data.
1 COMM 301: Empirical Research in Communication Kwan M Lee Lect3_1.
The Stats Unit.
Defining and Measuring Variables Slides Prepared by Alison L. O’Malley Passer Chapter 4.
1. I. Variable II. Relationship among variables III. Hypothesis and theory 2.
Happy Friday Scientists!
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Variation, Validity, & Variables Lesson 3. Research Methods & Statistics n Integral relationship l Must consider both during planning n Research Methods.
LECTURE 03: DATA COLLECTION AND MODELS February 4, 2015 COMP Topics in Visual Analytics Note: slide deck adapted from R. Chang, Fall 2010.
CMPT 880/890 Writing labs. Outline Presenting quantitative data in visual form Tables, charts, maps, graphs, and diagrams Information visualization.
Data Classification.  Qualitative Data: consists of attributes, labels, or nonnumerical entries.  Examples: red, Mr. Smith, Dogs  Quantitative Data:
1 Statistics 202: Statistical Aspects of Data Mining Professor David Mease Tuesday, Thursday 9:00-10:15 AM Terman 156 Lecture 2 = Start chapter 2 Agenda:
1.2 Data Classification NOTES Coach Bridges. What you should learn: How to distinguish between qualitative data and quantitative data How to classify.
ME 1202: Linear Algebra & Ordinary Differential Equations (ODEs)
Chapter 1: Introduction to Statistics. 2 Statistics A set of methods and rules for organizing, summarizing, and interpreting information.
Quantitative Skills 1: Graphing
Inquiry Unit.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall, 2014 Room 120 Integrated.
1 Course review, syllabus, etc. Chapter 1 – Introduction Chapter 2 – Graphical Techniques Quantitative Business Methods A First Course
Chapter 2 Frequency Distributions
Variables & Measurement Lesson 4. What are data? n Information from measurement l datum = single observation n Variables l Dimensions that can take on.
Statistics 300: Introduction to Probability and Statistics Section 1-2.
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Graphing Why? Help us communicate information : Visual What is it telling your? Basic Types Line Bar Pie.
Qualitative Data: consists of attributes, labels or non-numerical entries Examples: Quantitative Data: consists of numerical measurements or counts Examples:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
In Stat-I, we described data by three different ways. Qualitative vs Quantitative Discrete vs Continuous Measurement Scales Describing Data Types.
Lecture 07: Dealing with Big Data
Overview and Types of Data
Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.
Data Classification Lesson 1.2.
Chapter 2: Levels of Measurement. Researchers classify variables according to the extent to which the values of the variable measure the intended characteristics.
1 PAUF 610 TA 1 st Discussion. 2 3 Population & Sample Population includes all members of a specified group. (total collection of objects/people studied)
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Applying Pixel Values to Digital Images
1 What is Data? l An attribute is a property or characteristic of an object l Examples: eye color of a person, temperature, etc. l Attribute is also known.
Biostatistics Introduction Article for Review.
Welcome Physicists! Today: DQ: What is the difference between an independent and dependent variable? 1.Complete “Don’t Lose Your Marbles” 2.Data and Graph.
LAB 01: BAR AND LINE CHARTS February 3, 2015 SDS 136 Communicating with Data.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
1 Computational Vision CSCI 363, Fall 2012 Lecture 32 Biological Heading, Color.
Anthony J Greene1 Distributions of Variables I.Properties of Variables II.Nominal Data & Bar Charts III.Ordinal Data IV.Interval & Ratio Data, Histograms.
Used to communicate the accuracy of measurements
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Data Preliminaries CSC 600: Data Mining Class 1.
GRAPHING RULES.
Chapter 12 Understanding Research Results: Description and Correlation
Reasoning in Psychology Using Statistics
Lecture Notes for Chapter 2 Introduction to Data Mining
Tips for exam 1- Complete all the exercises from the back of each chapter. 2- Make sure you re-do the ones you got wrong! 3- Just before the exam, re-read.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2018 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
CSc4730/6730 Scientific Visualization
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2017 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Chapter 3 Graphical and Tabular Displays of Data.
Statistics Chapter 1 Sections
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Reasoning in Psychology Using Statistics
PBH 616: Quantitative Research Method
Data Preliminaries CSC 576: Data Mining.
Graphing data.
Inequalities Some problems in algebra lead to inequalities instead of equations. An inequality looks just like an equation, except that in the place of.
Reasoning in Psychology Using Statistics
Association between 2 variables
Biostatistics Lecture (2).
Presentation transcript:

1/59 Lecture 02: Data Mapping September 15, 2015 COMP Visualization

2/59 Admin Assignment 0 -- no demo TA Office hours posted Assignment 1 posted (more later) Piazza accounts Online study by Tara Kola: dfa6 Reminder: Lab 1 is due on Thursday!! Also, someone please let me know when we have less than 15 mins left in the class

3/59 How do you design a visualization? (note, this might be a trick question)

4/59 Better Question: How do you design a data visualization?

5/59 What is Data Visualization? A mapping of data attributes to visual attributes What are data attributes? What are visual attributes?

6/59 Good vs. Bad Ways An objective analysis to a visual design Reverse the thought process – What if we don’t start by thinking about visual designs first (and how data would map onto it)… This is how Excel wants you to think… But it’s backwards But instead we start with data design and figure out what visual marks can “support” it? Consider: Scatterplot Barchart

7/59

8/59 Common Visualization Design

9/59 General Use of Glyphs

10/59 General Use of Glyphs

11/59 Chernoff Faces (1973)

12/59 A Note on Chernoff Faces What do you think?

13/59 A Note on Chernoff Face Chernoff faces was invented by Herman Chernoff (1973) Based on the idea that human perceptions are specifically tuned to detect facial features and expressions Study have shown that detecting differences in Chernoff faces is not pre-attentive (Morris et al. 1999)

14/59 Detecting Faces Kindlmann et al. (2002)

15/59 Detecting Faces Kindlmann et al. (2002)

16/59 General Mapping? What data attributes map well to visual attributes? How do you know if the mappings are good? Objective measure: see set theory notes below Perceptual and cognitive measures: future lectures Subjective measure: aesthetics, preference, tasks, etc.

17/59 Color Shape

18/59 Set Theory Bijection (one visual attribute, one data attribute) Surjection (multiple visual attribute to one data attribute) Every element in Y has 1 or more corresponding element in X Injection (One to one mapping, but not all data elements are mapped) Every element in X has a mapping in Y, but not true in reverse Other scenarios?

19/59

20/59 Interaction Effects An example of interference between icon spacing (representing a linear variable) and icon brightness (representing a more general scalar field). Areas of high brightness create false lower-spacing regions. Acevdeo, Laidlaw. “Subjective Quantification of Perceptual Interactions among some 2D Scientific Visualization Methods”, TVCG 2006.

21/59 Interaction Effects Process for creating the stimuli for the data resolution identification task. (a) Shows a vertical sine-wave dataset. (b) Shows the same dataset with amplitude values a linearly decreasing from left to right. (c) Shows the final appearance of the datasets used for this task, where we also linearly move the zero value of the sine-wave from a/2 at the top of the image to 1−a/2 at the bottom. (d) Shows how subjects would mark the area where they perceive the sine-wave pattern.

22/59 What If? What if 1.The number of DATA VARIABLES is greater than VISUAL VARIABLES? 2.The number of VISUAL VARIABLES is greater than DATA VARIABLES?

23/59 Exercise Consider a line graph How many variables can you fit into a line graph? What about a barchart?

24/59 Exercise Consider a line graph How many variables can you fit into a bar chart? What about a barchart? For 2-dimensional data, are these two visualizations the same?

25/59 Structure and Form Image courtesy of Barbara Tversky

26/59 Structure and Form Image courtesy of Barbara Tversky

27/59 Visualization Process

28/59 Data Definition A typical dataset in visualization consists of n records (r 1, r 2, r 3, …, r n ) Each record r i consists of m (m >=1) observations or variables (v 1, v 2, v 3, …, v m ) A variable may be either independent or dependent Independent variable (iv) is not controlled or affected by another variable For example, time in a time-series dataset Dependent variable (dv) is affected by a variation in one or more associated independent variables For example, temperature in a region Formal definition: r i = (iv 1, iv 2, iv 3, …, iv m i, dv 1, dv 2, dv 3, …, dv m d ) where m = m i + m d

29/59 Basic Data Types Nominal Ordinal Scale / Quantitative Interval ratio Def: A set of not-ordered and non-numeric values For example: Categorical (finite) data {apple, orange, pear} {red, green, blue} Arbitrary (infinite) data {“12 Main St. Boston MA”, “45 Wall St. New York NY”, …} {“John Smith”, “Jane Doe”, …}

30/59 Basic Data Types Nominal Ordinal Scale / Quantitative Interval ratio Def: A tuple (an ordered set) For example: Numeric Binary Non-numeric

31/59 Basic Data Types Nominal Ordinal Scale / Quantitative Interval ratio Def: A numeric range Interval Ordered numeric elements on a scale that can be mathematically manipulated, but cannot be compared as ratios For example: date, current time (Sept 14, 2010 cannot be described as a ratio of Jan 1, 2011) Ratio where there exists an “absolute zero” For example: height, weight

32/59 Basic Data Types (Formal) Nominal (N){…} Ordinal (O) Scale / Quantitative (Q)[…] Q → O [0, 100] → O → N → {C, B, F, D, A} N → O (??) {John, Mike, Bob} → {red, green, blue} → ?? O → Q (??) Hashing? Bob + John = ?? Readings in Information Visualization: Using Vision To Think. Card, Mackinglay, Schneiderman, 1999

33/59 Operations on Basic Data Types What are the operations that we can perform on these data types? Nominal (N) = and ≠ Ordinal (O) >, <, ≥, ≤ Scale / Quantitative (Q) everything else (+, -, *, /, etc.) Consider a distance function

34/59 Half-Way Point (Find your partner)