LAB 03: SCATTERPLOTS February 10, 2015 SDS 136 Communicating with Data.

Slides:



Advertisements
Similar presentations
Mr F’s Maths Notes Statistics 8. Scatter Diagrams.
Advertisements

CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Basic Statistical Concepts
Statistics Psych 231: Research Methods in Psychology.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Two Variable Analysis Joshua, Alvin, Nicholas, Abigail & Kendall.
Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
AP STATISTICS LESSON 3 – 1 EXAMINING RELATIONSHIPS SCATTER PLOTS.
MATH 1107 Elementary Statistics Lecture 6 Scatterplots, Association and Correlation.
LECTURE 2 Understanding Relationships Between 2 Numerical Variables
Descriptive Methods in Regression and Correlation
Relationship of two variables
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Advantages and Disadvantages
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Chapter 3: Examining relationships between Data
Stat 1510: Statistical Thinking and Concepts Scatterplots and Correlation.
Examining Relationships
Scatterplots, Association,
Quantitative Skills 1: Graphing
CHAPTER 6, INDEXES, SCALES, AND TYPOLOGIES
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Section 2.2 Correlation A numerical measure to supplement the graph. Will give us an indication of “how closely” the data points fit a particular line.
Lesson Scatterplots and Correlation. Knowledge Objectives Explain the difference between an explanatory variable and a response variable Explain.
Descriptive Statistics becoming familiar with the data.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Graphing Why? Help us communicate information : Visual What is it telling your? Basic Types Line Bar Pie.
Scatterplot and trendline. Scatterplot Scatterplot explores the relationship between two quantitative variables. Example:
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
The Practice of Statistics
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Association between 2 variables We've described the distribution of 1 variable - but what if 2 variables are measured on the same individual? Examples?
3.2: Linear Correlation Measure the strength of a linear relationship between two variables. As x increases, no definite shift in y: no correlation. As.
 Describe the association between two quantitative variables using a scatterplot’s direction, form, and strength  If the scatterplot’s form is linear,
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Reasoning in Psychology Using Statistics Psychology
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
CS 235: User Interface Design April 30 Class Meeting Department of Computer Science San Jose State University Spring 2015 Instructor: Ron Mak
Relationships Scatterplots and Correlation.  Explanatory and response variables  Displaying relationships: scatterplots  Interpreting scatterplots.
Bivariate Data Parallel Boxplots. Parallel boxplots are made up of several regular boxplots. They are drawn up on a common scale so they can be easily.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
Chapter 4 Scatterplots – Descriptions. Scatterplots Graphical display of two quantitative variables We plot the explanatory (independent) variable on.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
Pictograph Uses an Icon to represent a quantity. A key must be used to explain the icon Advantages Easy to read Visually appealing Handles large data.
Bar Graphs Used to compare amounts, or quantities The bars provide a visual display for a quick comparison between different categories.
Scatter plots Adapted from 350/
LAB 01: BAR AND LINE CHARTS February 3, 2015 SDS 136 Communicating with Data.
Unit 6 Review! Objective: to review data analysis Common Core State Standards 8.SP.1; 8.SP.2; 8.SP.3; 8.SP.4.
© 2010 Pearson Education, Inc. All rights reserved Data Analysis/Statistics: An Introduction Chapter 10.
Data Visualization basics Petar Horozov Nikolay Nedyalkov
Mr Barton’s Maths Notes
GRAPHS – MODE OF OPERATION
Data Collecting, Organizing & Analyzing
10 Chapter Data Analysis/Statistics: An Introduction
Ch. 10 – Scatterplots, Association and Correlation (Day 1)
Graphing Data.
Chapter 7: Scatterplots, Association, and Correlation
Lab 04: Visualizing Multiple Variables
Analyzing One-Variable Data
Describing Relationships
Unit 1 Most common: Line Graph
Scatter Plots Below is a sample scatter plot, can you tell me what they are designed to show.
IV. Graphing Types of graphs Graphing
Chapter 3 Scatterplots and Correlation.
Graphing Skills.
Association between 2 variables
Presentation transcript:

LAB 03: SCATTERPLOTS February 10, 2015 SDS 136 Communicating with Data

Outline Scatterplots - Definitions - Comparing multiple categories - What gets encoded? - Pros and cons - Questions to ask - Things to watch out for Lab 3: Building scatterplots in Tableau

Scatterplot Definition: a visualization that shows the correlation between two quantitative (numerical) variables One axis shows one quantitative measure The other axis shows a second quantitative measure Each point represents an observation

Scatterplots on multiple categories We can change the mark type to compare the same variables across different categories

Correlation Scatterplots are particularly useful for highlighting correlation (what’s that?) Trend line (a.k.a “line of best fit”)

Trend lines May pass through all, some, or none of the data points Useful for making predictions Trend line (a.k.a “line of best fit”)

Positive correlation Two variables moving in the same direction Trend line points up and to the right

Positive correlation: example

Negative correlation Two variables moving in opposite directions Trend line points down and to the right

Negative correlation: example

No correlation Two variables are independent of one another No discernable pattern Trend line is flat

No correlation: example

What gets encoded Maximum value(s) in each dimension Minimum value(s) in each dimension General trend (what law is at work here?) Outliers (points that are “far from the pack”)

Pros and cons What’s good about scatterplots? What’s not so good?

Questions to ask Examine the data to figure out how many different categories you want to show Use this to determine your symbols (don’t forget labels!) How many categories? Examine the data to identify the biggest and smallest values for the first variable Use this value to scale your first axis What’s the range for V1? Examine the data to identify the biggest and smallest values for the second variable Use this value to scale your second axis What’s the range for V2?

Things to watch out for Choice of scale matters

Things to watch out for Choice of model matters LinearQuadratic

Things to watch out for Overplotting can make it hard to tell what’s going on in different categories

Things to watch out for When this is the case, using small multiples can sometimes help

Things to watch out for Variables with only a few unique values can also pose a problem

Lab 3: scatterplots Instructions for today’s lab are available at:

Up next A1 due tonight by 11:59pm Assignment 2 has been posted - A2 Piazza post will be due Monday Feb. 22 nd by 11:59pm - Submission instructions are included with the assignment Monday: perception