BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.

Slides:



Advertisements
Similar presentations
Section 6.1: Scatterplots and Correlation (Day 1).
Advertisements

Overview Correlation Regression -Definition
5/17/2015Chapter 41 Scatterplots and Correlation.
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
Chapter 4 Describing the Relation Between Two Variables
Chapter 41 Describing Relationships: Scatterplots and Correlation.
Describing the Relation Between Two Variables
Correlation Relationship between Variables. Statistical Relationships What is the difference between correlation and regression? Correlation: measures.
BPS - 5th Ed. Chapter 51 Regression. BPS - 5th Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
Correlation: Relationship between Variables
Describing Relationships: Scatterplots and Correlation
Basic Practice of Statistics - 3rd Edition
Describing Relationships: Scatter Plots and Correlation ● The world is an indivisible whole (butterfly effect and chaos theory; quantum entanglement, etc.)
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
Stat 1510: Statistical Thinking and Concepts Scatterplots and Correlation.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.2 The Association.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 141.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 1 – Slide 1 of 30 Chapter 4 Section 1 Scatter Diagrams and Correlation.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Essential Statistics Chapter 41 Scatterplots and Correlation.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
BPS - 3rd Ed. Chapter 51 Regression. BPS - 3rd Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
Chapter 5 Regression BPS - 5th Ed. Chapter 51. Linear Regression  Objective: To quantify the linear relationship between an explanatory variable (x)
BPS - 5th Ed. Chapter 51 Regression. BPS - 5th Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Chapter 4 Describing the Relation Between Two Variables 4.1 Scatter Diagrams; Correlation.
Scatterplots are used to investigate and describe the relationship between two numerical variables When constructing a scatterplot it is conventional to.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
CHAPTER 4 SCATTERPLOTS AND CORRELATION BPS - 5th Ed. Chapter 4 1.
Chapter 4 Scatterplots and Correlation. Explanatory and Response Variables u Interested in studying the relationship between two variables by measuring.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter 7 Scatterplots, Association, and Correlation.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
Chapter 141 Describing Relationships: Scatterplots and Correlation.
BPS - 5th Ed. Chapter 41 Scatterplots and Correlation.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
BPS - 3rd Ed. Chapter 51 Regression. BPS - 3rd Ed. Chapter 52 u To describe the change in Y per unit X u To predict the average level of Y at a given.
Stat 1510: Statistical Thinking and Concepts REGRESSION.
Chapter 5: 02/17/ Chapter 5 Regression. 2 Chapter 5: 02/17/2004 Objective: To quantify the linear relationship between an explanatory variable (x)
Essential Statistics Chapter 41 Scatterplots and Correlation.
Chapter 3 Association: Contingency, Correlation, and Regression Section 3.1 How Can We Explore the Association between Two Categorical Variables?
CHAPTER 3 Describing Relationships
Essential Statistics Regression
Basic Practice of Statistics - 3rd Edition

Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 5th Edition
CHAPTER 4: Scatterplots and Correlation
Unit 4 Vocabulary.
Basic Practice of Statistics - 5th Edition Regression
HS 67 (Intro Health Stat) Regression
CHAPTER 3 Describing Relationships
Chapter 4 - Scatterplots and Correlation
Basic Practice of Statistics - 3rd Edition
Correlation.
Basic Practice of Statistics - 3rd Edition Regression
CHAPTER 3 Describing Relationships
Essential Statistics Scatterplots and Correlation
CHAPTER 3 Describing Relationships
Basic Practice of Statistics - 3rd Edition Lecture Powerpoint
Basic Practice of Statistics - 3rd Edition
Presentation transcript:

BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation

BPS - 3rd Ed. Chapter 42 Variable (X) and Variable (Y) u Prior chapters  one variable at a time u This chapter  relationship between two variables u One variable is an “outcome”: response variable (Y) u The other variable is a “predictor”: explanatory variable (X) u Are X and Y related? X  Y?

BPS - 3rd Ed. Chapter 43 Question A study investigates whether the there is a relationship between gross domestic product and life expectancy: Which is the explanatory variable (X)? Which is the response variable (Y)? All other variables that may influence life expectancy are “lurking” and may confound the relation between X and Y. Are there lurking variables in this analysis?

BPS - 3rd Ed. Chapter 44 u This chapter considers the case in which both X and Y are quantitative variables u Bivariate data points (x i, y i ) are plotted on graph paper to form a scatterplot Scatterplot

BPS - 3rd Ed. Chapter 45 u X = percent of students taking SAT u Y = mean SAT verbal score u What is the relationship between X and Y? Example of a scatterplot

BPS - 3rd Ed. Chapter 46 Interpreting scatterplots u Form Can data be described by straight line? [Linearity] u Direction Does the line slope upward or downward v Positive association = above-average values of Y accompany above-average values of X (and vice versa) v Negative association = above-average values of Y accompany below-average values of X (and vice versa) u Strength Do data point adhere to imaginary line?

BPS - 3rd Ed. Chapter 47 Form [discuss]

BPS - 3rd Ed. Chapter 48 Strength and direction u Direction: positive, negative or flat u Strength: How closely does a non- horizontal straight line fit the points of a scatterplot? Close fitting  strong Loose fitting  weak

BPS - 3rd Ed. Chapter 49 Strength cannot be reliably judged visually u These two scatterplots are of the same data (they have the exact same correlation) u The second scatter plot looks like a stronger correlation, but this is an artifact of the axis scaling

BPS - 3rd Ed. Chapter 410 Correlation coefficient (r) u Let r denote the correlation coefficient u r is always between -1 and +1, inclusive u Sign of r denotes direction of association u Special values for r :  r = +1  all points on upward sloping line  r = -1  all points on downward sloping line  r = 0  no line or horizontal line  The closer r is to +1 or –1, the better the fit of points to the line

BPS - 3rd Ed. Chapter 411 Examples of Correlations u Husband’s versus Wife’s ages v r =.94 u Husband’s versus Wife’s heights v r =.36 u Professional Golfer’s Putting Success: Distance of putt in feet versus percent success v r = -.94

BPS - 3rd Ed. Chapter 412 Correlation Coefficient r u Data on variables X and Y for n individuals: x 1, x 2, …, x n and y 1, y 2, …, y n u Each variable has a mean and std dev:

BPS - 3rd Ed. Chapter 413 Correlation coefficient r The formula for r can be understood by converting data points to standardized scores: where

BPS - 3rd Ed. Chapter 414 Illustrative example (gdp_life.sav) Per Capita Gross Domestic Product and Average Life Expectancy for Countries in Western Europe Does GDP predict life expectancy?

BPS - 3rd Ed. Chapter 415 Illustrative example (gdp_life.sav) CountryPer Capita GDP (X)Life Expectancy (Y) Austria Belgium Finland France Germany Ireland Italy Netherlands Switzerland United Kingdom

BPS - 3rd Ed. Chapter 416 Illustrative example (gdp_life.sav) Scatterplot

BPS - 3rd Ed. Chapter 417 Illustrative example (gdp_life.sav) xy = = sum = s x =1.532s y =0.795

BPS - 3rd Ed. Chapter 418 Illustrative example (gdp_life.sav)

BPS - 3rd Ed. Chapter 419 Interpretation of r Direction of association: positive or negative Strength of association: the closer |r| is to 1, the stronger the correlation. Here are guidelines: 0.0  |r| < 0.3  weak correlation 0.3  |r| < 0.7  moderate correlation 0.7  |r| < 1.0  strong correlation |r| = 1.0  perfect correlation

BPS - 3rd Ed. Chapter 420 Interpretation of r For GDP / life expectancy example, r = This indicates a strong positive correlation

BPS - 3rd Ed. Chapter 421 Problems with Correlations u Not all relations are linear u Outliers can have large influence on r u Lurking variables confound relations

BPS - 3rd Ed. Chapter 422 Not all Relationships are Linear Miles per Gallon versus Speed u r  0 (flat line) u But there is a non- linear relation

BPS - 3rd Ed. Chapter 423 Not all Relationships are Linear Miles per Gallon versus Speed u Curved relationship. u r was misleading.

BPS - 3rd Ed. Chapter 424 Outliers and Correlation The outlier in the above graph decreases r If we remove the outlier  strong relation

BPS - 3rd Ed. Chapter 425 Exercise 4.15: Calories and sodium content of hot dogs (a) What are the lowest and highest calorie counts? …lowest and highest sodium levels? (b) Positive or negative association? (c) Any outliers? If we ignore outlier,is relation still linear? Does the correlation become stronger?

BPS - 3rd Ed. Chapter 426 Exercise 4.13: IQ and school grades (a) Positive or negative association? (b) Is form linear? Does it appear strong? (c) What is the IQ and GPA for the outlier on the bottom there?