Chapter 7 Scatterplots, Association, and Correlation.

Slides:



Advertisements
Similar presentations
Section 6.1: Scatterplots and Correlation (Day 1).
Advertisements

Chapter 3 Examining Relationships
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
+ Scatterplots and Correlation Displaying Relationships: ScatterplotsThe most useful graph for displaying the relationship between two quantitative variables.
CHAPTER 3 Describing Relationships
Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
Association between 2 variables We've described the distribution of 1 variable in Chapter 1 - but what if 2 variables are measured on the same individual?
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Stat 1510: Statistical Thinking and Concepts Scatterplots and Correlation.
Warm-Up A trucking company determines that its fleet of trucks averages a mean of 12.4 miles per gallon with a standard deviation of 1.2 miles per gallon.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.
Essential Statistics Chapter 41 Scatterplots and Correlation.
Objectives (IPS Chapter 2.1)
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
CHAPTER 4 SCATTERPLOTS AND CORRELATION BPS - 5th Ed. Chapter 4 1.
Chapter 4 Scatterplots and Correlation. Explanatory and Response Variables u Interested in studying the relationship between two variables by measuring.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Association between 2 variables We've described the distribution of 1 variable - but what if 2 variables are measured on the same individual? Examples?
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
Correlation tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition, we would like to.
Unit 3: Describing Relationships
BPS - 5th Ed. Chapter 41 Scatterplots and Correlation.
Lecture 4 Chapter 3. Bivariate Associations. Objectives (PSLS Chapter 3) Relationships: Scatterplots and correlation  Bivariate data  Scatterplots (2.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
Essential Statistics Chapter 41 Scatterplots and Correlation.
CHAPTER 3 Describing Relationships
Basic Practice of Statistics - 3rd Edition
Chapter 3: Describing Relationships
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 5th Edition
Chapter 3: Describing Relationships
Chapter 2 Looking at Data— Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 4 - Scatterplots and Correlation
Basic Practice of Statistics - 3rd Edition
Chapter 3 Scatterplots and Correlation.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Essential Statistics Scatterplots and Correlation
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Scatterplots, Association and Correlation
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Basic Practice of Statistics - 3rd Edition
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Presentation transcript:

Chapter 7 Scatterplots, Association, and Correlation

Examining Relationships Relationship between two variables Examples: Height and Weight Alcohol and Body Temperature SAT Verbal Score and SAT Math Score High School GPA and College GPA

Two Types of Variables Response Variable (Dependent) Measures an outcome of the study Explanatory Variable (Independent) Used to explain the response variable. Example: Alcohol and Body Temp Explanatory Variable: Alcohol Response Variable: Body Temperature

Two Types of Variables Does not mean that explanatory variable causes response variable It helps explain the response Sometimes there are no true response or explanatory variables Ex. Height and Weight SAT Verbal and SAT Math Scores

Graphing Two Variables Plot of explanatory variable vs. response variable Explanatory variable goes on horizontal axis (x) Response variable goes on vertical axis (y) If response and explanatory variables do not exist, you can plot the variables on either axis. This plot is called a scatterplot This plot can only be used if explanatory and response variables are both quantitative.

Scatterplots Scatterplots show patterns, trends, and relationships. When interpreting a scatterplot (i.e., describing the relationship between two variables) always look at the following: Overall Pattern Form Direction Strength Deviations from the Pattern Outliers

Interpreting Scatterplots Form Is the plot linear or is it curved? Strength Does the plot follow the form very closely or is there a lot of scatter (variation)?

Interpreting Scatterplots Direction Is the plot increasing or is it decreasing? Positively Associated Above (below) average in one variable tends to be associated with above (below) average in another variable. Negative Associated Above (below) average in one variable tends to be associated with below (above) average in another variable.

Example – Scatterplot The following survey was conducted in the U.S. and in 10 countries of Western Europe to determine the percentage of teenagers who had used marijuana and other drugs.

Example – Scatterplot 2434United States 3153Scotland 37Portugal 36Norway 1423North Ireland 819Italy 1637Ireland 15Finland 2140England 317Denmark 422Czech Republic Other DrugsMarijuanaCountry Percent who have used

Example – Scatterplot

The variables are interchangeable in this example. In this example, Percent of Marijuana is being used as the explanatory variable (since it is on the x-axis). Percent of Other Drugs is being used as the response since it is on the y-axis.

Example - Scatterplot The form is linear The strength is fairly strong The direction is positive since larger values on the x-axis yield larger values on the y-axis

Example - Scatterplot Negative association Outside temperature and amount of natural gas used

Correlation The strength of the linear relationship between two quantitative variables can be described numerically This numerical method is called correlation Correlation is denoted by r

Correlation A way to measure the strength of the linear relationship between two quantitative variables.

Correlation Steps to calculate correlation: Calculate the mean of x and y Calculate the standard deviation for x and y Calculate Plug all numbers into formula

Correlation

Calculating r. Femur (x) Humerus (y) Set up a table with columns for x, y,,,,, and

Calculating r yx

Calculating r Recall: So,

Calculating r Recall: So,

Calculating r. Put everything into the formula:

Properties of r r has no units (i.e., just a number) Measures the strength of a LINEAR association between two quantitative variables If the data have a curvilinear relationship, the correlation may not be strong even if the data follow the curve very closely.

Properties of r r always ranges in values from –1 to 1 r = 1 indicates a straight increasing line r = -1 indicates a straight decreasing line r = 0 indicates no LINEAR relationship As r moves away from 0, the linear relationship between variables is stronger

Properties of r Changing the scale of x or y will not change the value of r Not resistant to outliers Strong correlation ≠ Causation Strong linear relationship between two variables is NOT proof of a causal relationship!

Reading JMP Output The following is some output from JMP where I considered Blood Alcohol Content and Number of Beers. The explanatory variable is the number of beers. Blood alcohol content is the response variable.

Reading JMP Output

Summary of Fit

Reading JMP Output RSquare = r 2 This means I know this is positive because the scatterplot has a positive direction. The Mean of the Response is the mean of the y’s or