4.1 Scatter Diagrams and Correlation. 2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

7.1 Seeking Correlation LEARNING GOAL
 Objective: To look for relationships between two quantitative variables.
Chapter 4 The Relation between Two Variables
Chapter 3 Bivariate Data
1 Objective Investigate how two variables (x and y) are related (i.e. correlated). That is, how much they depend on each other. Section 10.2 Correlation.
Scatter Diagrams and Linear Correlation
Scatterplots, Association, and Correlation 60 min.
Chapter 4 Describing the Relation Between Two Variables
Chapter 41 Describing Relationships: Scatterplots and Correlation.
Describing the Relation Between Two Variables
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Topic 2 Bivariate Data. Data for a single variable is univariate data Many or most real world models have more than one variable … multivariate data In.
Chapter 3 Linear Regression and Correlation
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Describing Relationships: Scatterplots and Correlation
Scatter Diagrams and Correlation
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Linear Regression and Correlation
Relationship of two variables
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
Chapter 4 Correlation and Regression Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Describing the Relation Between Two Variables
Warm-Up A trucking company determines that its fleet of trucks averages a mean of 12.4 miles per gallon with a standard deviation of 1.2 miles per gallon.
Scatterplots, Associations, and Correlation
Chapter 6 & 7 Linear Regression & Correlation
M22- Regression & Correlation 1  Department of ISM, University of Alabama, Lesson Objectives  Know what the equation of a straight line is,
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 1 – Slide 1 of 30 Chapter 4 Section 1 Scatter Diagrams and Correlation.
1 Chapter 7 Scatterplots, Association, and Correlation.
2 Variable Stats Does the amount of time spent on homework improve your grade? If you eat more junk food will you gain weight? Does the amount of rainfall.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Chapter 10 Correlation and Regression
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Sec 1.5 Scatter Plots and Least Squares Lines Come in & plot your height (x-axis) and shoe size (y-axis) on the graph. Add your coordinate point to the.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Section 4.1 Scatter Diagrams and Correlation. Definitions The Response Variable is the variable whose value can be explained by the value of the explanatory.
Chapter 4 Describing the Relation Between Two Variables 4.1 Scatter Diagrams; Correlation.
Bivariate Data and Scatter Plots Bivariate Data: The values of two different variables that are obtained from the same population element. While the variables.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Scatter Diagrams and Correlation Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
Chapter 7 Scatterplots, Association, and Correlation.
Association between 2 variables We've described the distribution of 1 variable - but what if 2 variables are measured on the same individual? Examples?
The Big Picture Where we are coming from and where we are headed…
3.3 Correlation: The Strength of a Linear Trend Estimating the Correlation Measure strength of a linear trend using: r (between -1 to 1) Positive, Negative.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Chapter 4 – Correlation and Regression before: examined relationship among 1 variable (test grades, metabolism, trip time to work, etc.) now: will examine.
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Lesson Scatter Diagrams and Correlation. Objectives Draw and interpret scatter diagrams Understand the properties of the linear correlation coefficient.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Linear Regression Day 1 – (pg )
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
6.7 Scatter Plots. 6.7 – Scatter Plots Goals / “I can…”  Write an equation for a trend line and use it to make predictions  Write the equation for a.
Module 11 Scatterplots, Association, and Correlation.
Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change in the response.
Correlation & Linear Regression Using a TI-Nspire.
Part II Exploring Relationships Between Variables.
SIMPLE LINEAR REGRESSION MODEL
Lecture Notes The Relation between Two Variables Q Q
Section 11.1 Correlation.
Association between 2 variables
Presentation transcript:

4.1 Scatter Diagrams and Correlation

2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall amounts and plant growth  Exercise and cholesterol levels for a group of people  Height and weight for a group of people ● In these cases, we are interested in whether the two variables have some kind of a relationship

2 Variables ● When we have two variables, they could be related in one of several different ways  They could be unrelated  One variable (the explanatory or predictor variable) could be used to explain the other (the response or dependent variable)  One variable could be thought of as causing the other variable to change ● In this chapter, we examine the second case … explanatory and response variables

Lurking Variable Sometimes it is not clear which variable is the explanatory variable and which is the response variable Sometimes the two variables are related without either one being an explanatory variable Sometimes the two variables are both affected by a third variable, a lurking variable, that had not been included in the study

Example of a Lurking Variable ● A researcher studies a group of elementary school children  Y = the student’s height  X = the student’s shoe size ● It is not reasonable to claim that shoe size causes height to change ● The lurking variable of age affects both of these two variables

More Examples ● Rainfall amounts and plant growth  Explanatory variable – rainfall  Response variable – plant growth  Possible lurking variable – amount of sunlight ● Exercise and cholesterol levels  Explanatory variable – amount of exercise  Response variable – cholesterol level  Possible lurking variable – diet

Scatter Diagram The most useful graph to show the relationship between two quantitative variables is the scatter diagram Each individual is represented by a point in the diagram The explanatory (X) variable is plotted on the horizontal scale The response (Y) variable is plotted on the vertical scale

Scatter Diagram An example of a scatter diagram Note the truncated vertical scale!

Relations ● There are several different types of relations between two variables  A relationship is linear when, plotted on a scatter diagram, the points follow the general pattern of a line  A relationship is nonlinear when, plotted on a scatter diagram, the points follow a general pattern, but it is not a line  A relationship has no correlation when, plotted on a scatter diagram, the points do not show any pattern

Positive vs. Negative Linear relations have points that cluster around a line Linear relations can be either positive (the points slants upwards to the right) or negative (the points slant downwards to the right)

Nonlinear Nonlinear relations have points that have a trend, but not around a line The trend has some bend in it

Not Related When two variables are not related There is no linear trend There is no nonlinear trend Changes in values for one variable do not seem to have any relation with changes in the other

Examples ● Examples of nonlinear relations  “Age” and “Height” for people (including both children and adults)  “Temperature” and “Comfort level” for people ● Examples of no relations  “Temperature” and “Closing price of the Dow Jones Industrials Index” (probably)  “Age” and “Last digit of telephone number” for adults

Linear Correlation Coefficient The linear correlation coefficient is a measure of the strength of linear relation between two quantitative variables The sample correlation coefficient “r” is This should be computed with software (and not by hand) whenever possible

Linear Correlation Coefficient ● Some properties of the linear correlation coefficient  r is a unitless measure (so that r would be the same for a data set whether x and y are measured in feet, inches, meters, or fathoms)  r is always between –1 and +1  Positive values of r correspond to positive relations  Negative values of r correspond to negative relations

Linear Correlation Coefficient ● Some more properties of the linear correlation coefficient  The closer r is to +1, the stronger the positive relation … when r = +1, there is a perfect positive relation  The closer r is to –1, the stronger the negative relation … when r = –1, there is a perfect negative relation  The closer r is to 0, the less of a linear relation (either positive or negative)

Examples ● Examples of positive correlation ● In general, if the correlation is visible to the eye, then it is likely to be strong

Examples of positive correlation Strong Positive r =.8 Moderate Positive r =.5 Very Weak r =.1

Negative ● Examples of negative correlation ● In general, if the correlation is visible to the eye, then it is likely to be strong

Strong Negative r = –.8 Moderate Negative r = –.5 Very Weak r = –.1 ● Examples of negative correlation

Nonlinear ● Nonlinear correlation ● Has an r = 0.1, but the difference is that the nonlinear relation shows a clear pattern (or lack of)

Correlation… ● Correlation is not causation! ● Just because two variables are correlated does not mean that one causes the other to change ● There is a strong correlation between shoe sizes and vocabulary sizes for grade school children  Clearly larger shoe sizes do not cause larger vocabularies  Clearly larger vocabularies do not cause larger shoe sizes ● Often lurking variables result in confounding

4.3 Coefficient of Determination R 2 – coefficient of determination, measures the proportion of total variation in the response variable that is explained by the least- squares regression line.

Example Weight of Car Vs. Miles Per Gallon Y = x R = R 2 = % of the variability in miles per gallon can be explained by its linear relationship with the weight. 7% of miles per gallon would be explained by other factors

Calculators Draw a scatter diagram AgeHDL Cholesterol AGE VS. HDL CHOLESTEROL A doctor wanted to determine whether a relation exists between a male’s age and his HDL (so-called good) cholesterol. He randomly selected 17 of his patients and determined their HDL cholesterol levels. He obtained the following data.

New Document Insert Lists & Spreadsheet Column A (age) Column B (HDL) Type in Data Insert Data & Statistics (Ctrl I) Put “age” on x-axis (explanatory) Put “HDL” on y-axis (response) Observe Data (does there appear to be a relationship) Menu 6:regression Linear Regression Insert Calculator Page (Ctrl I) Run Linear Regression Menu 6: Statistics 1: Stat Calculations 3: Linear Regression X List “age” Y List “HDL” ENTER Record equation, r- value, and r 2 - value