Scatter Plots and Correlation

Slides:



Advertisements
Similar presentations
Simple Linear Regression and Correlation by Asst. Prof. Dr. Min Aung.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Elementary Statistics Larson Farber 9 Correlation and Regression.
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Linear Correlation To accompany Hawkes lesson 12.1 Original content by D.R.S.
Correlation and Linear Regression
SIMPLE LINEAR REGRESSION
Correlation Scatter Plots Correlation Coefficients Significance Test.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.1.
Correlation and Regression
Chapter 4 Correlation and Regression Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Ms. Khatijahhusna Abd Rani School of Electrical System Engineering Sem II 2014/2015.
Section 10.3 Hypothesis Testing for Means (Large Samples) HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant.
Section 12.1 Scatter Plots and Correlation HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems,
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Elementary Statistics Correlation and Regression.
Section 12.2 Linear Regression HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
CHAPTER 3 INTRODUCTORY LINEAR REGRESSION. Introduction  Linear regression is a study on the linear relationship between two variables. This is done by.
Section 10.2 Hypothesis Testing for Means (Small Samples) HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.1.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.1.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 13.
Correlation and Linear Regression
Hypothesis Testing – Two Means(Small, Independent Samples)
Correlation and Regression
Hypothesis Testing – Two Population Variances
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Section 12.2 Linear Regression
Inference about the slope parameter and correlation
CHAPTER 10 & 13 Correlation and Regression
Regression and Correlation
Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Is there a relationship between the lengths of body parts?
Correlation & Regression
Hypothesis Testing for Means (Small Samples)
Multiple Regression Equations
Correlation and Simple Linear Regression
Fundamentals of Hypothesis Testing
Chapter 5 STATISTICS (PART 4).
Elementary Statistics
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Simple Linear Regression
Correlation and Regression
LESSON 24: INFERENCES USING REGRESSION
Chapter 10 Correlation and Regression
Correlation and Regression
Correlation and Simple Linear Regression
Correlation and Regression
Section 1.4 Curve Fitting with Linear Models
Correlation and Regression
SIMPLE LINEAR REGRESSION
Simple Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Warsaw Summer School 2017, OSU Study Abroad Program
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Scatter Plots and Correlation Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. HAWKES LEARNING SYSTEMS math courseware specialists Section 12.1 Scatter Plots and Correlation

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Definitions: Scatter Plot – a graph on the coordinate plane which contains one point for each pair of data. Independent Variable – placed on the x-axis, this variable causes a change in the dependent variable. Also known as the predictor variable or the explanatory variable. (x) Dependent Variable – placed on the y-axis, this variable changes in response to the independent variable. Also known as the response variable. (y) b0 = y-intercept b1 = slope

Draw a scatter plot to represent the following data: HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Draw a scatter plot: Draw a scatter plot to represent the following data: Hours of Study 0.5 1 1.5 2 3 4.5 5 Test Grade 72 84 68 85 77 81 48 90 99 88 Solution:

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Scatter Plots: We are interested in the relationship between the two variables being studied. Like any other relationship, some relationships are stronger than others. Linear Relationship – when a relationship seems to follow a straight line. Positive Slope (+b1) – indicates that as the values of one variable increase, so do the values of the other variable. (Higher Salary (x) , More Consumption (y)) Negative Slope (-b1)– indicates that as the values of one variable increase, the values of the other variable decrease. (Higher Price (x), Less Consumption (y))

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Types of Slopes: x x The hours you study for a final exam and your final grade. The price of a used car and the number of miles it was driven.

No Correlation– no relationship between the variables. HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Correlation: Correlation is the mathematical term for the relationship between the two variables x and y. Strong Linear Correlation – when a relationship seems to follow a straight line. Weak Linear Correlation– when a relationship seems to follow a straight line, but the points are more scattered. No Correlation– no relationship between the variables.

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Types of Relationships:

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Correlation coefficient: Pearson Correlation Coefficient,  – the parameter that measures the strength of a linear relationship for the population. Correlation Coefficient, r – measures how strongly one variable is linearly dependent upon the other for a sample. When calculating the correlation coefficient, round your answers to three decimal places.

Correlation Coefficient (r) HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Correlation Coefficient (r) –1 ≤ r ≤ 1 Close to –1 means a strong negative correlation. Close to 0 means no correlation. Close to 1 means a strong positive correlation.

Calculate the correlation coefficient, r, for the data shown below. HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Find the value of r: Calculate the correlation coefficient, r, for the data shown below. Hours of Study 0.5 1 1.5 2 3 4.5 5 Test Grade 72 84 68 85 77 81 48 90 99 88 Solution: n = 10, ∑x = 20, ∑y = 792, ∑xy = 1690, ∑x2 = 66, ∑y2 = 64528  0.490

Significant Linear Relationship (Two-Tailed Test): HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Null and Alternative Hypotheses Testing for Linear Relationships: Significant Linear Relationship (Two-Tailed Test): H0:  = 0 (Implies there is no significant linear relationship.) Ha:  ≠ 0 (Implies there is a significant linear relationship.) Negative Linear Relationship (Left-Tailed Test): H0:  ≥ 0 Ha:  < 0 Positive Linear Relationship (Right-Tailed Test): H0:  ≤ 0 Ha:  > 0

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Testing Statistic: with d.f. = n – 2 To determine if the test statistic calculated from the sample is statistically significant we will need to look at the critical value.

Significant Linear Relationship (Two-Tailed Test): HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Rejection Rule for Testing Linear Relationships: Significant Linear Relationship (Two-Tailed Test): Reject H0 if |t | ≥ t/2 Negative Linear Relationship (Left-Tailed Test): Reject H0 if t ≤ –t Positive Linear Relationship (Right-Tailed Test): Reject H0 if t ≥ t

State the null of alternative hypotheses. HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Steps for Hypothesis Testing: State the null of alternative hypotheses. Set up the hypothesis test by choosing the test statistic and determining the values of the test statistic that would lead to rejecting the null hypothesis. Gather data and calculate the necessary sample statistics. Draw a conclusion.

First state the hypotheses: H0: Ha: HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Draw a conclusion: Use a hypothesis test to determine if the linear relationship between the number of parking tickets a student receives during a semester and their GPA during the same semester is statistically significant at the 0.05 level of significance. # of Tickets 1 2 3 5 7 8 GPA 3.6 3.9 2.4 3.1 3.5 4.0 2.8 3.0 2.2 2.1 1.7 Solution: First state the hypotheses: H0: Ha: Next, set up the hypothesis test and determine the critical value: d.f. = 13, a = 0.05 t0.025 = Reject if |t | ≥ t/2 , or if |t | ≥ 2.160  = 0  ≠ 0 2.160

The correlation is statistically significant. HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Solution (continued): Gather the data and calculate the necessary sample statistics: n = 15, r = –0.587 Finally, draw a conclusion: Reject if |t | ≥ t/2 , or if |t | ≥ 2.160 Since |t | = 2.614 is greater than t/2 = 2.160, we will reject the null hypothesis. The correlation is statistically significant.  –2.614

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Using Critical Values to Determine Statistical Significance: The correlation coefficient, r, is statistically significant if the absolute value of the correlation is greater than the critical value in the Pearson Correlation Coefficient Table. |r | > r

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Determine the significance: If |r | > r Yes, correlation is significant Determine whether the following values of r are statistically significant. r = 0.52, n = 19, a = 0.05 r = 0.456, Yes r = 0.52, n = 19, a = 0.01 r = 0.575, No

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Coefficient of Determination: The coefficient of determination, r 2, is the measure of the amount of variation in y explained by the variation in x.

HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Coefficient of Determination : If the correlation between the number of rooms in a house and its price is r = 0.65, how much of the variation in price can be explained by the relationship between the two variables? Solution: r = 0.65 r 2  So about 42.3% of the variation in the price of a house can be explained by the relationship between the two variables. 0.423

Copy Data from Hawkes to Excel and Paste in cell A1.

Statistically significant if |r | > r. , But, 0.331 < 0.735 4 0.95 0.99 5 0.878 0.959 6 0.811 0.917 7 0.754 0.875 8 0.707 0.834 9 0.666 0.798 10 0.632 0.765 11 0.602 0.735 12 0.576 0.708 r = 0.735 Statistically significant if |r | > r. , But, 0.331 < 0.735 Therefore, r is not statistically significant at the 0.01 level.

Copy Data from Hawkes to Excel and Paste in cell A1.