Presentation is loading. Please wait.

Presentation is loading. Please wait.

Scatter Plots and Correlation

Similar presentations


Presentation on theme: "Scatter Plots and Correlation"— Presentation transcript:

1 Scatter Plots and Correlation
Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. HAWKES LEARNING SYSTEMS math courseware specialists Section 12.1 Scatter Plots and Correlation

2 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Definitions: Scatter Plot – a graph on the coordinate plane which contains one point for each pair of data. Independent Variable – placed on the x-axis, this variable causes a change in the dependent variable. Also known as the predictor variable or the explanatory variable. (x) Dependent Variable – placed on the y-axis, this variable changes in response to the independent variable. Also known as the response variable. (y) b0 = y-intercept b1 = slope

3 Draw a scatter plot to represent the following data:
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Draw a scatter plot: Draw a scatter plot to represent the following data: Hours of Study 0.5 1 1.5 2 3 4.5 5 Test Grade 72 84 68 85 77 81 48 90 99 88 Solution:

4 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Scatter Plots: We are interested in the relationship between the two variables being studied. Like any other relationship, some relationships are stronger than others. Linear Relationship – when a relationship seems to follow a straight line. Positive Slope (+b1) – indicates that as the values of one variable increase, so do the values of the other variable. (Higher Salary (x) , More Consumption (y)) Negative Slope (-b1)– indicates that as the values of one variable increase, the values of the other variable decrease. (Higher Price (x), Less Consumption (y))

5 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Types of Slopes: x x The hours you study for a final exam and your final grade. The price of a used car and the number of miles it was driven.

6 No Correlation– no relationship between the variables.
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Correlation: Correlation is the mathematical term for the relationship between the two variables x and y. Strong Linear Correlation – when a relationship seems to follow a straight line. Weak Linear Correlation– when a relationship seems to follow a straight line, but the points are more scattered. No Correlation– no relationship between the variables.

7 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Types of Relationships:

8 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Correlation coefficient: Pearson Correlation Coefficient,  – the parameter that measures the strength of a linear relationship for the population. Correlation Coefficient, r – measures how strongly one variable is linearly dependent upon the other for a sample. When calculating the correlation coefficient, round your answers to three decimal places.

9 Correlation Coefficient (r)
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Correlation Coefficient (r) –1 ≤ r ≤ 1 Close to –1 means a strong negative correlation. Close to 0 means no correlation. Close to 1 means a strong positive correlation.

10 Calculate the correlation coefficient, r, for the data shown below.
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Find the value of r: Calculate the correlation coefficient, r, for the data shown below. Hours of Study 0.5 1 1.5 2 3 4.5 5 Test Grade 72 84 68 85 77 81 48 90 99 88 Solution: n = 10, ∑x = 20, ∑y = 792, ∑xy = 1690, ∑x2 = 66, ∑y2 = 64528  0.490

11 Significant Linear Relationship (Two-Tailed Test):
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Null and Alternative Hypotheses Testing for Linear Relationships: Significant Linear Relationship (Two-Tailed Test): H0:  = 0 (Implies there is no significant linear relationship.) Ha:  ≠ 0 (Implies there is a significant linear relationship.) Negative Linear Relationship (Left-Tailed Test): H0:  ≥ 0 Ha:  < 0 Positive Linear Relationship (Right-Tailed Test): H0:  ≤ 0 Ha:  > 0

12 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Testing Statistic: with d.f. = n – 2 To determine if the test statistic calculated from the sample is statistically significant we will need to look at the critical value.

13 Significant Linear Relationship (Two-Tailed Test):
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Rejection Rule for Testing Linear Relationships: Significant Linear Relationship (Two-Tailed Test): Reject H0 if |t | ≥ t/2 Negative Linear Relationship (Left-Tailed Test): Reject H0 if t ≤ –t Positive Linear Relationship (Right-Tailed Test): Reject H0 if t ≥ t

14 State the null of alternative hypotheses.
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Steps for Hypothesis Testing: State the null of alternative hypotheses. Set up the hypothesis test by choosing the test statistic and determining the values of the test statistic that would lead to rejecting the null hypothesis. Gather data and calculate the necessary sample statistics. Draw a conclusion.

15 First state the hypotheses: H0: Ha:
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Draw a conclusion: Use a hypothesis test to determine if the linear relationship between the number of parking tickets a student receives during a semester and their GPA during the same semester is statistically significant at the 0.05 level of significance. # of Tickets 1 2 3 5 7 8 GPA 3.6 3.9 2.4 3.1 3.5 4.0 2.8 3.0 2.2 2.1 1.7 Solution: First state the hypotheses: H0: Ha: Next, set up the hypothesis test and determine the critical value: d.f. = 13, a = 0.05 t0.025 = Reject if |t | ≥ t/2 , or if |t | ≥ 2.160  = 0  ≠ 0 2.160

16 The correlation is statistically significant.
HAWKES LEARNING SYSTEMS math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Solution (continued): Gather the data and calculate the necessary sample statistics: n = 15, r = –0.587 Finally, draw a conclusion: Reject if |t | ≥ t/2 , or if |t | ≥ 2.160 Since |t | = is greater than t/2 = 2.160, we will reject the null hypothesis. The correlation is statistically significant. –2.614

17 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Using Critical Values to Determine Statistical Significance: The correlation coefficient, r, is statistically significant if the absolute value of the correlation is greater than the critical value in the Pearson Correlation Coefficient Table. |r | > r

18 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Determine the significance: If |r | > r Yes, correlation is significant Determine whether the following values of r are statistically significant. r = 0.52, n = 19, a = 0.05 r = 0.456, Yes r = 0.52, n = 19, a = 0.01 r = 0.575, No

19 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Coefficient of Determination: The coefficient of determination, r 2, is the measure of the amount of variation in y explained by the variation in x.

20 HAWKES LEARNING SYSTEMS
math courseware specialists Regression, Inference, and Model Building 12.1 Scatter Plots and Correlation Coefficient of Determination : If the correlation between the number of rooms in a house and its price is r = 0.65, how much of the variation in price can be explained by the relationship between the two variables? Solution: r = 0.65 r 2  So about 42.3% of the variation in the price of a house can be explained by the relationship between the two variables. 0.423

21

22 Copy Data from Hawkes to Excel and Paste in cell A1.

23 Statistically significant if |r | > r. , But, 0.331 < 0.735
4 0.95 0.99 5 0.878 0.959 6 0.811 0.917 7 0.754 0.875 8 0.707 0.834 9 0.666 0.798 10 0.632 0.765 11 0.602 0.735 12 0.576 0.708 r = 0.735 Statistically significant if |r | > r. , But, < 0.735 Therefore, r is not statistically significant at the 0.01 level.

24

25

26

27

28

29 Copy Data from Hawkes to Excel and Paste in cell A1.


Download ppt "Scatter Plots and Correlation"

Similar presentations


Ads by Google