Elementary Statistics Correlation and Regression.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Inference for Regression
Correlation and Regression
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Chapter 12 Simple Linear Regression
Introduction to Regression Analysis
Elementary Statistics Larson Farber 9 Correlation and Regression.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Introduction to Probability and Statistics Linear Regression and Correlation.
Chapter 9: Correlation and Regression
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression and Correlation
Chapter 7 Forecasting with Simple Regression
Lecture 5 Correlation and Regression
Correlation & Regression
Correlation and Linear Regression
SIMPLE LINEAR REGRESSION
Measures of Regression and Prediction Intervals
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
Correlation and Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Correlation and Regression Chapter 9. § 9.3 Measures of Regression and Prediction Intervals.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 Regression & Correlation (1) 1.A relationship between 2 variables X and Y 2.The relationship seen as a straight line 3.Two problems 4.How can we tell.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter Correlation and Regression 9.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Lesson Testing the Significance of the Least Squares Regression Model.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Correlation and Regression
Correlation and Linear Regression
Scatter Plots and Correlation
Regression and Correlation
Correlation and Simple Linear Regression
Correlation and Regression
Correlation and Simple Linear Regression
Correlation and Regression
Correlation and Regression
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Chapter 14 Inference for Regression
Presentation transcript:

Elementary Statistics Correlation and Regression

Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes smoked per day Score on SAT Height Hours of Training Explanatory (Independent) Variable Response (Dependent) Variable A relationship between two variables Number of Accidents Shoe SizeHeight Lung Capacity Grade Point Average IQ

Negative Correlation–as x increases, y decreases x = hours of training y = number of accidents Scatter Plots and Types of Correlation Hours of Training Accidents

Positive Correlation–as x increases, y increases x = SAT score y = GPA GPA Scatter Plots and Types of Correlation Math SAT

No linear correlation x = height y = IQ Scatter Plots and Types of Correlation Height IQ

Correlation Coefficient A measure of the strength and direction of a linear relationship between two variables The range of r is from –1 to 1. If r is close to 1 there is a strong positive correlation. If r is close to –1 there is a strong negative correlation. If r is close to 0 there is no linear correlation. –1 0 1

x y Absences Final Grade Application Final Grade X Absences

xy x 2 y2y2 Computation of r n x y

r is the correlation coefficient for the sample. The correlation coefficient for the population is (rho). The sampling distribution for r is a t-distribution with n – 2 d.f. Standardized test statistic For a two tail test for significance: For left tail and right tail to test negative or positive significance: Hypothesis Test for Significance (The correlation is not significant) (The correlation is significant)

A t-distribution with 5 degrees of freedom Test of Significance You found the correlation between the number of times absent and a final grade r = – There were seven pairs of data.Test the significance of this correlation. Use = Write the null and alternative hypothesis. 2. State the level of significance. 3. Identify the sampling distribution. (The correlation is not significant) (The correlation is significant) = 0.01

t –4.032 Rejection Regions Critical Values ± t 0 4. Find the critical value. 5. Find the rejection region. 6. Find the test statistic.

t 0 –4.032 t = –9.811 falls in the rejection region. Reject the null hypothesis. There is a significant correlation between the number of times absent and final grades. 7. Make your decision. 8. Interpret your decision.

The equation of a line may be written as y = mx + b where m is the slope of the line and b is the y-intercept. The line of regression is: The slope m is: The y-intercept is: Once you know there is a significant linear correlation, you can write an equation describing the relationship between the x and y variables. This equation is called the line of regression or least squares line. The Line of Regression

Ad $ = a residual (xi,yi)(xi,yi) = a data point revenue = a point on the line with the same x-value

Calculate m and b. Write the equation of the line of regression with x = number of absences and y = final grade. The line of regression is:= –3.924x xy x 2 y2y2 x y

Absences Final Grade m = –3.924 and b = The line of regression is: Note that the point = (8.143, ) is on the line. The Line of Regression

The regression line can be used to predict values of y for values of x falling within the range of the data. The regression equation for number of times absent and final grade is: Use this equation to predict the expected grade for a student with (a) 3 absences(b) 12 absences (a) (b) Predicting y Values = –3.924(3) = = –3.924(12) = = –3.924x

The coefficient of determination, r 2, is the ratio of explained variation in y to the total variation in y. The correlation coefficient of number of times absent and final grade is r = – The coefficient of determination is r 2 = (–0.975) 2 = Interpretation: About 95% of the variation in final grades can be explained by the number of times a student is absent. The other 5% is unexplained and can be due to sampling error or other variables such as intelligence, amount of time studied, etc. The Coefficient of Determination

The Standard Error of Estimate, s e,is the standard deviation of the observed y i values about the predicted value. The Standard Error of Estimate

= xy Calculate for each x. The Standard Error of Estimate

Given a specific linear regression equation and x 0, a specific value of x, a c-prediction interval for y is: where Use a t-distribution with n – 2 degrees of freedom. The point estimate is and E is the maximum error of estimate. Prediction Intervals

Construct a 90% confidence interval for a final grade when a student has been absent 6 times. 1. Find the point estimate: The point (6, ) is the point on the regression line with x-coordinate of 6. Application

Construct a 90% confidence interval for a final grade when a student has been absent 6 times. 2. Find E, At the 90% level of confidence, the maximum error of estimate is Application

Construct a 90% confidence interval for a final grade when a student has been absent 6 times. When x = 6, the 90% confidence interval is from to Find the endpoints. Application – E = – = E = = < y <