R Squared.

Slides:



Advertisements
Similar presentations
AP Statistics Section 3.2 C Coefficient of Determination
Advertisements

AP Statistics Section 3.2 B Residuals
R Squared. r = r = -.79 y = x y = x if x = 15, y = ? y = (15) y = if x = 6, y = ? y = (6)
Linear regression and correlation
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression and Correlation
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
1 Relationships We have examined how to measure relationships between two categorical variables (chi-square) one categorical variable and one measurement.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Chapter 2 – Simple Linear Regression - How. Here is a perfect scenario of what we want reality to look like for simple linear regression. Our two variables.
Simple Linear Regression NFL Point Spreads – 2007.
Linear Regression Analysis
SE-280 Dr. Mark L. Hornick 1 Statistics Review Linear Regression & Correlation.
Biostatistics Unit 9 – Regression and Correlation.
Linear Regression When looking for a linear relationship between two sets of data we can plot what is known as a scatter diagram. x y Looking at the graph.
Linear Regression Least Squares Method: the Meaning of r 2.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Linear Regression Least Squares Method: an introduction.
Regression Regression relationship = trend + scatter
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Regression Lines. Today’s Aim: To learn the method for calculating the most accurate Line of Best Fit for a set of data.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
9.2A- Linear Regression Regression Line = Line of best fit The line for which the sum of the squares of the residuals is a minimum Residuals (d) = distance.
Intro to Regression POL 242. Summary Regression is the process by which we fit a line to depict the relationship between two variables (usually both interval.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
11/26/2015 V. J. Motto 1 Chapter 1: Linear Models V. J. Motto M110 Modeling with Elementary Functions 1.5 Best-Fit Lines and Residuals.
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
Chapter 6 (cont.) Difference Estimation. Recall the Regression Estimation Procedure 2.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
Correlation.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Unit 3 Sections 9.2 – Day 1.  After we determine that a relationship between two variables exists (correlation), the next step is determine the line.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
3.2 - Residuals and Least Squares Regression Line.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Chapter 14, continued More simple linear regression Download this presentation.
Section 2.8 Distance and Midpoint Formulas; Circles.
Least Square Regression Line. Line of Best Fit Our objective is to fit a line in the scatterplot that fits the data the best As just seen, the best fit.
Regression lines A line of best fit should: Go through ( x , y )
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
LEAST – SQUARES REGRESSION
Statistics 101 Chapter 3 Section 3.
Linear Regression Special Topics.
Fitting Linear Models to Data
Regression and Correlation
The Lease Squares Line Finite 1.3.
מפגש 2 חשיבה תוצאתית שמעיה דוד
Section 13.7 Linear Correlation and Regression
Regression and Residual Plots
AP Stats: 3.3 Least-Squares Regression Line
LEAST – SQUARES REGRESSION
Linear Regression.
Least Squares Method: the Meaning of r2
The Least-Squares Line Introduction
Section 2: Linear Regression.
Lesson 2.2 Linear Regression.
HW# : Complete the last slide
CALCULATING EQUATION OF LEAST SQUARES REGRESSION LINE
Regression & Prediction
R = R Squared
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Ch 9.
X ⦁ X = 64 ±8 ±14 X ⦁ X ⦁ X =
Presentation transcript:

R Squared

r = -.944

r = -.79

Which value for Y is a more accurate prediction for the given X value? y = -0.9402x + 43.721 y = -0.8141x + 9.1332 if x = 15, y = ? if x = 6, y = ? y = -0.9402(15) + 43.721 y = -0.8141(6) + 9.1332 y = 29.6195 y = 4.2486 Which value for Y is a more accurate prediction for the given X value?

To find how well the Line of Best Fit actually fits the data, we can find a number called R-Squared by using the following formula: 1- Sum of squared distances between the actual and predicted Y values actual Y values and their mean

For example, here’s how to find the R Squared value for the data/graph below: X Y 3 40 10 35 11 30 15 32 22 19 26 23 24 28 18 6 Equation for Line of Best Fit: y = .94x + 43.7 Correlation = -.94

Equation for Line of Best Fit: y = .94x + 43.7 Predicted Y Value Error Error Squared Distance between Y values and their mean Mean distances squared 3 40 10 35 11 30 15 32 22 19 26 23 24 28 18 6 Mean: Sum:

Equation for Line of Best Fit: y = .94x + 43.7 Predicted Y Value Error Error Squared Distance between Y values and their mean Mean distances squared 3 40 40.88 .88 .77 14.8 219.04 10 35 34.30 -.70 .49 9.8 96.04 11 30 33.36 3.36 11.29 4.8 23.04 15 32 29.60 -2.40 5.76 6.8 46.24 22 19 23.02 4.02 16.16 -6.2 38.44 26 -2.98 8.88 .8 .64 23 24 22.08 -1.92 3.69 -1.2 1.44 28 17.38 -4.62 21.34 -3.2 10.24 18 -.62 .38 -7.2 51.84 6 10.80 -19.2 368.65 Mean: 25.2 Sum: 91.81 855.60

1- 1- 1- 0.11 =.89 91.81 855.60 To calculate “R Squared”… Sum of squared distances between the actual and predicted Y values actual Y values and their mean 1- 91.81 855.60 1- 0.11 =.89

The value we got for R Squared was .89 OK. Don’t kill me. Remember this was the data/graph we were finding “R Squared” for? The value we got for R Squared was .89 Here’s a short-cut. To find R Squared… …Square r X Y 3 40 10 35 11 30 15 32 22 19 26 23 24 28 18 6 r = -.944 r2 = -.944 • -.944 r2 = .89

R Squared To determine how well the regression line fits the data, we find a value called R-Squared (r2) To find r2, simply square the correlation The closer r2 is +1, the better the line fits the data r2 will always be a positive number

r = -.944 r2 = .89

r = -.79 r2 = .62