E XAMINING R ELATIONSHIPS Residuals and Residual Plots.

Slides:



Advertisements
Similar presentations
Least-Squares Regression Section 3.3. Correlation measures the strength and direction of a linear relationship between two variables. How do we summarize.
Advertisements

AP Statistics Section 3.2 B Residuals
Residuals.
Least Squares Regression
AP Statistics.  Least Squares regression is a way of finding a line that summarizes the relationship between two variables.
Linear Regression (C7-9 BVD). * Explanatory variable goes on x-axis * Response variable goes on y-axis * Don’t forget labels and scale * Statplot 1 st.
Chapter 8 Linear regression
Chapter 8 Linear regression
Lesson Diagnostics on the Least- Squares Regression Line.
Chapter 3 Bivariate Data
Scatter Diagrams and Linear Correlation
1 Relationships We have examined how to measure relationships between two categorical variables (chi-square) one categorical variable and one measurement.
C HAPTER 2 S CATTER PLOTS, C ORRELATION, L INEAR R EGRESSION, I NFERENCES FOR R EGRESSION By: Tasha Carr, Lyndsay Gentile, Darya Rosikhina, Stacey Zarko.
Haroon Alam, Mitchell Sanders, Chuck McAllister- Ashley, and Arjun Patel.
C HAPTER 3: E XAMINING R ELATIONSHIPS. S ECTION 3.3: L EAST -S QUARES R EGRESSION Correlation measures the strength and direction of the linear relationship.
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
VCE Further Maths Least Square Regression using the calculator.
Correlation with a Non - Linear Emphasis Day 2.  Correlation measures the strength of the linear association between 2 quantitative variables.  Before.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Correlation Correlation measures the strength of the LINEAR relationship between 2 quantitative variables. Labeled as r Takes on the values -1 < r < 1.
Residuals Target Goal: I can construct and interpret residual plots to assess if a linear model is appropriate. 3.2c Hw: pg 192: 48, 50, 54, 56, 58 -
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Section 2.2 Correlation A numerical measure to supplement the graph. Will give us an indication of “how closely” the data points fit a particular line.
3.3 Least-Squares Regression.  Calculate the least squares regression line  Predict data using your LSRL  Determine and interpret the coefficient of.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
WARM-UP Do the work on the slip of paper (handout)
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Warm Up Feel free to share data points for your activity. Determine if the direction and strength of the correlation is as agreed for this class, for the.
S CATTERPLOTS Correlation, Least-Squares Regression, Residuals Get That Program ANSCOMBE CRICKETS GESSEL.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Residuals and Residual Plots Section Starter A study showed that the correlation between GPA and hours of study per week was r =.6 –Which.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
A P STATISTICS LESSON 3 – 3 (DAY 3) A P STATISTICS LESSON 3 – 3 (DAY 3) RISIDUALS.
Linear Regression Day 1 – (pg )
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
1.5 Linear Models Warm-up Page 41 #53 How are linear models created to represent real-world situations?
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Warm-up Get a sheet of computer paper/construction paper from the front of the room, and create your very own paper airplane. Try to create planes with.
CHAPTER 3 Describing Relationships
Chapter 4.2 Notes LSRL.
Sections Review.
Statistics 101 Chapter 3 Section 3.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Cautions about Correlation and Regression
AP Stats: 3.3 Least-Squares Regression Line
Least-Squares Regression
Section 3.3 Linear Regression
AP Statistics, Section 3.3, Part 1
CHAPTER 3 Describing Relationships
^ y = a + bx Stats Chapter 5 - Least Squares Regression
CHAPTER 3 Describing Relationships
GET OUT p.161 HW!.
Least Squares Regression
Objectives (IPS Chapter 2.3)
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Homework: pg. 180 #6, 7 6.) A. B. The scatterplot shows a negative, linear, fairly weak relationship. C. long-lived territorial species.
CHAPTER 3 Describing Relationships
Homework: PG. 204 #30, 31 pg. 212 #35,36 30.) a. Reading scores are predicted to increase by for each one-point increase in IQ. For x=90: 45.98;
Presentation transcript:

E XAMINING R ELATIONSHIPS Residuals and Residual Plots

F ACTS A BOUT L EAST S QUARES L INE Must be clear on Explanatory & Response Variables Switching the variables changes your equation Line always passes through the point (x-bar,y- bar) This always gives us a point to start w/ or use during graphing Correlation is closely related to slope Smaller r = smaller effect of x on predictions r and r 2 help define the strength of a straight line relationship between the variables Higher values = stronger relationship

R ESIDUALS Press the Button and I will use Linear Regression to tell your Future!! (or at least something close to it!!) Just like our friend ZOLTAR we can make predictions using our Line of Best Fit. However, do we know just how good our predictions are? Would we be willing to put a lot of CASH MONEY down to back them up? Luckily, we have an indicator in statistics that can help us decide the strength of our predictions AND tell us if a line is the “Best Fit”.

R ESIDUALS Unless your r value is perfect, your predictions won’t be A residual is the difference between the actual value and your predicted value  Each value observed value has a residual  The sum of the residuals is always 0 (or really, really close)  Should be… If not, that equation might not be the best fit! -roundoff error – when earlier values are rounded, the sum may not equal exacty 0 Residual =

G RAPHING THE LSL ON YOUR S CATTER P LOT Using the Bone Data, Let’s look at how we get the residuals (and how your calculator does it) FemurHumerus y = x Plug in all your x values into the equation and get a predicted y-hat FemurPredicted Humerus (y-hat) ( 38 ) ( 56 ) FemurResidual (y – y-hat) 3841 – – Now, subtract the PREDICTED value from ACTUAL value.

R ESIDUAL P LOT Scatterplot of the residuals against the explanatory variable (x). Assess the fit of the regression line Does your plot show the line fits? ResidualsFit No patternGood Fit CurveNon Linear Increasing spread Worse predictions for larger x Decreasing Spread Smaller x, worse predictions oIndividual Points w/ Large Residuals = Outliers in y oIndividual Points extreme in x = Influential Points Why use Residuals? The residual plot describes how well a LINEAR model fits our data

R ESIDUAL P LOT ON C ALCULATOR Plot the scatterplot of the data Find the least squares equation (LinReg y=a+bx) Put the equation into Y1 and graph it In L3, You need to get the residuals (quickly) Go to the top of L3 – 2 nd Stat - RESID Press enter (*Your calculator finds them for you!! YIPPEEE!!) You have to have STAT: CALC: 8; 1 st, before you run the Residuals… You’re calculator has to have an equation to plug into to find the Residuals Now do a scatterplot with Xlist = L1 and Ylist = L3 (residuals) The line in the middle is the least squares line. You can do 1 Variable Stats your RESID list to find out if the residual sum is 0.

R ESIDUALS O N C ALCULATOR (S CREENSHOTS ) – B Y HAND PRACTICE ? Run the GESSEL program Scatterplot Calc Function Regression Stats Plot w/ EQ Residual List Function Residual Plot

I NFLUENTIAL P OINT VS. O UTLIER Outlier – observation that is outside overall pattern (out of whack in the Y direction) Influential Point – observation that IF removed would dramatically change the result of least squares line and/or predictions (way out in the X direction)

I NFLUENTIAL P OINT VS. O UTLIER Let’s Change Child 19’s test score from 121 to 85 and see what happens to the EQ and Graph ORIGINALNEW Notice the minimal change in the equation and graph… This is an example of why Child 19 is considered an outlier. An “outlier” in y has a minimal effect on the equation and subsequent predicted values. The change here is in the R values.

I NFLUENTIAL P OINT VS. O UTLIER Let’s Change Child 18’s test score from 57 to 85 and see what happens to the EQ and Graph ORIGINALNEW Notice the dramatic change in the equation and graph… This is an example of why Child 18 is considered an influential point. A point in the extreme x can dramatically effect the position of the least squares line.

H OMEWORK Anscombe Discovery #46