Residuals, Influential Points, and Outliers


Objective: To develop an understanding of the impact of unusual features in the relationship between two quantitative variables.

Residual = Observed y – Predicted y, for a given value of x. Residuals are used to find the LSRL (least-squares regression line), the line of best fit: it is the line that makes the sum of the squared residuals as small as possible.
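
The residual definition above can be sketched in a few lines of Python. This is a minimal illustration, not the slides' own method: the helper name `fit_lsrl` and the data values are made up for the example, and the slope and intercept come from the standard least-squares formulas.

```python
def fit_lsrl(xs, ys):
    """Return (a, b) for the least-squares line y-hat = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx            # slope
    a = y_bar - b * x_bar    # intercept
    return a, b

# Made-up illustrative data (not from the slides)
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

a, b = fit_lsrl(xs, ys)
predicted = [a + b * x for x in xs]

# Residual = observed y - predicted y, one per data point
res = [y - y_hat for y, y_hat in zip(ys, predicted)]

for x, y, r in zip(xs, ys, res):
    print(f"x={x}  observed={y}  residual={r:+.2f}")
```

For any least-squares line with an intercept, the residuals add up to zero (apart from rounding), which is a quick sanity check on the arithmetic.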

Residual Plot: We use this to decide whether the original data actually follow a linear pattern. Random scatter around zero, with no visible pattern, suggests that a linear model is appropriate.

Bad Residual Plots: curved patterns, or increasing or decreasing spread in the scatter.
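
As a rough illustration of a curved residual pattern, the sketch below fits a line to data that actually follow a curve and lists the residuals in order of x. The data (points on y = x^2) and the helper name `fit_lsrl` are assumptions made up for this example.

```python
def fit_lsrl(xs, ys):
    """Return (a, b) for the least-squares line y-hat = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx
    return y_bar - b * x_bar, b

# Hypothetical curved data: y = x^2, which a straight line cannot capture
xs = [0, 1, 2, 3, 4, 5, 6]
ys = [x ** 2 for x in xs]

a, b = fit_lsrl(xs, ys)
res = [y - (a + b * x) for x, y in zip(xs, ys)]

# Residuals listed in order of increasing x
print([round(r, 6) for r in res])
# prints [5.0, 0.0, -3.0, -4.0, -3.0, 0.0, 5.0]
```

The residuals are positive at both ends and negative in the middle. That U shape is the kind of curved pattern that signals a linear model is not appropriate, even though a line can always be fitted.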

Properties of Residual Plots: Always put the residuals on the y-axis. You may use either the x-values or the predicted y-values on the x-axis (Minitab uses x-values by default); in either case the plot shows the same pattern. On your graphing calculator, RESID appears in the LIST menu after you have run LinReg(a + bx). Be sure to rerun LinReg(a + bx) for each new set of data so that RESID is updated.

Additional Items That Can Influence the LSRL: outliers, influential points, and leverage.

Outliers: An outlier creates a large residual, and large residuals can change the LSRL. Notice, however, that the regression line does not change drastically because of an outlier in the y-direction when its x-value is near the mean of the x-values.

Leverage: A point has high leverage when its x-value is far from the mean of the x-values.
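
The slides describe leverage only informally. One common way to put a number on it in simple linear regression is the hat value h = 1/n + (x - x̄)² / Σ(x - x̄)²; that formula is an addition here, not something the slides state. The sketch below applies it to made-up x-values, and the function name `leverages` is hypothetical.

```python
def leverages(xs):
    """Hat values for simple linear regression: 1/n + (x - x_bar)^2 / Sxx."""
    n = len(xs)
    x_bar = sum(xs) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return [1 / n + (x - x_bar) ** 2 / sxx for x in xs]

# Made-up x-values: four clustered together, one far from the rest
xs = [1, 2, 3, 4, 20]
h = leverages(xs)

for x, h_i in zip(xs, h):
    print(f"x={x:>2}  leverage={h_i:.3f}")
```

Only the x-values enter the calculation, which matches the slide's definition: leverage depends on how far x is from the mean, not on y. Here the point at x = 20 has by far the largest value.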

Influential Point: An observation is said to be influential if removing it from the data set would significantly change the LSRL. Most texts reserve the term for outliers with leverage in the x-direction; points that are unusual only in the y-direction are simply called outliers.

Note: Though it is tempting, we cannot simply remove outliers or influential points from our data set. The best thing to do is to create an LSRL for the data with the point and another without it. Once you compare these two lines of fit, you will often learn a great deal about the data that you are trying to model.
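
The with-and-without comparison can be sketched as follows. All data here are made up for illustration: five points lying exactly on y = 2x, plus two hypothetical unusual points, one that is an outlier only in the y-direction and one with high leverage in the x-direction. The helper name `fit_lsrl` is also an assumption.

```python
def fit_lsrl(points):
    """Return (a, b) for the least-squares line y-hat = a + b*x."""
    xs, ys = zip(*points)
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx
    return y_bar - b * x_bar, b

# Made-up base data lying exactly on y = 2x
base = [(1, 2), (2, 4), (3, 6), (4, 8), (5, 10)]

y_outlier = (3, 15)    # unusual y, but x sits at the mean of the x-values
x_leverage = (15, 5)   # x far from the mean, and off the trend

a0, b0 = fit_lsrl(base)                  # without any unusual point
a1, b1 = fit_lsrl(base + [y_outlier])    # with the y-direction outlier
a2, b2 = fit_lsrl(base + [x_leverage])   # with the high-leverage point

print(f"base only:           y-hat = {a0:.2f} + {b0:.2f}x")
print(f"with y-outlier:      y-hat = {a1:.2f} + {b1:.2f}x")
print(f"with leverage point: y-hat = {a2:.2f} + {b2:.2f}x")
```

In this example the y-direction outlier shifts the line upward but leaves the slope at 2, while the high-leverage point pulls the slope down to about 0.08. Comparing the fits side by side shows which point is influential without deleting anything from the data.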

2000 Presidential Election

Resource: http://arts.bev.net/roperldavid/politics/fl2000.htm