Outliers… Leverage… Influential points….

Slides:



Advertisements
Similar presentations
Outliers and Influential Points
Advertisements

Statistical Methods Lecture 28
Section 10-3 Regression.
Linear Equations Review. Find the slope and y intercept: y + x = -1.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.4 Cautions in Analyzing.
Regression Wisdom.
Getting to Know Your Scatterplot and Residuals
Chapter 9 Regression Wisdom
Analysis of Linear Functions. Trick Question? #6 x = ? This is a VERTICAL line. Slope (m) – UNDEFINED Steepness – MORE or NO Solution Y intercept (b)
Psychology 202b Advanced Psychological Statistics, II February 8, 2011.
Stat 112: Lecture 15 Notes Finish Chapter 6: –Review on Checking Assumptions (Section ) –Outliers and Influential Points (Section 6.7) Homework.
Linear vs. exponential growth Linear vs. exponential growth: t = 0 A = 1x(1+1) 0 = 1 A = 1x0 + 1 = 1.
Chapter 5 Linear Models Linear Models are studied intensively because: Easiest to understand and analyze Relationships are often linear Variables with.
Chapter 9: Regression Alexander Swan & Rafey Alvi.
Regression Diagnostics Checking Assumptions and Data.
Outliers and Influential Data Points in Regression Analysis James P. Stevens sujin jang november 10, 2008.
Stat 112: Lecture 16 Notes Finish Chapter 6: –Influential Points for Multiple Regression (Section 6.7) –Assessing the Independence Assumptions and Remedies.
Bell Ringer Write an equation in slope – intercept form that is parallel to the line 5
Exploring relationships between variables Ch. 10 Scatterplots, Associations, and Correlations Ch. 10 Scatterplots, Associations, and Correlations.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Regression Wisdom.  Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can’t take it for granted.)
Stat 112 Notes 15 Today: –Outliers and influential points. Homework 4 due on Thursday.
Linear Regression Chapter 8.
Stat 112 Notes 16 Today: –Outliers and influential points in multiple regression (Chapter 6.7)
Dr. C. Ertuna1 Issues Regarding Regression Models (Lesson - 06/C)
DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:
Tutorial 4 MBP 1010 Kevin Brown. Correlation Review Pearson’s correlation coefficient – Varies between – 1 (perfect negative linear correlation) and 1.
Copyright © 2010 Pearson Education, Inc. Chapter 9 Regression Wisdom.
* SCATTER PLOT – GRAPH WITH MANY ORDERS PAIRS * LINE OF BEST FIT – LINE DRAWN THROUGH DATA THAT BEST REPRESENTS IT.
Answers to Part 2 Review Pg #1  % over 50 – r=.69  % under 20 – r=-.71  Full time Fac – r=.09  Gr. On time-r=-.51.
Stats Project Chapter 9 By: Rebecca Scott and Sarah Musgrave.
Get out p. 193 HW and notes. LEAST-SQUARES REGRESSION 3.2 Interpreting Computer Regression Output.
Chapter 9 Regression Wisdom
Regression Wisdom. Getting the “Bends”  Linear regression only works for linear models. (That sounds obvious, but when you fit a regression, you can’t.
Tutorial 5 Thursday February 14 MBP 1010 Kevin Brown.
Residuals, Influential Points, and Outliers
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 9 Regression Wisdom.
MATH 2311 Section 5.4. Residuals Examples: Interpreting the Plots of Residuals The plot of the residual values against the x values can tell us a lot.
Outliers and Influential points Imagine a scatter plot of the heights and weights of adult men….would it be a positive or negative association? Should.
CHAPTER 3 Describing Relationships
Unit 4 Lesson 4 (5.4) Summarizing Bivariate Data
Chapter 5 Lesson 5.3 Summarizing Bivariate Data
Non-Linear Models Tractable non-linearity Intractable non-linearity
Cautions about Correlation and Regression
Chapter 12: Regression Diagnostics
Model Inverse and Joint Variation
Section 3.3 Regression Wisdom
Simple Linear Regression
Week 5 Lecture 2 Chapter 8. Regression Wisdom.
1. Describe the Form and Direction of the Scatterplot.
Residuals The residuals are estimate of the error
Motivational Examples Three Types of Unusual Observations
Solution 9 1. a) From the matrix plot, 1) The assumption about linearity seems ok; 2).The assumption about measurement errors can not be checked at this.
Regression is the Most Used and Most Abused Technique in Statistics
Influential points.
Residuals, Influential Points, and Outliers
Lesson 2.2 Linear Regression.
Bell Ringer 1. Write the equation of the regression line for the data. Explain the meaning of the y-intercept and the slope of the line. 2. A new burger.
Checking Assumptions Primary Assumptions Secondary Assumptions
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Problems of Tutorial 9 (Problem 4.12, Page 120) Download the “Data for Exercise ” from the class website. The data consist of 1 response variable.
Lesson 2.2 Linear Regression.
Chapter 3.2 Regression Wisdom.
Chapter 9 Regression Wisdom.
Cases. Simple Regression Linear Multiple Regression.
Chapter 9 Regression Wisdom.
Writing Linear Equations
Homework: PG. 204 #30, 31 pg. 212 #35,36 30.) a. Reading scores are predicted to increase by for each one-point increase in IQ. For x=90: 45.98;
Model Inverse and Joint Variation
Presentation transcript:

Outliers… Leverage… Influential points…

(model does not change drastically) Outlier or Influential point? (or neither?) Outlier: - Low leverage - Weakens “r” WITHOUT “outlier” WITH “outlier” (model does not change drastically)

(slope changes drastically!) Outlier or Influential point? (or neither?) Influential Point: - HIGH leverage - Weakens “r” WITHOUT “outlier” WITH “outlier” (slope changes drastically!)

Outlier or Influential point? (or neither?) - HIGH leverage - STRENGTHENS “r” Linear model WITH and WITHOUT “outlier”

Outliers, leverage, and influence If a point’s x-value is far from the mean of the x-values, it is said to have high leverage. (it has the potential to change the regression line significantly) A point is considered influential if omitting it gives a very different model.

fin~