Lecture 14 Review of Lecture 13 What we’ll talk about today?

Slides:



Advertisements
Similar presentations
Assumptions underlying regression analysis
Advertisements

Correlation and regression
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
Chapter Topics Types of Regression Models
Regression Diagnostics - I
BCOR 1020 Business Statistics Lecture 26 – April 24, 2007.
Regression Diagnostics Checking Assumptions and Data.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Slide Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Business Statistics - QBM117 Statistical inference for regression.
Chapter 7 Forecasting with Simple Regression
Advantages of Multivariate Analysis Close resemblance to how the researcher thinks. Close resemblance to how the researcher thinks. Easy visualisation.
Descriptive Methods in Regression and Correlation
Regression and Correlation Methods Judy Zhong Ph.D.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Scatterplot and trendline. Scatterplot Scatterplot explores the relationship between two quantitative variables. Example:
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Maths Study Centre CB Open 11am – 5pm Semester Weekdays
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Chapter 12: Correlation and Linear Regression 1.
Quantitative Methods Residual Analysis Multiple Linear Regression C.W. Jackson/B. K. Gordor.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Howard Community College
Lecture Slides Elementary Statistics Twelfth Edition
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Statistical Data Analysis - Lecture /04/03
Correlation, Bivariate Regression, and Multiple Regression
AP Statistics Chapter 14 Section 1.
Statistics for Managers using Microsoft Excel 3rd Edition
LECTURE 13 Thursday, 8th October
Chapter 12: Regression Diagnostics
(Residuals and
Slides by JOHN LOUCKS St. Edward’s University.
Lecture Slides Elementary Statistics Thirteenth Edition
Diagnostics and Transformation for SLR
Week 5 Lecture 2 Chapter 8. Regression Wisdom.
1. An example for using graphics
CHAPTER 29: Multiple Regression*
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
6-1 Introduction To Empirical Models
Residuals The residuals are estimate of the error
Motivational Examples Three Types of Unusual Observations
LESSON 4.4. MULTIPLE LINEAR REGRESSION. Residual Analysis
Solution 8 12/4/2018 F P1 P2 RESI1 SRES1 TRES1 HI1 FITS1
The greatest blessing in life is
Tutorial 8 Table 3.10 on Page 76 shows the scores in the final examination F and the scores in two preliminary examinations P1 and P2 for 22 students in.
Solution 9 1. a) From the matrix plot, 1) The assumption about linearity seems ok; 2).The assumption about measurement errors can not be checked at this.
Regression is the Most Used and Most Abused Technique in Statistics
Chapter 4, Regression Diagnostics Detection of Model Violation
Solution 7 1. a). The scatter plot or the residuals vs fits plot
Three Measures of Influence
Ch11 Curve Fitting II.
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
The Examination of Residuals
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
Checking Assumptions Primary Assumptions Secondary Assumptions
Chapter 13 Multiple Regression
Algebra Review The equation of a straight line y = mx + b
Diagnostics and Transformation for SLR
Presentation transcript:

Lecture 14 Review of Lecture 13 What we’ll talk about today? Standard Regression Assumptions: a). about the form of the model b). about the measurement errors c). about the predictor variables d). about the observations II Examples of the Anscombe’s Quartet Data show that a). Gross Violations of assumptions will lead to serious problems b). Summary statistics may miss or overlook the features of the data. III Types of Residuals a). Ordinary b). Standardized c). Studentized What we’ll talk about today? I Graphical Methods for Exploring Data Structures a) Graphs before fitting b) Graphs after fitting 11/17/2018 ST3131, Lecture 14

Graphical Methods Graphical methods play an important role in data analysis, especially in linear regression analysis. It can reveal some important features that summary statistics may miss, e.g., the scatter plots of the Anscombe’s data. a). Graphs before fitting a model Functions: 1) Detect outliers 2) Suggest a model b). Graphs after fitting a model Functions: 1) Checking assumption violations 2). Detecting outliers 11/17/2018 ST3131, Lecture 14

Graphs before fitting a model Functions: 1) Detect outliers, high leverage point or influential points 2) Recognize the patterns 3) Explore the relationship between variables Types : 1). One-dimensional 2). Two-dimensional 3). Rotating plot 4). Dynamic graphs 11/17/2018 ST3131, Lecture 14

One-dimensional graphs Histogram Stem-and-leaf display Dot plot Box plot Functions: (1) Distribution of a single variable (2) Detect outliers, high leverage points, or influential points Two-dimensional graphs Matrix plot: pair-wise scatter plot Purpose: explore patterns of pair-wise variables 11/17/2018 ST3131, Lecture 14

11/17/2018 ST3131, Lecture 14 Stem-and-leaf of Y N = 15 Leaf Unit = 0.10 2 10 88 3 11 0 4 11 2 6 11 45 6 11 7 11 9 (1) 12 0 7 12 3 6 12 4 5 12 66 3 12 8 2 13 01 11/17/2018 ST3131, Lecture 14

11/17/2018 ST3131, Lecture 14

Drawback when p>1, the scatter plots of Y vs Xj may or may not show linear patterns even when Y and X1, X2, …,Xp have a good or perfect linear relationship. Hamiltan’s Data 11/17/2018 ST3131, Lecture 14

Y vs X1: Y=11.989+.004X1, t-test=.09, R_sq=0.0 Hamiltan’s Data : Y, X1, X2 Fitted Results: Y vs X1: Y=11.989+.004X1, t-test=.09, R_sq=0.0 Question: Y is uncorrelated with X1? Y vs X2: Y=10.632+.195 X2, t-test=1.74, R_sq=.188 Question: Y is uncorrelated with X2? Y vs X1, X2: Y=-4.515+3.097X1+1.032X2, F-test=39222, R_sq=1.0 Question: Y is almost perfectly linearly correlated with X1 and X2? Question: What assumption is violated by the Hamiltan’s Data? 11/17/2018 ST3131, Lecture 14

Rotating Plots: 3-dimensional plot Rotate the points in different directions s o that three-dimensional structure becomes apparent. Dynamic Graphs: p>3 Graphs are in a dynamic status instead of a static status. Good for exploring the structural and relationship in more than 3-dimensions. 11/17/2018 ST3131, Lecture 14

b) Graphs after fitting a model Functions: 1) Checking assumptions, 2) Detection of outliers, high leverage points, influential points 3). Diagnostic plots for the effect of variables Standardized Residuals-based Plots Normal Probability Plot of standardized residuals: ordered standardized residuals vs normal scores Function: Main Idea : If the residuals are normally distributed, the ordered standardized residuals should be approximately the same as the ordered normal scores. In this case, the plot should resemble a (nearly) straight-line with intercept and slope . 11/17/2018 ST3131, Lecture 14

Function: Check linearity or homogeneity assumptions on Xj 2. Scatter Plots of standardized residuals against each of the predictor variables Function: Check linearity or homogeneity assumptions on Xj Main Idea: Under the standard assumptions, the standardized residuals are nearly uncorrelated with each of the predictor variables. In this case, the residual points should be randomly scattered in the range. For example, 11/17/2018 ST3131, Lecture 14

3. Scatter Plots of standardized residuals against the fitted values. Function: Check Independence, homogeneity of the measurement errors Linearity of the data Main Idea: Under the Independence, homogeneity of the measurement errors Linearity of the data assumptions, the standardized residuals are nearly uncorrelated with the fitted values. In this case, the residual points should be randomly scattered in the range, e.g., 11/17/2018 ST3131, Lecture 14

Function: Check Independence, homogeneity of the measurement errors 4. Index Plot of standardized residuals i. e. the scatter plot of standardized residuals against the indices of observations. Function: Check Independence, homogeneity of the measurement errors Linearity of the data Main Idea: Under the assumption of independence errors, the standardized residuals should be randomly scattered within a horizontal band around 0. 11/17/2018 ST3131, Lecture 14

11/17/2018 ST3131, Lecture 14

11/17/2018 ST3131, Lecture 14

11/17/2018 ST3131, Lecture 14

11/17/2018 ST3131, Lecture 14

After-class Questions: Is graphics before fitting a model model-based? Is graphics after fitting a model model-based? Why is graphics sometimes more useful than a statistic? 11/17/2018 ST3131, Lecture 14