Checking the data and assumptions before the final analysis.

Slides:



Advertisements
Similar presentations
Simple linear models Straight line is simplest case, but key is that parameters appear linearly in the model Needs estimates of the model parameters (slope.
Advertisements

Assumptions underlying regression analysis
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Multiple regression refresher Austin Troy NR 245 Based primarily on material accessed from Garson, G. David Multiple Regression. Statnotes: Topics.
Topic 9: Remedies.
FACTORIAL ANOVA Overview of Factorial ANOVA Factorial Designs Types of Effects Assumptions Analyzing the Variance Regression Equation Fixed and Random.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Class 16: Thursday, Nov. 4 Note: I will you some info on the final project this weekend and will discuss in class on Tuesday.
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
FACTORIAL ANOVA.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
ANOVA notes NR 245 Austin Troy
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Psych 524 Andrew Ainsworth Data Screening 1. Data check entry One of the first steps to proper data screening is to ensure the data is correct Check out.
Lecture 25 Regression diagnostics for the multiple linear regression model Dealing with influential observations for multiple linear regression Interaction.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Every achievement originates from the seed of determination. 1Random Effect.
Lecture 24 Multiple Regression (Sections )
STAT 104 Section 6 Daniel Moon. Agenda Review Midterm 1 Practice Problems.
Lecture 24: Thurs., April 8th
Regression Diagnostics Checking Assumptions and Data.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Inferential Statistics: SPSS
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
Statistics for Business and Economics Dr. TANG Yu Department of Mathematics Soochow University May 28, 2007.
Basics of Regression Analysis. Determination of three performance measures Estimation of the effect of each factor Explanation of the variability Forecasting.
Analysis of Residuals Data = Fit + Residual. Residual means left over Vertical distance of Y i from the regression hyper-plane An error of “prediction”
Inferential Statistics
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
ANOVA ANOVA is used when more than two groups are compared In order to conduct an ANOVA, several assumptions must be made – The population from which the.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
Week 5Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing Residuals.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Data Analysis.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Multivariate Data Analysis Chapter 2 – Examining Your Data
I271B QUANTITATIVE METHODS Regression and Diagnostics.
B AD 6243: Applied Univariate Statistics Multiple Regression Professor Laku Chidambaram Price College of Business University of Oklahoma.
KNN Ch. 3 Diagnostics and Remedial Measures Applied Regression Analysis BUSI 6220.
t Test Assumptions: Paired/dependent/matched Samples:
D/RS 1013 Data Screening/Cleaning/ Preparation for Analyses.
1 Regression Review Population Vs. Sample Regression Line Residual and Standard Error of Regression Interpretation of intercept & slope T-test, F-test.
Variance Stabilizing Transformations. Variance is Related to Mean Usual Assumption in ANOVA and Regression is that the variance of each observation is.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Analyze Of VAriance. Application fields ◦ Comparing means for more than two independent samples = examining relationship between categorical->metric variables.
Statistical Data Analysis - Lecture /04/03
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
B&A ; and REGRESSION - ANCOVA B&A ; and
CH 5: Multivariate Methods
Chapter 12: Regression Diagnostics
Y - Tests Type Based on Response and Measure Variable Data
CHAPTER 29: Multiple Regression*
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
Simple Regression Mary M. Whiteside, PhD.
Single-Factor Studies
Single-Factor Studies
Unit 3 – Linear regression
Solution 9 1. a) From the matrix plot, 1) The assumption about linearity seems ok; 2).The assumption about measurement errors can not be checked at this.
Adequacy of Linear Regression Models
MANOVA Control of experimentwise error rate (problem of multiple tests). Detection of multivariate vs. univariate differences among groups (multivariate.
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Diagnostics and Remedial Measures
REVIEW Course Review.
Presentation transcript:

Checking the data and assumptions before the final analysis. Cleaning up your act Checking the data and assumptions before the final analysis. 4/5/2019 AGR206

Topics Detection of problems: Fixing problems: Population represented by sample. Missing data. Normality of errors. Linearity and Lack of Fit. Homogeneity of variance. Outliers. Univariate Multivariate Multicollinearity. Fixing problems: Transformations 4/5/2019 AGR206

Population and sample. Scope of regression. 4/5/2019 AGR206

Normality. Usually, it is assumed that errors are normally distributed. In JMP, obtain errors (residuals) and then use the Analyze Distributions platform. Example: xmpl_Pyield.jmp 4/5/2019 AGR206

Linearity/Lack of Fit Lack of Fit compares the variance within replicated X values to the variance around the model. 4/5/2019 AGR206

Lack of Fit ANOVA Table Example: xmpl_Pyield.jmp 4/5/2019 AGR206

Homogeneity of Variance Variance of errors are assumed equal. Plot errors vs. Yhat and X’s. Use UnEqual variance test in the Fit Y by X platform. Example: xmpl_Pyield 4/5/2019 AGR206

Outliers Univariate: Multivariate: Check the studentized residual. Compare to t(n-p, 0.001). Or correct by Bonferroni. Check X dimension: hii<2(m+1)/n hii is the leverage m is the number of X variables n is the number of observations Multivariate: Mahalanobis squared distance ~ 2(m) if variables are normal. 4/5/2019 AGR206

Outliers Example: xmpl_UVoutl.jmp Example: xmpl_MVoutl.jmp 4/5/2019 AGR206