Conditions of application Assumption checking. Assumptions for mixed models and RM ANOVA Linearity  The outcome has a linear relationship with all of.

Slides:



Advertisements
Similar presentations
Quntative Data Analysis SPSS Exploring Assumptions
Advertisements

To go further: intra- versus interindividual variability.
Methods for Dummies Isobel Weinberg & Alexandra Westley.
STA305 week 31 Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Class 16: Thursday, Nov. 4 Note: I will you some info on the final project this weekend and will discuss in class on Tuesday.
Lab3: writing up results and ANOVAs with within and between factors 1.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
Statistics for the Social Sciences Psychology 340 Spring 2005 Prediction cont.
BA 555 Practical Business Analysis
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Lecture 24: Thurs., April 8th
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #20.
Simple Linear Regression Analysis
Regression Diagnostics Checking Assumptions and Data.
Correlation and Regression Analysis
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Multiple Regression Dr. Andy Field.
Slide 1 Testing Multivariate Assumptions The multivariate statistical techniques which we will cover in this class require one or more the following assumptions.
Two-Way Analysis of Variance STAT E-150 Statistical Methods.
Checking Regression Model Assumptions NBA 2013/14 Player Heights and Weights.
Conditions of applications. Key concepts Testing conditions of applications in complex study design Residuals Tests of normality Residuals plots – Residuals.
Correlation & Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Inference for Regression BPS chapter 23 © 2010 W.H. Freeman and Company.
Chapter 3: Examining relationships between Data
Inference for Regression
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
ANOVA and Linear Regression ScWk 242 – Week 13 Slides.
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
11/4/2015Slide 1 SOLVING THE PROBLEM Simple linear regression is an appropriate model of the relationship between two quantitative variables provided the.
© Buddy Freeman, Independence of error assumption. In many business applications using regression, the independent variable is TIME. When the data.
Scatterplots & Regression Week 3 Lecture MG461 Dr. Meredith Rolfe.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Data Analysis.
Residual Analysis for ANOVA Models KNNL – Chapter 18.
Assumptions 5.4 Data Screening. Assumptions Parametric tests based on the normal distribution assume: – Independence – Additivity and linearity – Normality.
1/11/2016Slide 1 Extending the relationships found in linear regression to a population is procedurally similar to what we have done for t-tests and chi-square.
ANOVA, Regression and Multiple Regression March
Applied Quantitative Analysis and Practices LECTURE#30 By Dr. Osman Sadiq Paracha.
Residual Plots Unit #8 - Statistics.
D/RS 1013 Data Screening/Cleaning/ Preparation for Analyses.
Slide 1 Regression Assumptions and Diagnostic Statistics The purpose of this document is to demonstrate the impact of violations of regression assumptions.
Maths Study Centre CB Open 11am – 5pm Semester Weekdays
Regression Chapter 5 January 24 – Part II.
Multiple Linear Regression Introduction The common univariate techniques for data analyses do not allow us to take into account the effect of other covariates/confounders.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Statistical Data Analysis - Lecture /04/03
Correlation, Bivariate Regression, and Multiple Regression
Statistics for the Social Sciences
What is normal anyway?! Disclaimer: I am not an expert!
Assumption of normality
Chapter 12: Regression Diagnostics
Applied Statistical Analysis
(Residuals and
CHAPTER 29: Multiple Regression*
EQ: How well does the line fit the data?
Solution 9 1. a) From the matrix plot, 1) The assumption about linearity seems ok; 2).The assumption about measurement errors can not be checked at this.
When You See (This), You Think (That)
CHAPTER 12 More About Regression
Inferential Statistics
Exercise 1: Gestational age and birthweight
Presentation transcript:

Conditions of application Assumption checking

Assumptions for mixed models and RM ANOVA Linearity  The outcome has a linear relationship with all of the predictors Homoscedasticity  The residuals are equally variable at any level of the predictors Normality of the residuals

Fitting the model Fit the same model from last week:  SIGNAL = (b 0 + u 0 ) + b 1 ACCELERATION + b 2 COIL + b 3 COILxACCELERATION + ε  Be sure to use the long dataset, and ALL values of RESOLUTION  ACCELERATION is a covariate, not a factor  SAVE the residuals and the predicted values

Residual plots Create a histogram of the residuals (Analyze → Descriptive Statistics → Frequencies → Chart), and a scatterplot of the residuals v.s. the predicted values (Graphs → Chart builder). What are we testing for?  Linearity (no pattern)  Homoscedasticity (constant variance)  Normality of residuals (bell-shaped histogram)

Analyze the residual plots Do our plots look okay?  Scatterplot Looks decent  Histogram Looks plausibly normal, given the sample size Weird bi-modality

Normality tests Go to Analyze → Descriptive Statistics → Explore

Tests of normality Check the Shapiro-Wilk and the Kolmogorov- Smirnov Neither value is statistically significant What does that mean?  We have no evidence of non-normality  We pass!  Be careful, though: these tests are poor at finding bimodality

Analyze the Q-Q plot What are we testing?  Normality of the residuals How does it look?  Decent, except for the extreme tails  Probably okay

RM ANOVA Back to the wide file! Fit an RM ANOVA. Ignore RESOLUTION SAVE Cook's Distance

Checking Cook's D Transform the dataset to long form Only keep ID and the Cook's D variables Plot Cook's D v.s. ID

How does it look? Not too bad None of the points are wildly farther than the others It looks none of the points were wildly influential  Subject 23 had a big impact, though

Now... Check the conditions of application for the same models, only now only for subjects with RESOLUTION = 2

Tests of normality Uh-oh We fail our tests! And they don't have a lot of power with small samples, so this might be really bad

Q-Q plots Not looking good. There's something bad happening in the tails

Residual histogram That spike in the middle is problematic

Residuals v.s. predicted At least this looks okay

Residuals v.s. acceleration This explains it: it's the weird interaction between acceleration and coil that we noticed before

Questions? About the homework?