Limitations of ANOVA ©2005 Dr. B. C. Paul. The Data Size Effect We Did ANOVA with one factor We Did ANOVA with one factor We Did it with two factors (Driver.

Slides:



Advertisements
Similar presentations
Week 2 – PART III POST-HOC TESTS. POST HOC TESTS When we get a significant F test result in an ANOVA test for a main effect of a factor with more than.
Advertisements

Hypothesis testing 5th - 9th December 2011, Rome.
AMMBR - final stuff xtmixed (and xtreg) (checking for normality, random slopes)
Measures of Center: Mean, Median, Mode Please view this tutorial and answer the follow-up questions on loose leaf to turn in to your teacher.
1 Reliability in Scales Reliability is a question of consistency do we get the same numbers on repeated measurements? Low reliability: reaction time High.
Introduction to Regression ©2005 Dr. B. C. Paul. Things Favoring ANOVA Analysis ANOVA tells you whether a factor is controlling a result It requires that.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
1 Hypothesis Testing Chapter 8 of Howell How do we know when we can generalize our research findings? External validity must be good must have statistical.
Xtreg and xtmixed: recap We have the standard regression model (here with only one x): but think that the data are clustered, and that the intercept (c.
Solving Equations with the Variable on Both Sides Objectives: to solve equations with the variable on both sides.
Getting to Know Your Scatterplot and Residuals
Statistics for the Social Sciences
January 7, afternoon session 1 Multi-factor ANOVA and Multiple Regression January 5-9, 2008 Beth Ayers.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
The t-test:. Answers the question: is the difference between the two conditions in my experiment "real" or due to chance? Two versions: (a) “Dependent-means.
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
Matching level of measurement to statistical procedures
Lecture 9: One Way ANOVA Between Subjects
Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of scientific research When you know the system: Estimation.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 5): Outliers Fall, 2008.
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
Correlations and T-tests
1 4. Multiple Regression I ECON 251 Research Methods.
Chapter 11: Inference for Distributions
Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of a scientific research When you know the system: Estimation.
Assumption and Data Transformation. Assumption of Anova The error terms are randomly, independently, and normally distributed The error terms are randomly,
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Two-Way Balanced Independent Samples ANOVA Overview of Computations.
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Objectives of Multiple Regression
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Selecting the Correct Statistical Test
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Unequal Variance and ANOVA ©2005 Dr. B. C. Paul. ANOVA Assumptions ANOVA assumes the populations sampled in each class are normally distributed Also assumes.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
ANOVA Greg C Elvers.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 23, Slide 1 Chapter 23 Comparing Means.
Analysis of Covariance David Markham
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Proportions for the Binomial Distribution ©2005 Dr. B. C. Paul.
One Way ANOVA ©2005 Dr. B. C. Paul modified 2009 Note – The concepts presented in these slides are considered common knowledge to those familiar with statistics.
Spring /6.831 User Interface Design and Implementation1 Lecture 15: Experiment Analysis.
Two Way ANOVA ©2005 Dr. B. C. Paul. ANOVA Application ANOVA allows us to review data and determine whether a particular effect is changing our results.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Economics 173 Business Statistics Lecture 20 Fall, 2001© Professor J. Petry
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
ANALYSIS OF VARIANCE (ANOVA) BCT 2053 CHAPTER 5. CONTENT 5.1 Introduction to ANOVA 5.2 One-Way ANOVA 5.3 Two-Way ANOVA.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Analysis of Residuals ©2005 Dr. B. C. Paul. Examining Residuals of Regression (From our Previous Example) Set up your linear regression in the Usual manner.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
GETTING COMFORTABLE WITH YOUR DATA II One way to turn your data into knowledge, and another way that’s probably better Winter Storm 2010 Stats workshop.
Multiple Regression ©2005 Dr. B. C. Paul. Problems with Regression So Far We have only been able to consider one factor as controlling at a time Everything.
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Welcome to MM570 Psychological Statistics
Stats Lunch: Day 8 Repeated-Measures ANOVA and Analyzing Trends (It’s Hot)
Quadratic Regression ©2005 Dr. B. C. Paul. Fitting Second Order Effects Can also use least square error formulation to fit an equation of the form Math.
ANOVA, Regression and Multiple Regression March
Dr. C. Ertuna1 Hypothesis Testing 2 Samples (Chapter - 04/D)
Problems with Variance ©2005 Dr. B. C. Paul. Determining What To Do We have looked at techniques that depend on normally distributed data with variance.
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
Welcome to MM570 Psychological Statistics Unit 5 Introduction to Hypothesis Testing Dr. Ami M. Gates.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Week 2 – PART III POST-HOC TESTS.
Statistical Data Analysis - Lecture10 26/03/03
What we’ll cover today Transformations Inferential statistics
Presentation transcript:

Limitations of ANOVA ©2005 Dr. B. C. Paul

The Data Size Effect We Did ANOVA with one factor We Did ANOVA with one factor We Did it with two factors (Driver and Car) We Did it with two factors (Driver and Car) We could have used the same procedure to do three way (maybe add country or city driving) We could have used the same procedure to do three way (maybe add country or city driving) Or Four Way (maybe add season of the year) Or Four Way (maybe add season of the year) What happens to our test data requirements? What happens to our test data requirements?

Data Set Size Illustration Lets say we tested 4 cars with pairs of before and after MPG Lets say we tested 4 cars with pairs of before and after MPG Minimum we would need 2 drivers on each car Minimum we would need 2 drivers on each car Ie need 8 paired driving tests Ie need 8 paired driving tests Why 8 tests for four cars? Why 8 tests for four cars? Need a way to measure variability not accounted for by the test Need a way to measure variability not accounted for by the test If all differences are accounted for you have no SS error If all differences are accounted for you have no SS error With no denominator you cannot do the F test to decide if your effect is significant With no denominator you cannot do the F test to decide if your effect is significant

The Exponential Explosion Now lets look for a Driver Effect Now lets look for a Driver Effect Lets try 4 drivers Lets try 4 drivers Each driver drives each car twice Each driver drives each car twice 8 pairs of data is now 32 pairs 8 pairs of data is now 32 pairs Now lets look for a Country, Urban Effect Now lets look for a Country, Urban Effect We’ll run the tests in downtown of a large city, in a suburb, and in the country We’ll run the tests in downtown of a large city, in a suburb, and in the country Now we need 96 pairs of data Now we need 96 pairs of data Now lets see if it depends on season Now lets see if it depends on season We’ll do Winter, Spring, Summer, Fall We’ll do Winter, Spring, Summer, Fall Now we need 384 pairs of data Now we need 384 pairs of data

Problem of ANOVA As you get more and more effects to study the amount of data needed grows exponentially As you get more and more effects to study the amount of data needed grows exponentially There are practical limits to how much you can do at once There are practical limits to how much you can do at once There are specialized techniques that can be done There are specialized techniques that can be done We had every driver test every car under every condition. We had every driver test every car under every condition. Eng 540 Design of Experiments does a lot for elegant alternatives Eng 540 Design of Experiments does a lot for elegant alternatives

Relief from SPSS at a Price Our experimental design called for equal numbers of tests under all conditions Our experimental design called for equal numbers of tests under all conditions Actually the procedure I showed with SPSS will run the test without equal numbers of tests under every condition. Actually the procedure I showed with SPSS will run the test without equal numbers of tests under every condition. The Price The Price If I do not check every driver in every car I will loose my ability to measure interaction effects (it will go into the SS error) If I do not check every driver in every car I will loose my ability to measure interaction effects (it will go into the SS error) If I have equal numbers of cases in every cell the test tends to be forgiving (“Robust”) against violations of the normal distribution assumption If I have equal numbers of cases in every cell the test tends to be forgiving (“Robust”) against violations of the normal distribution assumption If my cells are uneven my model will start spitting out more poorly fit answers if I violate the normal distribution assumption. If my cells are uneven my model will start spitting out more poorly fit answers if I violate the normal distribution assumption.

The Who Done It Mystery ANOVA will easily tell you whether an effect exists ANOVA will easily tell you whether an effect exists When it says the driver makes a difference When it says the driver makes a difference Did you have 2 wacko drivers and the other 8 are all the same? Did you have 2 wacko drivers and the other 8 are all the same? It tells you whether an effect exists, but it might still come from only part of the data set It tells you whether an effect exists, but it might still come from only part of the data set

Coping With Who Done It You can run all sorts of plots to see if its just a few results that are different. You can run all sorts of plots to see if its just a few results that are different. You can run statistical tests to test different subsets of the data against each other You can run statistical tests to test different subsets of the data against each other The little options button on the SPSS field where you said Ok leads to a menu of optional tests and plots The little options button on the SPSS field where you said Ok leads to a menu of optional tests and plots Not going to deal with them right now other than telling you they are there. Not going to deal with them right now other than telling you they are there.

Now I Know – What Does It Mean? We found that the MPG improvement from the Red Rooster Carburetor varied with the driver We found that the MPG improvement from the Red Rooster Carburetor varied with the driver We put an ‘individual results may vary” disclaimer on our advertizing We put an ‘individual results may vary” disclaimer on our advertizing Ok, but how much do individual results vary? Ok, but how much do individual results vary? ANOVA doesn’t tell us ANOVA doesn’t tell us For some types of engineering works we have to know how big a difference something will make (Yes-No doesn’t always cut it) For some types of engineering works we have to know how big a difference something will make (Yes-No doesn’t always cut it)

Dealing with large numbers of possible causes May have a large, but more randomly organized set of data and conditions that might have influenced it. May have a large, but more randomly organized set of data and conditions that might have influenced it. Trying to do ANOVA for 15 affect variables would be unwieldy Trying to do ANOVA for 15 affect variables would be unwieldy Solution to the I need to quantify the effect and for maxing out the computer memory (and corporate budget) from doing a 15 way ANOVA is a method called “Regression” Solution to the I need to quantify the effect and for maxing out the computer memory (and corporate budget) from doing a 15 way ANOVA is a method called “Regression” Our next exciting topic!!! Our next exciting topic!!!