1 Model Evaluation and Selection. 2 Example Objective: Demonstrate how to evaluate a single model and how to compare alternative models.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Chapter 3 Introduction to Quantitative Research
Chapter 3 Introduction to Quantitative Research
The External Review Report as Ground for ENQA Board Decision Tibor Szanto Vice-President, ENQA Barcelona, 17 March 2009.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.
Addition Facts
Supporting further and higher education JISC VRE Programme Quality Planning 12/01/05.
1 A gender and helping study with a different outcome.
Assumptions underlying regression analysis
SADC Course in Statistics (Session 09)
Chapter 7 Sampling and Sampling Distributions
Copyright Pearson Prentice Hall
Chapter 4: Basic Estimation Techniques
You will need Your text Your calculator
Elementary Statistics
Chapter 10: The t Test For Two Independent Samples
Lecture 14 chi-square test, P-value Measurement error (review from lecture 13) Null hypothesis; alternative hypothesis Evidence against null hypothesis.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
(This presentation may be used for instructional purposes)
Non-Parametric Statistics
Chi-Square and Analysis of Variance (ANOVA)
1 Comparing SEM to the Univariate Model data from Grace and Keeley (2006) Ecological Applications.
Ch. 1: Number Relationships
Chapter 15 ANOVA.
Chapter 5 Test Review Sections 5-1 through 5-4.
Addition 1’s to 20.
Psychology Practical (Year 2) PS2001 Correlation and other topics.
1 CommunicationEnquiry – Planning– Planning Interdependence of Organism Y7 Cells test Sustainable Earth Investigating Dissolving Enquiry - ReflectingReflecting.
Putting Statistics to Work
Week 1.
Copyright © 2010 Pearson Addison-Wesley. All rights reserved. Chapter 10 One- and Two-Sample Tests of Hypotheses.
Writing up results from Structural Equation Models
Research Methodology Statistics Maha Omair Teaching Assistant Department of Statistics, College of science King Saud University.
Chapter 12 Analyzing Semistructured Decision Support Systems Systems Analysis and Design Kendall and Kendall Fifth Edition.
Chapter 11: The t Test for Two Related Samples
1 Chapter 20: Statistical Tests for Ordinal Data.
Chapter 13 and 14. Error  Type 1: False positive. The effect was not really present, but our statistics lead us to believe that it is.  Type 2: False.
Multiple Regression and Model Building
Chapter 8 Hypothesis Test.
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
Effect Size and Meta-Analysis
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
SOC 681 James G. Anderson, PhD
Further Inference in the Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Example of Simple and Multiple Regression
Reliability, Validity, & Scaling
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Hypothesis Testing:.
1 Example 1.0: Indirect Effects and the Test of Mediation.
4.2 One Sided Tests -Before we construct a rule for rejecting H 0, we need to pick an ALTERNATE HYPOTHESIS -an example of a ONE SIDED ALTERNATIVE would.
SEM Analysis SPSS/AMOS
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
CHAPTER 11 SECTION 2 Inference for Relationships.
Measures of Fit David A. Kenny January 25, Background Introduction to Measures of Fit.
SEM Model Fit: Introduction David A. Kenny January 12, 2014.
Examples. Path Model 1 Simple mediation model. Much of the influence of Family Background (SES) is indirect.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
Measures of Fit David A. Kenny.
Chapter 13 Simple Linear Regression
Vocabulary Statistical Inference – provides methods for drawing conclusions about a population parameter from sample data Expected Values– row total *
Presentation transcript:

1 Model Evaluation and Selection

2 Example Objective: Demonstrate how to evaluate a single model and how to compare alternative models.

3 Evaluating the Sufficiency of a Single Model (followup to example of Mediation Test) When this model is run, a variety of measures of model fit will be generated. A question of importance is, "Is the fit of the model sufficiently good to yield reliable results?" The alternative model is one in which there is also an arrow from s_age to tcov. In other words, does fire severity explain the effect of stand age on cover, or, is there another pathway of influence independent of fire severity?

4 Finding Measures of Model Fit in Amos I It is always good to check the section of the output called Notes for Model. Here we can see that a minimum was achieved and the full p-value for the chi- square. P-value greater than 0.05 suggests that we could accept this model (it indicates no major deviations between data and model). The model chi-square is the most commonly used measure of absolute model fit.

5 Further Considerations of Model Chi-square It is well known that model p-values are not always the best way to decide if a model is adequate (in an absolute sense) or the best model (in a relative sense). This is a complex topic and one that lacks complete consensus. What is generally agreed upon is: (1) Chi-squares automatically increase with increasing sample size and p-values reflect increasing power for detecting deviations. (2) P-values for model chi-squares are pretty useful when sample sizes are less than 200, especially for models that do not include latent variables possessing multiple indicators. (3) It is recommended that folks look at multiple measures.

6 Further Considerations (cont.) One useful way to evaluate model adequacy is to see if the addition of pathways causes the model chi-square to drop by more than 3.84 units. This is the single-degree- of-freedom chi-square test. If adding a path reduces the chi-square by less than 3.84, it implies that the added path is not strongly supported by the data. In the current example, the chi-square is 3.243, which tells us that adding a path from s_age to tcov could only reduce model chi-square by This further indicates that our model could be considered to be adequate.

7 Finding Measures of Model Fit in Amos II Cmin means minimum chi-square. Model Fit tab gives us several measures to consider.

8 continued clicking on labels gives additional info

9 continued RMSEA indicates close fit. Also that a value of 0 (perfect fit) cannot be ruled out. An AIC for our model (the default model) of could only be reduced to a value of by saturating our model. This is less than the minimum recommended AIC difference of 2.0, suggesting models indistinguishable. BUT, AIC is often not a reliable measure.

10 continued some more The CAIC (consistent AIC) is generally viewed to be a better measure than AIC. Here we see that the default model value is more than 2.0 units smaller than the saturated model, supporting the conclusion that our model is adequate.

11 and still some more The BIC (Bayesian Information Criterion) is one of the more popular measures at the moment. In this case, the saturated model BIC is only greater, which is less than the 2.0 difference recommended for picking among models. This index tells us that while the evidence is better for the default model, the saturated model cant be ruled out.

12 and even still some more The Hoelter index relates back to our model Chi- square and its p-values. It tells us that at a sample size of 106, we would have enough power to detect an additional path from s_age to tcov with a p-value less than samples would be required to obtain a p-value less than 0.01.

13 AIC difference criteria AIC diffsupport for equivalency of models 0-2substantial 4-7weak > 10none Burnham, K.P. and Anderson, D.R Model Selection and Multimodel Inference. Springer Verlag. (second edition), p 70.

14 BIC difference criteria BIC diffsupport for difference between models 0-2weak 2-6positive 6-10strong > 10very strong Raftery, A.E Sociological Methodology. 25: , p 70

15 What do we conclude in this case? Given the data we have available, we could justify (in my view) omitting the pathway from s_age to tcov. However, we must recognize that this is an approximation of the truth. If we had more samples, would they lead us to decide that we needed to include a path from s_age to tcov? Without the additional samples we dont really know. Comparing the path coefficients for the two models would allow us to decide the scientific consequences of our model choice.

16 What is the SEM perspective on model selection? In SEM we use our scientific knowledge to guide our decisions, and this applies especially to model selection. Do we believe it serves our scientific purposes to omit the path from s_age to tcov? We certainly can present the results for the path in the following fashion if we think it merits discussion. s_agefidxtcov e1e ns -0.35

17 Final thought "Statistical tests are aids to (hopefully wise) judgement, not two-valued logical declarations of truth or falsity". Abelson, RP (1995) Statistics as Principled Argument. Lawrence Erlbaum Associates, Hillsdale, NJ, USA