Designing Monte Carlo Simulation Studies
Xitao Fan, Ph.D.
Chair Professor & Dean, Faculty of Education, University of Macau



Getting Involved in Monte Carlo Simulation
Fan, X., Felsovalyi, A., Sivo, S. A., & Keenan, S. (2002). SAS for Monte Carlo studies: A guide for quantitative researchers. Cary, NC: SAS Institute, Inc.
Fan, X. (2012). Designing simulation studies. In H. Cooper (Ed.), Handbook of Research Methods in Psychology, Vol. 2 (pp ). Washington, DC: American Psychological Association.

Getting Involved in Monte Carlo Simulation
Peugh, J., & Fan, X. (In press). Enumeration index performance in generalized growth mixture models: A Monte Carlo test of Muthén’s (2003) hypothesis. Structural Equation Modeling.
Peugh, J., & Fan, X. (In press). Modeling unobserved heterogeneity using latent profile analysis: A Monte Carlo simulation. Structural Equation Modeling.
Peugh, J., & Fan, X. (2012). How well does growth mixture modeling identify heterogeneous growth trajectories? A simulation study examining GMM’s performance characteristics. Structural Equation Modeling, 19.
Fan, X., & Sivo, S. A. (2009). Using Δgoodness-of-fit indices in assessing mean structure invariance. Structural Equation Modeling, 16.
Fan, X., & Sivo, S. (2007). Sensitivity of fit indices to model misspecification and model types. Multivariate Behavioral Research, 42.
Sivo, S. A., Fan, X., Witta, E. L., & Willse, J. T. (2006). The search for "optimal" cutoff properties: Fit index criteria in structural equation modeling. Journal of Experimental Education, 74.

Getting Involved in Monte Carlo Simulation
Fan, Xitao, & Fan, Xiaotao. (2005). Power of latent growth modeling for detecting linear growth: Number of measurements and comparison with other analytic approaches. Journal of Experimental Education, 73.
Fan, X., & Sivo, S. A. (2005). Sensitivity of fit indices to misspecified structural or measurement model components: Rationale of two-index strategy revisited. Structural Equation Modeling, 12.
Fan, Xitao, & Fan, Xiaotao. (2005). Using SAS for Monte Carlo simulation research in structural equation modeling. Structural Equation Modeling, 12.
Sivo, S., Fan, X., & Witta, L. (2005). The biasing effects of unmodeled ARMA time series processes on latent growth curve model estimates. Structural Equation Modeling, 12.
Fan, X. (2003). Two approaches for correcting correlation attenuation caused by measurement error: Implications for research practice. Educational and Psychological Measurement, 63(6).
Fan, X. (2003). Power of latent growth modeling for detecting group differences in linear growth trajectory parameters. Structural Equation Modeling, 10.

Getting Involved in Monte Carlo Simulation
Yin, P., & Fan, X. (2001). Estimating R² shrinkage in multiple regression: A comparison of different analytical methods. Journal of Experimental Education, 69.
Fan, X., & Wang, L. (1999). Comparing logistic regression with linear discriminant analysis in their classification accuracy. Journal of Experimental Education, 67.
Fan, X., Thompson, B., & Wang, L. (1999). The effects of sample size, estimation methods, and model specification on SEM fit indices. Structural Equation Modeling: A Multidisciplinary Journal, 6.
Fan, X., & Wang, L. (1998). Effects of potential confounding factors on fit indices and parameter estimates for true and misspecified SEM models. Educational and Psychological Measurement, 58.
Fan, X., & Wang, L. (1996). Comparability of jackknife and bootstrap results: An investigation for a case of canonical analysis. Journal of Experimental Education, 64.

What Is a Monte Carlo Simulation Study?
● “the use of random sampling techniques and often the use of computer simulation to obtain approximate solutions to mathematical or physical problems especially in terms of a range of values each of which has a calculated probability of being the solution” (Merriam-Webster On-Line).
● An empirical alternative to a theoretical approach (i.e., a solution based on statistical/mathematical theory)
● Increasingly possible because of advances in computing technology

Situations Where Simulation Is Useful
● Consequences of Assumption Violations
  * Statistical theory stipulates what the conditions should be, but does not say what happens when the data do not satisfy those conditions
● Understanding a Sample Statistic That May Not Have a Theoretical Distribution
● Many Other Situations
  * Retaining the optimal number of factors in EFA
  * Evaluating the performance of mixture modeling in identifying the latent groups
  * Assessing the consequences of failure to model correlated error structure in latent growth modeling

Basic Steps in a Simulation Study
● Asking Questions Suitable for a Simulation Study
  * Questions for which no (or no trustworthy) analytical/theoretical solutions exist
● Simulation Study Design (Example)
  * Include / manipulate the major factors that potentially affect the outcome
● Data Generation
  * Sample data generation & transformation
● Analysis (Model Fitting) for Sample Data
● Accumulation and Analysis of the Statistic(s) of Interest
● Presentation and Drawing Conclusions
  * Conclusions limited to the design conditions
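
The steps above can be sketched as a small Python skeleton. This is a minimal illustration under stated assumptions, not the workflow used in the studies cited earlier: the two design factors (sample size and population SD), the statistic of interest (the sample mean), and every function name are hypothetical choices made for brevity.

```python
import itertools
import random
import statistics


def generate_sample(n, sd, rng):
    """Data generation: draw n values from a normal population N(0, sd^2)."""
    return [rng.gauss(0.0, sd) for _ in range(n)]


def analyze(sample):
    """Analysis: compute the statistic of interest (here, the sample mean)."""
    return statistics.fmean(sample)


def run_simulation(sample_sizes, sds, n_reps, seed=2024):
    """Cross the design factors, replicate each cell, accumulate results."""
    rng = random.Random(seed)
    results = []
    for n, sd in itertools.product(sample_sizes, sds):
        # One design cell: n_reps independent samples, one estimate per sample.
        estimates = [analyze(generate_sample(n, sd, rng)) for _ in range(n_reps)]
        results.append({
            "n": n,
            "sd": sd,
            "mean_estimate": statistics.fmean(estimates),
            "empirical_se": statistics.stdev(estimates),
            "theoretical_se": sd / n ** 0.5,
        })
    return results


if __name__ == "__main__":
    for row in run_simulation([10, 50], [1.0, 2.0], n_reps=2000):
        print(row)
```

Because theory already gives the standard error of the mean, this toy design only confirms a known result; a real study would target a statistic for which no trustworthy analytical answer exists.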

An Example: Independent t-test (group variance homogeneity)
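
A simulation of this kind might look like the sketch below, which estimates the empirical Type I error rate of the pooled-variance t-test when the two population variances differ. The details of the slide's own example are not in the transcript, so the group sizes (10 and 30), the SD ratios, the number of replications, and the function names are all assumptions. The critical value 2.024 is the tabled two-tailed .05 value for 38 degrees of freedom and is only valid for these group sizes.

```python
import math
import random
import statistics

# Two-tailed .05 critical value of t with 38 df (from a t table).
# Valid only for n1 + n2 - 2 = 38, i.e. the default group sizes below.
T_CRIT_DF38 = 2.024


def pooled_t(x, y):
    """Independent-samples t statistic with pooled variance."""
    n1, n2 = len(x), len(y)
    v1, v2 = statistics.variance(x), statistics.variance(y)
    pooled_var = ((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2)
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (statistics.fmean(x) - statistics.fmean(y)) / se


def type1_error_rate(sd1, sd2, n1=10, n2=30, n_reps=4000, seed=1):
    """Proportion of replications rejecting H0 when both means are 0."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_reps):
        x = [rng.gauss(0.0, sd1) for _ in range(n1)]
        y = [rng.gauss(0.0, sd2) for _ in range(n2)]
        if abs(pooled_t(x, y)) > T_CRIT_DF38:
            rejections += 1
    return rejections / n_reps


if __name__ == "__main__":
    print("equal variances:        ", type1_error_rate(1.0, 1.0))
    print("small group, larger SD: ", type1_error_rate(3.0, 1.0))
    print("large group, larger SD: ", type1_error_rate(1.0, 3.0))
```

With equal variances the rejection rate should sit near the nominal .05. When the smaller group has the larger variance the test tends to reject too often, and in the reverse case it tends to be conservative.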

Data Generation in a Simulation Study
● Common Random Number Generators
  * binomial, Cauchy, exponential, gamma, Poisson, normal, uniform, etc.
  * All distributions are based on the uniform distribution
● Simulating Univariate Sample Data
  * Normally distributed sample data: X ~ N(μ, σ²)
  * Non-normal distribution: Fleishman (1978)
    a, b, c, d: coefficients needed for transforming the unit normal variate to a non-normal variable with specified degrees of population skewness and kurtosis.
Fleishman, A. I. (1978). A method for simulating non-normal distributions. Psychometrika, 43.
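
Fleishman's method transforms a standard normal variate Z into Y = a + bZ + cZ² + dZ³ with a = −c, where b, c, and d solve three nonlinear moment equations for the target skewness and excess kurtosis (a normal distribution has excess kurtosis 0). The sketch below solves those equations with a plain Newton iteration and a numerical Jacobian. The solver, its starting values, and all function names are assumptions for illustration. It does not check whether the requested skewness/kurtosis pair lies within the range the method can reach.

```python
import random


def fleishman_equations(b, c, d, skew, kurt):
    """Residuals of Fleishman's moment equations (kurt = excess kurtosis)."""
    f1 = b**2 + 6*b*d + 2*c**2 + 15*d**2 - 1
    f2 = 2*c*(b**2 + 24*b*d + 105*d**2 + 2) - skew
    f3 = 24*(b*d + c**2*(1 + b**2 + 28*b*d)
             + d**2*(12 + 48*b*d + 141*c**2 + 225*d**2)) - kurt
    return [f1, f2, f3]


def det3(m):
    """Determinant of a 3x3 matrix."""
    return (m[0][0]*(m[1][1]*m[2][2] - m[1][2]*m[2][1])
            - m[0][1]*(m[1][0]*m[2][2] - m[1][2]*m[2][0])
            + m[0][2]*(m[1][0]*m[2][1] - m[1][1]*m[2][0]))


def solve3(m, rhs):
    """Solve a 3x3 linear system by Cramer's rule."""
    det = det3(m)
    solution = []
    for col in range(3):
        replaced = [row[:] for row in m]
        for r in range(3):
            replaced[r][col] = rhs[r]
        solution.append(det3(replaced) / det)
    return solution


def fleishman_coefficients(skew, kurt, tol=1e-12, max_iter=100):
    """Return (a, b, c, d), starting Newton's method from the normal case."""
    x = [1.0, 0.0, 0.0]  # b, c, d for an untransformed normal variate
    h = 1e-7             # step for central-difference derivatives
    for _ in range(max_iter):
        f = fleishman_equations(*x, skew, kurt)
        if max(abs(v) for v in f) < tol:
            break
        jac = [[0.0] * 3 for _ in range(3)]
        for j in range(3):
            hi, lo = x[:], x[:]
            hi[j] += h
            lo[j] -= h
            f_hi = fleishman_equations(*hi, skew, kurt)
            f_lo = fleishman_equations(*lo, skew, kurt)
            for i in range(3):
                jac[i][j] = (f_hi[i] - f_lo[i]) / (2 * h)
        step = solve3(jac, [-v for v in f])
        x = [xi + si for xi, si in zip(x, step)]
    b, c, d = x
    return -c, b, c, d


def fleishman_sample(n, skew, kurt, mean=0.0, sd=1.0, seed=None):
    """Draw n non-normal values with the requested mean, SD, skew, kurtosis."""
    a, b, c, d = fleishman_coefficients(skew, kurt)
    rng = random.Random(seed)
    values = []
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)
        values.append(mean + sd * (a + b*z + c*z**2 + d*z**3))
    return values


if __name__ == "__main__":
    print(fleishman_coefficients(0.75, 0.80))
```

The transformed variable has mean 0 and variance 1 by construction, so rescaling by the desired mean and SD is applied last.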

Data Generation in a Simulation Study
● Sample Data from a Multivariate Normal Distribution
  * Matrix decomposition procedure (Kaiser & Dickman, 1962)
    F: k × k matrix containing principal component factor pattern coefficients obtained by applying principal component factorization to the given population inter-correlation matrix R
● Sample Data from a Multivariate Non-Normal Distribution
  * Interaction between non-normality and inter-variable correlations
  * Intermediate correlations using Fleishman coefficients (Vale & Maurelli, 1983)
  * Matrix decomposition procedure applied to the intermediate correlation matrix
Kaiser, H. F., & Dickman, K. (1962). Sample and population score matrices and sample correlation matrices from an arbitrary population correlation matrix. Psychometrika, 27.
Vale, C. D., & Maurelli, V. A. (1983). Simulating multivariate nonnormal distributions. Psychometrika, 48.
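
The matrix decomposition idea for the multivariate normal case can be sketched as follows. Note the substitution: instead of the principal component factor pattern matrix F named on the slide, this sketch uses a Cholesky factor L of R. Both satisfy (factor)(factor)' = R, so both impose the target correlations on uncorrelated standard normal variates, but the Cholesky version is easier to write without a linear algebra library. The function names and the example correlation matrix are assumptions, and R must be positive definite.

```python
import math
import random


def cholesky(r):
    """Lower-triangular L with L L' = r (r must be positive definite)."""
    k = len(r)
    low = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1):
            s = sum(low[i][p] * low[j][p] for p in range(j))
            if i == j:
                low[i][j] = math.sqrt(r[i][i] - s)
            else:
                low[i][j] = (r[i][j] - s) / low[j][j]
    return low


def correlated_normal_sample(r, n, seed=None):
    """Return n rows of standard normal variables with correlation matrix r."""
    rng = random.Random(seed)
    low = cholesky(r)
    k = len(r)
    data = []
    for _ in range(n):
        z = [rng.gauss(0.0, 1.0) for _ in range(k)]  # uncorrelated variates
        data.append([sum(low[i][p] * z[p] for p in range(i + 1))
                     for i in range(k)])
    return data


def pearson(x, y):
    """Sample Pearson correlation between two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    R = [[1.0, 0.5, 0.3],
         [0.5, 1.0, 0.4],
         [0.3, 0.4, 1.0]]
    rows = correlated_normal_sample(R, 20000, seed=7)
    cols = list(zip(*rows))
    print(pearson(cols[0], cols[1]), pearson(cols[0], cols[2]),
          pearson(cols[1], cols[2]))
```

For the non-normal case, the same decomposition would be applied to the intermediate correlation matrix before each variable is passed through its Fleishman transformation.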

Checking the Validity of Data Generation Procedures
● Example: Multivariate non-normal sample data (three correlated variables)
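
One common way to run such a check is to generate a very large sample and compare its descriptive statistics with the intended population values. The slide's three-variable example is not in the transcript, so the sketch below uses a single exponential variable as a stand-in, because its population moments are known (mean 1, SD 1, skewness 2, excess kurtosis 6). The function names and the sample size are assumptions.

```python
import random
import statistics


def sample_skewness(x):
    """Moment-based sample skewness, m3 / m2^1.5."""
    n = len(x)
    m = statistics.fmean(x)
    m2 = sum((v - m) ** 2 for v in x) / n
    m3 = sum((v - m) ** 3 for v in x) / n
    return m3 / m2 ** 1.5


def sample_excess_kurtosis(x):
    """Moment-based sample excess kurtosis, m4 / m2^2 - 3."""
    n = len(x)
    m = statistics.fmean(x)
    m2 = sum((v - m) ** 2 for v in x) / n
    m4 = sum((v - m) ** 4 for v in x) / n
    return m4 / m2 ** 2 - 3.0


def compare_to_targets(x, targets):
    """Pair each observed statistic with its population target."""
    observed = {
        "mean": statistics.fmean(x),
        "sd": statistics.pstdev(x),
        "skewness": sample_skewness(x),
        "kurtosis": sample_excess_kurtosis(x),
    }
    return {name: (observed[name], targets[name]) for name in targets}


if __name__ == "__main__":
    rng = random.Random(11)
    big_sample = [rng.expovariate(1.0) for _ in range(200000)]
    targets = {"mean": 1.0, "sd": 1.0, "skewness": 2.0, "kurtosis": 6.0}
    for name, (obs, target) in compare_to_targets(big_sample, targets).items():
        print(f"{name:9s} observed={obs:.3f} target={target:.3f}")
```

For correlated variables the same comparison would be extended to the sample correlation matrix. Higher moments converge slowly, so kurtosis usually needs the largest sample before it settles near its target.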

From Simulation Design to Population Data Parameters
● Obtaining the population parameters may take considerable effort: t-test example
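
As a rough illustration of this translation step, suppose a t-test design manipulates a standardized mean difference and a variance ratio. Those design values must be converted into population means and SDs before any data can be generated. The slide's own worked example is not in the transcript, so everything here is an assumption: the function name, the choice to fix group 2 at mean 0 and SD 1, and the definition of the effect size as the mean difference divided by the square root of the average of the two variances.

```python
import math


def t_test_population_parameters(effect_size, variance_ratio,
                                 mean2=0.0, sd2=1.0):
    """Translate design factors into population means and SDs.

    variance_ratio = sd1^2 / sd2^2
    effect_size    = (mean1 - mean2) / sqrt((sd1^2 + sd2^2) / 2)
    """
    sd1 = sd2 * math.sqrt(variance_ratio)
    average_sd = math.sqrt((sd1 ** 2 + sd2 ** 2) / 2)
    mean1 = mean2 + effect_size * average_sd
    return {"mean1": mean1, "sd1": sd1, "mean2": mean2, "sd2": sd2}


if __name__ == "__main__":
    for d in (0.0, 0.5):
        for ratio in (1.0, 4.0):
            print(d, ratio, t_test_population_parameters(d, ratio))
```

A different definition of the effect size (for example, one that weights the variances by group size) would give different population means for the same design values, which is part of why this step deserves care.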

From Simulation Design to Population Data Parameters
● Latent growth model example
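
For a linear latent growth model, the population mean vector and covariance matrix of the repeated measures follow from the growth parameters: the mean at time t is (intercept mean) + t × (slope mean), and the covariance between times t and s is (intercept variance) + (t + s) × (intercept-slope covariance) + t·s × (slope variance), plus the residual variance on the diagonal. The sketch below computes both, assuming equal and uncorrelated residual variances across occasions. The function name and the parameter values in the usage example are hypothetical, since the slide's own values are not in the transcript.

```python
def lgm_population_moments(time_scores, int_mean, slope_mean,
                           int_var, slope_var, int_slope_cov, resid_var):
    """Model-implied means and covariances for a linear growth model.

    Assumes uncorrelated residuals with the same variance at every occasion.
    """
    means = [int_mean + t * slope_mean for t in time_scores]
    cov = []
    for i, t in enumerate(time_scores):
        row = []
        for j, s in enumerate(time_scores):
            value = int_var + (t + s) * int_slope_cov + t * s * slope_var
            if i == j:
                value += resid_var  # residual variance only on the diagonal
            row.append(value)
        cov.append(row)
    return means, cov


if __name__ == "__main__":
    means, cov = lgm_population_moments(
        time_scores=[0, 1, 2, 3],
        int_mean=10.0, slope_mean=2.0,
        int_var=4.0, slope_var=1.0, int_slope_cov=0.5,
        resid_var=2.0,
    )
    print(means)
    for row in cov:
        print(row)
```

The resulting mean vector and covariance matrix are the population parameters that a multivariate data generator would then use.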


Accumulation and Analysis of the Statistic(s) of Interest
● Accumulation: Straightforward or Complicated
  * Typically, not an automated process
  * Statistical software used
  * Analytical techniques involved
  * Type of statistic(s) of interest, etc.
● Analysis
  * Follow-up data analysis may be simple or complicated
  * Not different from many other data analysis situations
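
Once the estimates from all replications of a design cell have been accumulated, the follow-up analysis often starts with a few summary measures. The sketch below computes bias, relative bias, empirical standard error, and root mean squared error for one cell. These measures are commonly reported in simulation studies, but the slides do not say which ones were used, so the choice of measures and the function name are assumptions.

```python
import math
import statistics


def summarize_estimates(estimates, true_value):
    """Summarize accumulated estimates of one parameter in one design cell."""
    mean_estimate = statistics.fmean(estimates)
    bias = mean_estimate - true_value
    squared_errors = [(e - true_value) ** 2 for e in estimates]
    return {
        "mean_estimate": mean_estimate,
        "bias": bias,
        # Relative bias is undefined when the true value is zero.
        "relative_bias": bias / true_value if true_value != 0 else None,
        "empirical_se": statistics.stdev(estimates),
        "rmse": math.sqrt(statistics.fmean(squared_errors)),
    }


if __name__ == "__main__":
    print(summarize_estimates([0.48, 0.52, 0.55, 0.47, 0.51], true_value=0.5))
```

Running this once per design cell yields a table of performance measures that can itself be analyzed, for example to see which design factors account for most of the variation in bias.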

Presentation and Drawing Conclusions
● Presentation
  * Representativeness & Exceptions
  * Graphic Presentations
  * Typical: table after table of results. No one has the time to read the tables!
● Drawing Conclusions
  * Validity & generalizability depend on the adequacy & appropriateness of the simulation design
  * Conclusions must be limited by the design conditions and levels.