Log-Linear Models & Dependent Samples Feng Ye, Xiao Guo, Jing Wang.

Slides:



Advertisements
Similar presentations
Structural Equation Modeling
Advertisements

© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Chi Square Tests Chapter 17.
The Analysis of Categorical Data. Categorical variables When both predictor and response variables are categorical: Presence or absence Color, etc. The.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Tests for Homogeneity.
Generalized Linear Mixed Model English Premier League Soccer – 2003/2004 Season.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
1 Experimental design and analyses of experimental data Lesson 6 Logistic regression Generalized Linear Models (GENMOD)
Different chi-squares Ulf H. Olsson Professor of Statistics.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Instructor: K.C. Carriere
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
Linear statistical models 2008 Model diagnostics  Residual analysis  Outliers  Dependence  Heteroscedasticity  Violations of distributional assumptions.
Log-linear and logistic models
1 Modeling Ordinal Associations Section 9.4 Roanna Gee.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
An Introduction to Logistic Regression
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
WLS for Categorical Data
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
The General (LISREL) SEM model Ulf H. Olsson Professor of statistics.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
1 B. The log-rate model Statistical analysis of occurrence-exposure rates.
Logistic Regression with “Grouped” Data Lobster Survival by Size in a Tethering Experiment Source: E.B. Wilkinson, J.H. Grabowski, G.D. Sherwood, P.O.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Logistic Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model the probability.
AS 737 Categorical Data Analysis For Multivariate
Categorical Data Prof. Andy Field.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Chapter 16 The Chi-Square Statistic
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
GEE Approach Presented by Jianghu Dong Instructor: Professor Keumhee Chough (K.C.) Carrière.
FPP 28 Chi-square test. More types of inference for nominal variables Nominal data is categorical with more than two categories Compare observed frequencies.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
CHI SQUARE TESTS.
Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Hong Tran, April 21, 2015.
Copyright © 2010 Pearson Education, Inc. Slide
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
1 STA 617 – Chp11 Models for repeated data Analyzing Repeated Categorical Response Data  Repeated categorical responses may come from  repeated measurements.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
Discrepancy between Data and Fit. Introduction What is Deviance? Deviance for Binary Responses and Proportions Deviance as measure of the goodness of.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Logistic Regression Analysis Gerrit Rooks
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
STATISTICAL TESTS USING SPSS Dimitrios Tselios/ Example tests “Discovering statistics using SPSS”, Andy Field.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc Tests for Homogeneity and Independence in a Two-Way Table Data resulting from observations.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
BINARY LOGISTIC REGRESSION
Logistic Regression APKC – STATS AFAC (2016).
Notes on Logistic Regression
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Data Analysis for Two-Way Tables
ביצוע רגרסיה לוגיסטית. פרק ה-2
15.1 The Role of Statistics in the Research Process
Categorical Data Analysis
Chapter 18: The Chi-Square Statistic
Modeling Ordinal Associations Bin Hu
Presentation transcript:

Log-Linear Models & Dependent Samples Feng Ye, Xiao Guo, Jing Wang

Outline  Symmetry, Quasi-independence & Quasi- symmetry.  Marginal Homogeneity & Quasi-symmetry.  Ordinal Quasi-symmetry Model.  Conclusion.

Quasi-independence Model  Model structure  Assumption: Independent model holds for off diagonal cells.  Model fit df = (I-1)^2-I

Symmetry  Model structure  Assumption: The off diagonal cells have equal expected counts.  Model fit: df = I^2-[I+I(I-1)/2]

Symmetry 123Total Total172023

Quasi-symmetry  Model structure  Model fit df = (I-2)(I-1)/2 Symmetry: Independence:

Application  Each week Variety magazine summarizes reviews of new movies by critics in several cities. Each review is categorized as pro, con, or mixed, according to whether the overall evaluation is positive, negative, or a mixture of the two. April 1995 through September 1996 for Chicago film critics Gene Siskel and Roger Ebert.

Application  Reviews of new movies by critics. Ebert SiskelConMixedPro Con24813 Mixed81311 Pro10964

Output & Interpretation Model df G2 p-value Quasi-Independence Symmetry Quasi-symmetry

Quasi-independence  Con  Mixed  pro e.g exp( )=4.41

 Symmetry = Quasi-symmetry + Marginal homogeneity  Quasi-symmetry + Marginal homogeneity = Symmetry  Fit statistics for marginal homogeneity Marginal Homogeneity & Quasi-symmetry

 Symmetry Model  Criteria For Assessing Goodness Of Fit  Criterion DF Value Value/DF  Deviance  Scaled Deviance  Pearson Chi-Square  Scaled Pearson X  Log Likelihood   Quasi-Symmetry Model  Criteria For Assessing Goodness Of Fit  Criterion DF Value Value/DF  Deviance  Scaled Deviance  Pearson Chi-Square  Scaled Pearson X  Log Likelihood  G 2 (S/QS) = = with df = 2, showing marginal homogeneity is plausible.

Marginal Homogeneity Testing

Marginal homogeneity is the special case β j =0. Specifying design matrix to produce expected frequency {µ ab }. Using G 2 and X 2 tests marginal homogeneity, with df=I-1

 d a = p +a – p a+ ; d’ =( d 1,….d I-1 )  Covariance matrix V with elements:  V ab = -(p ab + p ba ) – (p +a – p a+ )(p +b – p b+ )  V aa = p +a + p a+ -2p aa – (p +a – p a+ ) 2  Under marginal homogeneity, E(d) = 0.  W is asymptotically chi-squared with df = I-1.

Marginal Models  Criteria For Assessing Goodness Of Fit  Criterion DF Value Value/DF  Deviance  Scaled Deviance  Pearson Chi-Square  Scaled Pearson X  Log Likelihood   Analysis of Variance  Source DF Chi-Square Pr > ChiSq   Intercept <.0001  review  Residual 0..

Ordinal Quasi-symmetry Model  Quasi-independence, quasi-symmetry, symmetry models for square tables treat classifications as nominal.  Changing the constraints for log linear model to obtain reduced model for ordered response.  Model structure has a linear trend. Where is the ordered scores

Ordinal Quasi-symmetry Model  Parameter estimation & interpretation 1. Fitted marginal counts ==(?) observed marginal counts 2. Dividing the first two equations by n indicates the same means.  Goodness of fit: Checking the distance between reduced model and saturated model.

Logit Representation  Logit model  Interpretation 1. difference between marginal distribution 2. marginal homogeneity 3. Identify as binomial with trials, and fit a logit model with no intercept and predictor

Marginal Homogeneity  Marginal model (cumulative logits) marginal homogeneity :  Ordinal quasi-sym model 1. At the condition of ordinal quasi-symmetry marginal homogeneity is equivalent to symmetry 2. Fit statistic

Application Data 1  Reviews of new movies by critics. Ebert SiskelConMixedPro Con24813 Mixed81311 Pro10964

Output & Interpretation  Output  Marginal homogeneity? 1. No meaning to check if when ordinal quasi-symmetry fits poorly. 2. Using marginal model is a good way. 3. Check the symmetry under the condition of ordinal quasi-symmetry.

Conclusion  Summary statistics provide an overall picture of square tables. Kappa & Percentage  Log-linear provides a valuable addition even an alternative to summary statistic. 1. Quasi-symmetry is the most general model for square table. 2. Adding or deleting variables from log-linear models provides different useful models. 3. Quasi-symmetry models proposes a good instrument for marginal homogeneity.