Advanced statistics for master students Loglinear models.

Slides:



Advertisements
Similar presentations
Sociology 690 Multivariate Analysis Log Linear Models.
Advertisements

© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Soc 3306a Lecture 6: Introduction to Multivariate Relationships Control with Bivariate Tables Simple Control in Regression.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Loglinear Models for Contingency Tables. Consider an IxJ contingency table that cross- classifies a multinomial sample of n subjects on two categorical.
Lecture 3: Chi-Sqaure, correlation and your dissertation proposal Non-parametric data: the Chi-Square test Statistical correlation and regression: parametric.
Statistical Methods Chichang Jou Tamkang University.
PSYC512: Research Methods PSYC512: Research Methods Lecture 19 Brian P. Dyre University of Idaho.
Chi-square Test of Independence
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Crosstabs. When to Use Crosstabs as a Bivariate Data Analysis Technique For examining the relationship of two CATEGORIC variables  For example, do men.
C. Logit model, logistic regression, and log-linear model A comparison.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Logistic Regression Logistic Regression - Dichotomous Response variable and numeric and/or categorical explanatory variable(s) –Goal: Model the probability.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
AS 737 Categorical Data Analysis For Multivariate
Categorical Data Prof. Andy Field.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Categorical Data Analysis School of Nursing “Categorical Data Analysis 2x2 Chi-Square Tests and Beyond (Multiple Categorical Variable Models)” Melinda.
Logit model, logistic regression, and log-linear model A comparison.
A. Analysis of count data
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2008.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
CADA Final Review Assessment –Continuous assessment (10%) –Mini-project (20%) –Mid-test (20%) –Final Examination (50%) 40% from Part 1 & 2 60% from Part.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Advanced statistics for master students Correspondence analysis.
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
CHI SQUARE TESTS.
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
1 Chapter 2: Logistic Regression and Correspondence Analysis 2.1 Fitting Ordinal Logistic Regression Models 2.2 Fitting Nominal Logistic Regression Models.
Reasoning in Psychology Using Statistics Psychology
Chapter Outline Goodness of Fit test Test of Independence.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Data Lab # 4 June 16, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
Advanced statistics for master students Loglinear models II The best model selection and models for ordinal variables.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi Square & Correlation
1 UNIT 13: DATA ANALYSIS. 2 A. Editing, Coding and Computer Entry Editing in field i.e after completion of each interview/questionnaire. Editing again.
ReCap Part II (Chapters 5,6,7) Data equations summarize pattern in data as a series of parameters (means, slopes). Frequency distributions, a key concept.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Chapter 12 Chi-Square Tests and Nonparametric Tests.
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Nonparametric Statistics
Lecture note on statistics, data analysis planning – week 14 Elspeth Slayter, M.S.W., Ph.D.
Other tests of significance. Independent variables: continuous Dependent variable: continuous Correlation: Relationship between variables Regression:
I. ANOVA revisited & reviewed
BINARY LOGISTIC REGRESSION
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis mutually exclusive exhaustive.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
LOGISTIC REGRESSION 1.
Categorical Data Aims Loglinear models Categorical data
The Chi-Square Distribution and Test for Independence
Wednesday, September 23 Descriptive v. Inferential statistics.
Reasoning in Psychology Using Statistics
Statistics in SPSS Lecture 9
ADVANCED DATA ANALYSIS IN SPSS AND AMOS
15.1 The Role of Statistics in the Research Process
Joyful mood is a meritorious deed that cheers up people around you
Chapter 18: The Chi-Square Statistic
Presentation transcript:

Advanced statistics for master students Loglinear models

SM 152 Loglinear analysis -method for analysis of two and more dimensional contingency tables Other approaches for contingency tables (esp. For two- dimensional) 1) chi-square test of independence and adjusted residuals (short repetition) 2)Correspondence analysis (lecture 9)-this proceedure can use also more than two dimensions The main goal of loglinear analysis is to find dependencies in higher dimensional contingency tables -collection of more techniques there are more possibilities (e.g. in SPSS three procedures for loglinear analysis)

SM 152 1)Loglinear 2)Model selection 3)Logit (not included in the lecture) Loglinear models (Literature) Many monographies: Agresti (2002), Wiley; Simonof (2003), Springer; Xie (2000) Knoke,Burke (1980), Sage; Ishii-Kuntz (1994), Sage In Czech Hebák a kol: (2005)Vícerozměrné stat. metody s aplikacemi, 3. díl, kapitola 1

SM 152 Loglinear Models -try to use model which describe relation of two or more categorical (nominal or ordinal) variables -usually models for nominal varibles (sometimes only dichotomies), but models for ordinal data can be used (in SPSS only limited possibilities) -No distinction between dependent and independent variable (logit models use this distinction)

SM 152 Contingency tables – some descriptive statistics Frequencies Percentages –row, column or total? Odds – new measure for contingency tables Odds ratio as one number for 2x2 contingency tables Higher order odds and odds ratios

SM 152 Contingency tables – basics from bc. study Dependence of two nominal/ordinal variables and adjusted residuals Null hypothesis: independence of variables Alternative hypothesis: dependence Logic of the chi-square test: Differencies between model of independence (hypothesis, expected frequencies) and real data (observed frequencies) Chi-square test of independence and the logic of the same test in loglinear models SPSS example including adjusted residuals

SM 152 Excursus- work with SPSS syntax Two possibilities for contingency tables: 1)Original data and we use two variables 2)We do not have original data but we have contingency tables, we use third variable as weight variable Example of second approach: data list free/sex edu count. begin data ……atd end data. weight by count. val lab sex 1 "male" 2„female". val lab edu 1 „basic eduaction" 2„secondary education" 3„tertiary education".

SM 152 Loglinear analysis Basic statistical idea: To model frequencies in contingency tables Excursus: The logic of ANOVA Loglinear analysis is similar to ANOVA, but effects are not sumed but multiplied (see e.g. Field explanation of these similarities) It is possible to use effect of row variable, effect of column variable and also interaction effects (impact of combinations row&column variable together) Methodology: If we use more than two variables we make elaboration (we take impact of other variable see Babbie etc.)

SM 152 Terminology Saturated model-model with all variables and all possible interactions, this model explain fully observed frequencies but is not usefull (can not be tested) (observed freq.=expected) Real model (non saturated)-some interaction or variable is missing (do not explain fully observed frequencies but can be tested (expected frequencies from this model can be compared with observed frequencies in data (model estimates basic population contingency table, observed frequencies are from sample!!!) residual-differencies between observed frequencies and frequencies from model, can be statistically tested (we can identify problems in our models)

SM 152 Loglinear model with only 2 variables Saturated model-2 variables and their interaction Model of independence (see also the chi-square test above)- only row and column variable, no interaction residuals-differencies between observed frequencies and expected frequencies from model of independence Hierarchical models –models which include all lower order interactions and all variables Abbreviations fo hierarchical models and its meaning (ABC) (AB)C etc.

SM 152 Odds and its usage in loglinear modeling Basic concept esp. for interpretation of results (for loglinear models, logistic regression etc.) Can be derived from parameters of loglinear (or logit) models For statistical reasons we use LOGIT = Log(ODDS) – as we change (by logarithmic transformation) multiplicative models into additive we use logarithms insted of original variables Range of odds, odds ratios and logits (differencies)

SM 152 Note at the end: Loglinear analysis is confirmatory, it enables to test dependencies, inclusion of variables (or their interactions) into the model, fit of the model etc.