GZLM... including GEE. Generalized Linear Modelling A family of significance tests... Something we don’t see mentioned much in articles yet... but will.

Slides:



Advertisements
Similar presentations
Quantitative Data Analysis: Hypothesis Testing
Advertisements

By Wendiann Sethi Spring  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.
January 7, afternoon session 1 Multi-factor ANOVA and Multiple Regression January 5-9, 2008 Beth Ayers.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Using Statistics in Research Psych 231: Research Methods in Psychology.
ANCOVA Psy 420 Andrew Ainsworth. What is ANCOVA?
Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.
Biol 500: basic statistics
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 13 Using Inferential Statistics.
Today Concepts underlying inferential statistics
Intro to Parametric & Nonparametric Statistics Kinds & definitions of nonparametric statistics Where parametric stats come from Consequences of parametric.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 14: Non-parametric tests Marshall University Genomics.
Assumption of Homoscedasticity
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
Basic Analysis of Variance and the General Linear Model Psy 420 Andrew Ainsworth.
Mann-Whitney and Wilcoxon Tests.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Leedy and Ormrod Ch. 11 Gray Ch. 14
Active Learning Lecture Slides
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of Homoscedasticity Homoscedasticity (aka homogeneity or uniformity of variance) Transformations.
Stats Lunch: Day 7 One-Way ANOVA. Basic Steps of Calculating an ANOVA M = 3 M = 6 M = 10 Remember, there are 2 ways to estimate pop. variance in ANOVA:
Statistics for Education Research Lecture 8 Tests on Three or More Means with Repeated Measures: One-Way ANOVA with Repeated Measures Instructor: Dr. Tung-hsien.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Animated banners - H1 Sigurbjörn Óskarsson. Research design Repeated measures N=32 (- 1 outlier) Task testing Control category + 3 levels of experimental.
Stats Lunch: Day 4 Intro to the General Linear Model and Its Many, Many Wonders, Including: T-Tests.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
11 Chapter 12 Quantitative Data Analysis: Hypothesis Testing © 2009 John Wiley & Sons Ltd.
PSY2004 Research Methods PSY2005 Applied Research Methods Week Five.
Osteoarthritis Initiative Analytic Strategies for the OAI Data December 6, 2007 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
AOV Assumption Checking and Transformations (§ )
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Adjusted from slides attributed to Andrew Ainsworth
Experimental Design and Statistics. Scientific Method
Experimental Research Methods in Language Learning
I271B The t distribution and the independent sample t-test.
Generalized linear MIXED models
Review. Statistics Types Descriptive – describe the data, create a picture of the data Mean – average of all scores Mode – score that appears the most.
Repeated-measures designs (GLM 4) Chapter 13. Terms Between subjects = independent – Each subject gets only one level of the variable. Repeated measures.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
University of Durham D Dr Robert Coe University of Durham School of Education Tel: (+44 / 0) Fax: (+44 / 0)
1 Psych 5510/6510 Chapter 14 Repeated Measures ANOVA: Models with Nonindependent Errors Part 1 (Crossed Designs) Spring, 2009.
Stats Lunch: Day 8 Repeated-Measures ANOVA and Analyzing Trends (It’s Hot)
Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.
T tests comparing two means t tests comparing two means.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Experimental Research Methods in Language Learning Chapter 13 Paired-Samples and Independent- Samples T-tests.
Non-parametric Approaches The Bootstrap. Non-parametric? Non-parametric or distribution-free tests have more lax and/or different assumptions Properties:
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Mixed-Design ANOVA 13 Nov 2009 CPSY501 Dr. Sean Ho Trinity Western University Please download: treatment5.sav.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Statistics for Education Research Lecture 4 Tests on Two Means: Types and Paired-Sample T-tests Instructor: Dr. Tung-hsien He
Inferential Statistics Psych 231: Research Methods in Psychology.
Data Screening. What is it? Data screening is very important to make sure you’ve met all your assumptions, outliers, and error problems. Each type of.
Inferential Statistics Psych 231: Research Methods in Psychology.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Analysing Means II: Nonparametric techniques.
Sunee Raksakietisak Srinakharinwirot University
Ass. Prof. Dr. Mogeeb Mosleh
Inferential Statistics
Psych 231: Research Methods in Psychology
Presentation transcript:

GZLM... including GEE

Generalized Linear Modelling A family of significance tests... Something we don’t see mentioned much in articles yet... but will hear more of Maybe we should be using it!

Often we have RQs and RHs that require us to compare groups and/or conditions. E.g. – Do student attitudes vary depending on year of study and on which of two types of speaking instruction they received? – Does word length and word frequency affect how well learners remember the word? – Do speakers differ in their pronunciation of a sound depending on gender and formality of situation? – Do students trained to use online dictionaries improve in writing more than those who are not?

Well known statistical significance tests for these comparisons are the GLM family General Linear Model GLM includes: – t tests – ANOVA – Pearson correlation – Linear regression

But GLM is picky... comes with prerequisite requirements 1. the DV scale – Can’t deal with data that is not scores... –...that are ‘equal interval’ and –...on a supposedly open ended scale – Counts have to be treated as scores – Rating scales possibly, but... are they ‘equal interval’ – Not binary data such as pass/fail or yes/no responses

2. Further features of the score data (in the population) e.g. – Normality of distribution shape of scores – Similarity of spread of scores in different groups (aka homoscedasticity or homogeneity of variance) – Similarity of variance of differences between pairs of repeated measures (aka sphericity)

Previous ways of dealing with data that fails the prerequisites – Use GLM anyway, claiming it is ‘robust’ even when prerequisites are missing Or just use GLM and don’t check/mention the problems – For normality, transform the data to be more ‘normal’ in shape (Example) But results are then hard to talk about – Use an alternative test (nonparametric, weaker) But such tests are only available for simple comparisons

Since the 80s, but only recently available in popular packages like SPSS... GZLM, including GEE, covers most of the ground of the GLM family, and more, and deals with most of the problems Generalized Linear Model itself for comparing groups only (GZLM) An extension of GZLM called Generalized Estimating Equations (GEE) for comparing repeated measures (and groups if necessary)

An example comparing groups We see the issue of choosing the right analysis for the distribution shape Marin’s data, DV6 – DV: Six point rating scale response for how often learners use vocab strategies – EV: Two genders – EV: Five years of study in university

An example with repeated measures We see how to turn the data into ‘long’ form which GEE requires Issariya’s data – DV: Percent correct scores for learning vocab wordlists – EV: Pretest versus posttest – EV: Experimental group (with vocab learning strategy instruction) and control group (with extra practice)

An example with Poisson distribution We see computational limitations Nushoor’s data – DV: Counts of how often people used types of modifier expression with requests – EV: Types of modifier – EV: Four groups (2 NS, 2 NNS) – EVs: Types of request situation in terms of social variables such as power, and seriousness

An example with binary data Vineeta’s r data – DV: Numbers of r produced versus other variants – EVs: Various features of the word – EVs: Various features of the people – EV: Formality of situation This analysis is I think more or less equivalent to what traditional Varbrul analysis does....BUT – The output is in a different form – In fact this sort of analysis is not really statistically acceptable anyway (see _200043_en.pdf) _200043_en.pdf

To analyse data like Vineeta’s properly we need either the latest version of Varbrul called Rbrul, or GZLM Mixed... the latest bit of GZLM added to SPSS Watch this space....