Introduction to the General Linear Model (GLM)

Slides:



Advertisements
Similar presentations
Non-linear Components for Multiple Regression Why we might need non-linear components Type of non-linear components Squared-term components & their disadvantages.
Advertisements

Transformations & Data Cleaning
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control.
Extension The General Linear Model with Categorical Predictors.
Analytic Comparisons & Trend Analyses Analytic Comparisons –Simple comparisons –Complex comparisons –Trend Analyses Errors & Confusions when interpreting.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Psych 524 Andrew Ainsworth Data Screening 1. Data check entry One of the first steps to proper data screening is to ensure the data is correct Check out.
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control.
Multiple Regression Models Advantages of multiple regression Important preliminary analyses Parts of a multiple regression model & interpretation Differences.
Multiple Regression Models: Some Details & Surprises Review of raw & standardized models Differences between r, b & β Bivariate & Multivariate patterns.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Introduction the General Linear Model (GLM) l what “model,” “linear” & “general” mean l bivariate, univariate & multivariate GLModels l kinds of variables.
Simple Regression correlation vs. prediction research prediction and relationship strength interpreting regression formulas –quantitative vs. binary predictor.
General Linear Models -- #1 things to remember b weight interpretations 1 quantitative predictor 1 quantitative predictor & non-linear component 1 2-group.
Bivariate & Multivariate Regression correlation vs. prediction research prediction and relationship strength interpreting regression formulas process of.
Relationships Among Variables
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Leedy and Ormrod Ch. 11 Gray Ch. 14
Multivariate Statistical Data Analysis with Its Applications
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
1 Psych 5510/6510 Chapter 10. Interactions and Polynomial Regression: Models with Products of Continuous Predictors Spring, 2009.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Coding Multiple Category Variables for Inclusion in Multiple Regression More kinds of predictors for our multiple regression models Some review of interpreting.
ANOVA and Linear Regression ScWk 242 – Week 13 Slides.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Regression Models w/ 2 Quant Variables Sources of data for this model Variations of this model Main effects version of the model –Interpreting the regression.
Chapter 16 Data Analysis: Testing for Associations.
Regression Models w/ 2-group & Quant Variables Sources of data for this model Variations of this model Main effects version of the model –Interpreting.
Plotting Linear Main Effects Models Interpreting 1 st order terms w/ & w/o interactions Coding & centering… gotta? oughta? Plotting single-predictor models.
Regression Models for Quantitative (Numeric) and Qualitative (Categorical) Predictors KNNL – Chapter 8.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Multivariate Data Analysis Chapter 2 – Examining Your Data
General Linear Model.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, partial correlations & multiple regression Using model plotting to think about ANCOVA & Statistical control Homogeneity.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
General Linear Model What is the General Linear Model?? A bit of history & our goals Kinds of variables & effects It’s all about the model (not the data)!!
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
THE ROLE OF STATISTICS IN RESEARCH. Reading APPENDIX A: Statistics pp
Remember the equation of a line: Basic Linear Regression As scientists, we find it an irresistible temptation to put a straight line though something that.
Topics: Multiple Regression Analysis (MRA)
Plotting Non-linear & Complex Main Effects Models
Inference about the slope parameter and correlation
Learning Objectives For two quantitative IVs, you will learn:
Logistic Regression CSC 600: Data Mining Class 14.
Multiple Regression – Part I
Non-linear relationships
Senior Statistical Criminologist
QM222 Class 16 & 17 Today’s New topic: Estimating nonlinear relationships QM222 Fall 2017 Section A1.
The Correlation Coefficient (r)
Learning Objectives For models with dichotomous intendant variables, you will learn: Basic terminology from ANOVA framework How to identify main effects,
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Data, Univariate Statistics & Statistical Inference
Multiple Regression.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Dr. Siti Nor Binti Yaacob
Advanced Quantitative Analysis
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Soc 3306a: ANOVA and Regression Models
Ass. Prof. Dr. Mogeeb Mosleh
I271b Quantitative Methods
Multivariate Statistics
Soc 3306a Lecture 11: Multivariate 4
1/18/2019 ST3131, Lecture 1.
3 basic analytical tasks in bivariate (or multivariate) analyses:
Multiple Regression Berlin Chen
The Correlation Coefficient (r)
Presentation transcript:

Introduction to the General Linear Model (GLM) what “model,” “linear” & “general” mean bivariate, univariate & multivariate GLModels kinds of variables

“General Linear Model” “Model” means that we are usually interested in predicting or “modeling” the values of one variable (criteria) from the values of one or more others (the predictors) “Linear” means that the variables will be “linearly transformed (* & /) and “linearly combined” (+ & -) to produce the model’s estimates “General” means that the model intends to provide a way to model & test RHs: about any combination of criterion and predictor variables (i.e., any model), and to test RHs: about comparisons among models

Regression “vs.” GLM The “constant” is often represented differently in GLM than in multiple regression … Single predictor models  single predictor regression y’ = bx + a  single predictor GLM y’ = b0 + b1x1 Multiple predictor models multiple predictor regression y’ = b1x1 b2x2+ a multiple predictor GLM y’ = b0 + b1x1 + b2x2

Common kinds of GLModels Bivariate  one criterion & one predictor  simple regression y’ = b0 + b1x Univariate  one criterion & multiple predictors  multiple regression in all its forms y’ = b0 + b1x1 + b2x2 + b3x3 Multivariate  multiple criterion & multiple predictors  canonical regression in all its forms b0 + b1y1 + b2y2 = b0 + b1x1 + b2x2 + b3x3

Common kinds of variables Quantitative variables Raw variable Centered variables X – mean for each participant Mean  0 simplifies math of more complicated models Re-centered variables X  a more meaningful value Change “start” or “stop” values E.g., “aging & intellectual decline” Mathematical “trick” to get the desired model/weights selecting which group or value will be represented in model’s bs

Common kinds of variables Quadratic quantitative variables X2 – actually represents combination of linear + quadratic Xcen2 – represents the “pure” quadratic term Model with X2 will have ≈ R2 as model with Xcen + Xcen2 A model with a quadratic term should always include the linear term for that variable

Common kinds of variables 2-group variables Unit coding (usually 1-2) Dummy Coding “control” or “comparison” group coded 0 “treatment” or “target” group coded 1 Effect Coding “control” or “comparison” group coded -1

Common kinds of variables k-group variables Raw coding (usually 1-2-3, etc.) Dummy Coding “control” or “comparison” group coded 0 “treatment” or “target” groups coded 1 on one variable & 0 on all others the full set of codes must be included in the model Effect Coding “control” or “comparison” group coded -1 “treatment” or “target” group coded 1 on one variable & 0 on all others

Common kinds of variables K-groups variables, cont. Comparison coding Combining simple and complex analytical comparison codes to represent specific, hypothesis driven, group comparisons E.g., Say you have 4 groups and RH: that… Group 1 has higher scores that the average scores of groups 2-4 the codes would be gp1 = 3 gp2 = -1 gp3 = -1 gp4 = -1 Groups 2 & 3 have higher average scores than do 1 & 4 the codes would be gp1 = -1 gp2 = 1 gp3 = 1 gp4 = -1 Group 2 has higher scores than the average scores of groups 3-4 the codes would be gp1 = 0 gp2 = 2 gp3 = -1 gp4 = -1 Usually havea set of k-1 codes

Common kinds of variables K-groups variables, cont. Polynomial coding If the groups represent a quantitative continuum, you use codes to represent different polynomial functions (linear, quadratic, cubic, etc.) to explore the shape of the relationship between that variable and the criterion E.g., for a 5-group variable, the polynomial codes are … Linear -2 -1 0 1 2 Quadratic 2 -1 -2 -1 2 Cubic -1 2 0 -2 1 Quartic 1 -4 6 -4 1 the full set of codes must be included in the model

Common kinds of variables Ordered-category variables Sometimes you have a quantitative variable that you want to change into a set of ordered categories e.g.  % grade into “A” “B” “C” “D” “F” e.g.  % grade into “Pass” “Fail” e.g.  aptitude test scores into “remedial” “normal” “gifted” Sometimes this is done to help with “ill-behaved distributions” e.g.  frequency variable with mean=1.1, std=8.4, sk=4.2 e.g.  frequency variable with 60% “0” 38% “1” max = 118 Important because  skewed univariate distributions can “create” apparently nonlinear bivariate relationships

Common kinds of variables Ordered-category variables, cont. Once you form the ordered categories (using “IF,” “RECODE” or other transformations), you can enter those variables into the GLM in different ways Using the category values (e.g., 1, 2, 3, etc) *** Centering or re-centering the category values *** Dummy codes of the category values Effect codes of the category values Polynomial codes of the category values*** *** indicates approaches that make assumptions about the interval nature of the variable and/or its normal distribution, with which not everyone agrees!

Common kinds of variables Interactions Interactions represent the “joint effect” or “non-additive combination” of 2 or more predictors as they relate to a criterion (or set of criteria in the multivariate case). They are the “moderation,” “it depends,” “sometimes,” or “maybe” that makes our science and statistical analyses so interesting. Interactions can be formed as the product of any 2 or more variables of any of the types discussed above. There are some “guidelines” about forming, including and interpreting interaction terms.

Common kinds of variables Interaction  “Guidelines” When including a 2-way interaction, both related main effects must be included When including a 3-way interaction, all 3 main effects and all 3 2-way interactions must be included When including a non-linear interaction term, the related linear and nonlinear main effects, and linear interaction terms must be included The associated terms can not exceed the df of the variables involved