Analyzing Continuous and Categorical IVs Simultaneously Analysis of Covariance.

Slides:



Advertisements
Similar presentations
PSYC512: Research Methods PSYC512: Research Methods Lecture 13 Brian P. Dyre University of Idaho.
Advertisements

ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control.
General Linear Model Introduction to ANOVA.
Issues in factorial design
Topic 12 – Further Topics in ANOVA
Regression Basics Predicting a DV with a Single IV.
Data Analysis Statistics. Inferential statistics.
The Psychologist as Detective, 4e by Smith/Davis © 2007 Pearson Education Chapter Twelve: Designing, Conducting, Analyzing, and Interpreting Experiments.
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control.
January 6, afternoon session 1 Statistics Micro Mini Multiple Regression January 5-9, 2008 Beth Ayers.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
Data Analysis Statistics. Inferential statistics.
Ch. 14: The Multiple Regression Model building
Intro to Statistics for the Behavioral Sciences PSYC 1900
CORRELATIONAL RESEARCH I Lawrence R. Gordon Psychology Research Methods I.
Multiple Regression – Basic Relationships
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Chapter 9: Correlational Research. Chapter 9. Correlational Research Chapter Objectives  Distinguish between positive and negative bivariate correlations,
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Understanding Research Results
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Analysis of Covariance Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
Moderators. Definition Moderator - A third variable that conditions the relations of two other variables Example: SAT-Quant and math grades in school.
Chapter 14 Introduction to Multiple Regression
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
Curvilinear 2 Modeling Departures from the Straight Line (Curves and Interactions)
Regression Analyses. Multiple IVs Single DV (continuous) Generalization of simple linear regression Y’ = b 0 + b 1 X 1 + b 2 X 2 + b 3 X 3...b k X k Where.
Lab 5 instruction.  a collection of statistical methods to compare several groups according to their means on a quantitative response variable  Two-Way.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
Psych 5500/6500 Other ANOVA’s Fall, Factorial Designs Factorial Designs have one dependent variable and more than one independent variable (i.e.
Analysis of Covariance adjusting for potential confounds.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Dummy Variable Regression Models chapter ten.
Commonly Used Statistics in the Social Sciences Chi-square Correlation Multiple Regression T-tests ANOVAs.
Categorical Independent Variables STA302 Fall 2013.
General Linear Model.
9.1 Chapter 9: Dummy Variables A Dummy Variable: is a variable that can take on only 2 possible values: yes, no up, down male, female union member, non-union.
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
1 Bandit Thinkhamrop, PhD.(Statistics) Dept. of Biostatistics & Demography Faculty of Public Health Khon Kaen University Overview and Common Pitfalls in.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, partial correlations & multiple regression Using model plotting to think about ANCOVA & Statistical control Homogeneity.
ANCOVA.
1 G Lect 10M Contrasting coefficients: a review ANOVA and Regression software Interactions of categorical predictors Type I, II, and III sums of.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
29 October 2009 MRC CBU Graduate Statistics Lectures 4: GLM: The General Linear Model - ANOVA & ANCOVA1 MRC Cognition and Brain Sciences Unit Graduate.
ScWk 298 Quantitative Review Session
Statistical Significance
Chapter 14 Introduction to Multiple Regression
REGRESSION G&W p
Analysis of Variance and Covariance
Testing for moderators
Multiple Regression Analysis and Model Building
Multiple Regression.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Interactions & Simple Effects finding the differences
Introduction to Statistics
Soc 3306a: ANOVA and Regression Models
Soc 3306a Lecture 11: Multivariate 4
Simple Linear Regression
Chapter 8: DUMMY VARIABLE (D.V.) REGRESSION MODELS
Financial Econometrics Fin. 505
Presentation transcript:

Analyzing Continuous and Categorical IVs Simultaneously Analysis of Covariance

Skill Set When we model a single categorical and a single continuous variable, what do the main effects look like? What do the interactions look like? What is the meaning of each of the three b weights in such models? What is the sequence of tests used to analyze such data? Why should we avoid dichotomizing continuous IVs? What is the difference between ordinal and disordinal interactions? Why do we test for regions of significance of the difference between regression lines when we have an interaction?

Mixed IVs Simplest example has 2 IVs 1 IV is categorical (e.g., Male, Female) 1 IV is continuous (e.g., MAT score) –Keats:Shelly::Byron:Harley-Davidson DV is continuous, e.g., GPA in law school Have used ANOVA for categorical and Regression for continuous Both are part of GLM. Many people call mixing categorical and continuous vbls Analysis of Covariance (ANCOVA).

Example Data Note that there are 40 people here. Effect coding (1, -1) has been used to identify males vs. females. Doesn’t matter which is which (-1, 1) for coding purposes.

Example Data Graph What is the main story here?

Group vs. Common Regression Coefficient Can have 1 common slope, b c. Can have 2 group slopes, b F and b M. Common slope is weighted average of group slopes: Weight by SS X (here, MAT scores) for each group. Weight comes from variability in X and number of people in group.

Telling the Story With Graphs (1) Why is there nothing to tell here?

Telling the Story (2) How does the graph tell us which variable is important?

Telling the Story (3) What stories are being told in each of these graphs? When the story is obvious, the graph tells it. But we need statistical tests when the results are not obvious, and when we want to persuade others (publish).

Testing Sequence (1) Construct vectors X, G and XG. –X is continuous –G is group (categorical) –XG is the product of the two. Just mult. Intercept for common group is a. Note three b weights. First tells difference in groups. Second is common slope. Third is interaction (difference in group slopes). Two common terms, two difference terms.

Testing Sequence (2) Estimate 3 slopes (and intercept). Examine R 2 for model. If n.s., no story; quit. If R 2 sig and large enough: Examine b 3. If sig, there is an interaction. If sig, estimate separate regressions for different groups. If b 3 is not sig, re-estimate model without XG. Examine b 1 and b 2.

Testing Sequence (3) The significance of the b weights tells the importance of the variables. Is b 1 significant? (G, categorical) Is b 2 significant? (X, cont) YesNo YesParallel slopes, different intercepts Identical regressions NoMean diffs only; slopes are zero Only possible with severe confounding; ambiguous story.

Test Illustration (1) R 2 =.44; p <.05 Y' = G+.0673X-.0146GX TermEstimateSEt G (b1; Sex) X (b2; MAT) * GX (b3; Int) Step 1. R 2 is large & sig. Step 2. Slope for interaction (b 3 ) is N.S. (low power test) Step 3. Drop GX and re-estimate.

Test Illustration (2) R 2 =.42; p <.05 Y' = G+.0687X TermEstimateSEt G (b1; Sex) X (b2; MAT) * Step 4. Examine slopes (b weights). The only significant slope is for MAT. Conclusion: Identical regressions for Males and Females. The slight difference in lines is due to sampling error.

Second Illustration (1) Suppose our data look like these. What story do you think they tell?

Second Illustration (2) R 2 =.72; p <.05 Y' = G+.0643X-.0117GX TermEstimateSEtp G (b1; Sex) X (b2; MAT) GX (b3; Int) Is there any story to tell? 2.Is there an interaction? R 2 =.72; p <.05 Y' = G+.0655X TermEstimateSEtp G (b1; Sex) X (b2; MAT) What is the story? Does it agree with the graph?

More Complex Designs With more complex designs, logic and sequence of tests remain the same. Categorical vbls may have more than 2 levels We may have several continuous IVs If multiple categories, create multiple (G-1) interaction terms. If multiple Xs, create products for each. Test the terms as a block using hierarchical regression :

Categorizing Continuous IVs The median split (e.g., personality, stress, BEM sex-role scales). Don’t do this because: –Loss of power and information – treat IQs of 100 and 140 as identical. –Loss of replication (median changes by sample) –Arbitrary value of split - “high stress” group may not be very stressed Some throw out middle people – also a problem because of range enhancement bias.

Interactions Some research is aimed squarely at interactions, e.g., Aptitude Treatment Interaction (ATI) research. Learning styles, etc. Types of Interactions: No interactionOrdinal Interaction Disordinal Interaction Implications?

Regions of Significance With a disordinal interaction, there must be a place where the treatments are equal (where the lines cross). The crossover is found by (a1-a2)/(b2-b1) or (4-1.5)/(.8-.3) = 2.5/.5 =5, just where it appears to be on the graph. Some places on X give equivalent effects. Other places show a benefit to one treatment or the other.

Simultaneous Regions of Significance F is the tabled value. N is n 1 +n 2 = total people.

Disordinal Example (1) Hypothetical experiment in teaching Research Methods. Learning style – high scores indicate preference for spoken instruction. Two instruction methods – graphics intensive and spoken intensive. N=40. X = learning style questionnaire score. G = method of instruction. DV is in-class test score.

Disordinal Example (2) RY (Test) X (Learn Style) G (Lect v. tutor) GX (Int) Y1 X.221 G GX M SD SourceDfSSMS F Model Error C Total R 2 =.91 VariableEstimateSEtp Int67.09 G X GX

Disordinal Example (3) n1=20YX G=1 Y1R X.951 M SD SourceDfSS G=1 Model Error C Total R 2 =.90 VariableEstimateSEtp Int X Group 1 data

Disordinal Example (4) n2=20YX G=-1 Y1R X-.971 M SD Group 2 data SourcedfSS G=-1 Model Error C Total R 2 =.93 VariableEstimateSEtP Int X

Disordinal Example (5) Therefore, the regression will all terms included is: Y'= G +.23X +.92GX The regression for the 1 group is: Y'= X The regression for the -1 group is: Y'= X. To find the crossover point, we find (a1-a2)/(b2-b1) which, in our case is ( )/( ) = N=40n1=20n2=20Group1 = 1 Group2 = -1 F.05(2,36) =3.26SS res(tot) = SS res(1) = SS res(2) = Note: SS res(tot) = SS res(1) + + SS res(2) = SD=15.26, SS =SD 2 *(N-1) = SD=15.41, SS=15.41*15.4 1*19 =28.2From corrs =27.35From corrs a1=40.10b1=1.15a2=94.09b2=-.69

Disordinal Example (6) Lower27.26 Middle29.34 Upper31.48 Therefore, our estimates are:

Disordinal Example (7) N.S. Region