Planned Contrasts and Data Management

Slides:



Advertisements
Similar presentations
Bivariate Analysis Cross-tabulation and chi-square.
Advertisements

Planned Contrast: Execution (Conceptual) 1. Must predict pattern of interaction before gathering data. Predict that Democratic women will be most opposed.
Smith/Davis (c) 2005 Prentice Hall Chapter Thirteen Inferential Tests of Significance II: Analyzing and Interpreting Experiments with More than Two Groups.
PSY 307 – Statistics for the Behavioral Sciences
ANOVA: Analysis of Variance
More on ANOVA. Overview ANOVA as Regression Comparison Methods.
Conceptual Review Conceptual Formula, Sig Testing Calculating in SPSS
Introduction to Analysis of Variance CJ 526 Statistical Analysis in Criminal Justice.
Factorial ANOVA 2-Way ANOVA, 3-Way ANOVA, etc.. Factorial ANOVA One-Way ANOVA = ANOVA with one IV with 1+ levels and one DV One-Way ANOVA = ANOVA with.
Chi-square Test of Independence
Lecture 16 Psyc 300A. What a Factorial Design Tells You Main effect: The effect of an IV on the DV, ignoring all other factors in the study. (Compare.
Two Groups Too Many? Try Analysis of Variance (ANOVA)
Lecture 15 Psyc 300A. Example: Movie Preferences MenWomenMean Romantic364.5 Action745.5 Mean55.
Today Concepts underlying inferential statistics
2-Way ANOVA, 3-Way ANOVA, etc.
Two-Way Analysis of Variance STAT E-150 Statistical Methods.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
ANOVA Chapter 12.
Chapter 14Prepared by Samantha Gaies, M.A.1 Chapter 14: Two-Way ANOVA Let’s begin by reviewing one-way ANOVA. Try this example… Does motivation level affect.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Inferential Statistics: SPSS
Planned Contrasts and Data Management Class 19. QUIZ 3 ON THURSDAY, DEC. 5 Covers: Two-way ANOVA through Moderated Multiple Regression.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Project 6 Using The Analysis ToolPak To Analyze Sales Transactions Jason C. H. Chen, Ph.D. Professor of Management Information Systems School of Business.
Moderated Multiple Regression Class 18. Functions of Regression 1. Establishing relations between variables Do frustration and aggression co-occur? 2.
 Slide 1 Two-Way Independent ANOVA (GLM 3) Chapter 13.
Inferential Statistics
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
SPSS Basics and Applications Workshop: Introduction to Statistics Using SPSS.
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, October 15, 2013 Analysis of Variance (ANOVA)
Sampling, sample size estimation, and randomisation
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Within Subjects Analysis of Variance PowerPoint.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Smoking Data The investigation was based on examining the effectiveness of smoking cessation programs among heavy smokers who are also recovering alcoholics.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Two-Way (Independent) ANOVA. PSYC 6130A, PROF. J. ELDER 2 Two-Way ANOVA “Two-Way” means groups are defined by 2 independent variables. These IVs are typically.
ONE-WAY BETWEEN-GROUPS ANOVA Psyc 301-SPSS Spring 2014.
Smith/Davis (c) 2005 Prentice Hall Chapter Fifteen Inferential Tests of Significance III: Analyzing and Interpreting Experiments with Multiple Independent.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Statistics for Political Science Levin and Fox Chapter Seven
Kin 304 Inferential Statistics Probability Level for Acceptance Type I and II Errors One and Two-Tailed tests Critical value of the test statistic “Statistics.
Handout Eight: Two-Way Between- Subjects Design with Interaction- Assumptions, & Analyses EPSE 592 Experimental Designs and Analysis in Educational Research.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.Introduction To Basic Ratios 3.Basic Ratios In Excel 4.Cumulative.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Stats Methods at IC Lecture 3: Regression.
Categorical Variables in Regression
T-Tests and ANOVA I Class 15.
Comparing several means: ANOVA (GLM 1)
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Kin 304 Inferential Statistics
What if. . . You were asked to determine if psychology and sociology majors have significantly different class attendance (i.e., the number of days a person.
Comparing Several Means: ANOVA
Hypothesis Testing and Comparing Two Proportions
Statistics for the Social Sciences
Multiple Regression Chapter 14.
Psych 231: Research Methods in Psychology
Chapter 12 A Priori and Post Hoc Comparisons Multiple t-tests Linear Contrasts Orthogonal Contrasts Trend Analysis Bonferroni t Fisher Least Significance.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Chapter Fourteen McGraw-Hill/Irwin
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Exercise 1 Use Transform  Compute variable to calculate weight lost by each person Calculate the overall mean weight lost Calculate the means and standard.
Presentation transcript:

Planned Contrasts and Data Management Class 17

Topics Covered Today 1. Planned Contrasts   1. Planned Contrasts 2. Analysis of Residual Variance 3. Post-hoc tests 4. Data Management a. Setting up data files b. Cleaning data 5. Tuesday a. Interactions in regression b. Hand out ANOVA exercise

What we won't cover from Keppel Chapter 11:   What we won't cover from Keppel Chapter 11: 1. Orthogonal comparisons (pp.210-211) 2. Analysis of trend (pp. 211) 3. Multiple comparisons (pp. 211- 212) 4. Analysis of cell means (pp. 212-222)

Planned Contrast: Function 1. Factorial ANOVA tests for orthogonal (perpendicular) interactions. 2. Some studies predict non-orthogonal interactions. 3. Planned contrast provides more predictive power to confirm non-orthogonal contrasts of any particular shape (“wedge”, “arrow” [like above] or other).

Planned Contrast: Execution (Conceptual)   1. Must predict ????? before gathering data. Predict that Democratic women will be most opposed to gun instruction in school, compared to Democratic men, Republican men, and Republican women.

Planned Contrast: Execution (Conceptual)   1. Must predict pattern of interaction before gathering data. Predict that Democratic women will be most opposed to gun instruction in school, compared to Democratic men, Republican men, and Republican women.

Convert Separate Factors into Single Factor 1. Two separate factors Political Party Democrat, GOP Gender Male, Female   Convert the two separate factors into a ????

Convert Separate Factors into Single Factor 1. Two separate factors Political Party Democrat, GOP Gender Male, Female   Convert the two separate factors into a single factor genparty 1) Male Republican 2) Male Democrat 3) Female Republican 4) Female Democrat

Converting Multi-factors into Single Factor for Planned Contrast Gender Political Party Male Female Republican 5.00 4.75 Democrat 4.50 2.75 Converted into single factor with four levels   GENPARTY 1 = Male/Republican 5.00 2 = Male/Democrat 4.50 3 = Female/Republican 4.75 4 = Female/Democrat 2.75

Planned Contrast: Execution (Conceptual)   3. Conduct one-way ANOVA, with new single variable as predictor. 4. Assign weights to the four levels, as follows: 1) Male Republican -1 2) Male Democrat -1 3) Female Republican -1 4) Female Democrat 3 * Weights indicate which sub-groups are to be compared. * Weights must add up to zero 5. Planned contrast then limits comparison to the indicated groups, but “counts” all subjects in terms of degrees of freedom and computation of error. This provides greater predictive power.

Graph of Gender X Political Party and Opposition to Gun Usage in School

Orthogonal Interaction Univariate Analysis of Variance   [DataSet1] Orthogonal Interaction

Planned Contrast, Page 1

Planned Contrast, Page 2 Contrast Tests Contrast -6.000 1.768 1.696 Assumes eq. var. Doesn’t assume eq. var. value -6.000 Std. Error 1.768 1.696 -3.394 -3.539 12 5.501 .005 .014 t df Sig. (2 –tailed)

“Quality Control” for Planned Contrast Issue: Planned contrast can be a very “liberal” test, confirming patterns that don’t closely fit with actual predictions. Predicted this Obtained this Result of this –1, -1, -1, + 3 planned contrast is still significant How to assess the “quality” of a significant planned contrast?

Analysis of Residual Variance Logic of test: Did (Between groups effect – ???) leave significant amount of ??? variance unexplained? If so, then the contrast DID OR DID NOT do a good job? However, if “what’s left over” (i.e., between – contrast) is not significant, then the contrast accounts for ???

Analysis of Residual Variance Logic of test: Did (Between groups effect – Contrast effect) leave significant amount of sytematic (non-random) variance unexplained? If so, then the contrast did not do a good job. It did not explain the outcome fully. However, if “what’s left over” (i.e., between – contrast) is not significant, then the contrast accounts for most of the treatment. In this case, the contrast did do a good job.

Steps in Analysis of Residual Variance Test Get SPSS printout of planned contrast Get t of contrast, square it to get contrast F (t = F ) Compute SS contrast (SSc): Multiply contrast F by mean sq. w/n (MSw) of oneway. This results in SS contrast (SSc). Compute SS residuals (SSr): Get SS between (SSb) from oneway, and subtract SSc. (SSb – SSc) = SS residuals (SSr) Compute MS contrast (MSc): Divide SSr by df, which is (oneway df – planned contrast df). This produces the MS contrast (MSc) Compute F residuals: Divide MSc by MSw. MSc/MSw = F residuals Compute df for F resid: numerator df = (df oneway – df contrast; see 5a, above), denominator df = df within (from oneway). Check this F in F table from any stats book. If significant, contrast is not a good fit. If not significant, the contrast is a good fit.

Residuals Analysis Test 1. Get SPSS printout of planned contrast 2. F of contrast (Fcont) = t2; t = -3.39 t2 = 11.49 3. SScontrast (SScont): F cont X MSw = 11.49 X 1.04 = 11.95 4. SSresiduals (SSres): SSbetween (SSb) = 12.50 SSb – SScont = 12.50 – 11.95 = .55 5a. Contrast df = df oneway – df contrast = 15 -12 = 3 5b. MScontrast (MScont) = SSres / contrast df = .55/3 = 0.18 F residuals (Fresid): Divide MScont by MSw = 0.18/1.04 = .17 DF for Fresid = df contrast (see 5a, above), df within: (3, 12) 8. F table at (3, 12) df, for criterion p < .25; F = 1.56 9. Obtained Fresid < 1.56, therefore residual is not significant, therefore contrast result is a good fit for data.

Post Hoc Effects Do female democrats differ from other groups?   Do female democrats differ from other groups? 1 = Male/Republican 5.00 2 = Male/Democrat 4.50 3 = Female/Republican 4.75 4 = Female/Democrat 2.75 Conduct three t tests? Why or Why not? Solution: XXX tests of multiple comparisons.   XXX tests consider the inflated likelihood of Type ? error Kent's favorite—??? test of multiple comparisons, which is the most generous.

Post Hoc Effects Do female democrats differ from other groups?   Do female democrats differ from other groups? 1 = Male/Republican 5.00 2 = Male/Democrat 4.50 3 = Female/Republican 4.75 4 = Female/Democrat 2.75 Conduct three t tests? NO. Why not? Capitalizes on chance Solution: Post hoc tests of multiple comparisons.   Post hoc tests consider the inflated likelihood of Type I error Kent's favorite—Tukey test of multiple comparisons, which is the most generous. NOTE: Post hoc tests can be done on any multiple set of means, not only on planned contrasts.

Conducting Post Hoc Tests   1. Recode data from multiple factors into single factor, as per planned contrast. 2. Run oneway ANOVA statistic 3. Select "posthoc tests" option. ONEWAY gunctrl BY genparty /CONTRAST= -1 -1 -1 3 /STATISTICS DESCRIPTIVES /MISSING ANALYSIS /POSTHOC = TUKEY ALPHA(.05).

Post hoc Tests, Page 1

Post Hoc Tests, Page 2

Data Management Issues Setting up data file   Checking accuracy of data Disposition of data Why obsess on these details? Murphy's Law If something can go wrong, it will go wrong, and at the worst possible time.

Creating a Coding Master 1. Get survey copy   2. Assign variable names 3. Assign variable values 4. Assign missing values 5. Proof master for accuracy 6. Make spare copy, keep in file drawer

Coding Master

Cleaning Data Set 1. Exercise in delay of gratification   1. Exercise in delay of gratification 2. Purpose: Reduce random error 3. Improve power of inferential stats.

Techniques for Cleaning Data 1. Print out data file and ???   2. Run “descriptives”: check ????? 3. Run “frequencies” 4. Run “cross tabs”: check if number ??? per condition looks OK 5. Correlations: Check if patterns of data move in expected direction 6. Multiple data entry: Locate and correct disparities in data entry.

Techniques for Cleaning Data 1. Print out data file and visually scan it.   2. Run “descriptives”: check means, min and max values, valid cases 3. Run “frequencies” 4. Run “cross tabs”: check if number of subs per condition looks OK 5. Correlations: Check if patterns of data move in expected direction 6. Multiple data entry: Locate and correct disparities in data entry.

Complete Data Set

Using Cross Tabs to Check for Missing or Erroneous Data Entry Case A: Expect equal cell sizes Gender   Oldest Youngest Only Child Males 10 20 Females 5 15 TOTAL 25 40 Case B: Impossible outcome Number of Siblings   Oldest Youngest Only Child None 4 3 6 One More than one 2 TOTAL 10 8

If No Errors Detected, chances are good that: Check a Sub Sample   1. Determine acceptable error rate Number of cases to randomly sample, by rate of acceptable error rate: Acceptable Error Rate Number of Cases to Randomly Review   If No Errors Detected, chances are good that: .50 5 50% or fewer errors .40 10 40% or fewer errors .20 25 20% or fewer errors .10 50 10% or fewer errors

Storing Data Raw Data Automated Data 2. File raw data by ID #   Raw Data 1. Hold raw data in secure place 2. File raw data by ID # 3. Hold raw date for at least X years post Y Automated Data 1. One pristine source, one working file, one syntax file 2. Back up, Back up, Back up

Storing Data Raw Data Automated Data 2. File raw data by ID #   Raw Data 1. Hold raw data in secure place 2. File raw data by ID # 3. Hold raw date for at least 5 years post publication Automated Data 1. One pristine source, one working file, one syntax file 2. Back up, Back up, Back up

Sort Raw Data Records 01-20 21-40 41-60 61-80 81-100 101-120  

COMMENT SYNTAX FILE GUN CONTROL STUDY SPRING 2007 COMMENT DATA MANAGEMENT IF (gender = 1 & party = 1) genparty = 1 . EXECUTE . IF (gender = 1 & party = 2) genparty = 2 . IF (gender = 2 & party = 1) genparty = 3 . IF (gender = 2 & party = 2) genparty = 4 . COMMENT ANALYSES UNIANOVA gunctrl BY gender party /METHOD = SSTYPE(3) /INTERCEPT = INCLUDE /PRINT = DESCRIPTIVE /CRITERIA = ALPHA(.05) /DESIGN = gender party gender*party . ONEWAY gunctrl BY genparty /CONTRAST= -1 -1 -1 3 /STATISTICS DESCRIPTIVES /MISSING ANALYSIS /POSTHOC = TUKEY ALPHA(.05).

Cleaning Data Set 1. Exercise in delay of gratification   1. Exercise in delay of gratification 2. Purpose: Reduce random error 3. Improve power of inferential stats.