Coding Manual and Process

Slides:



Advertisements
Similar presentations
Measurement Concepts Operational Definition: is the definition of a variable in terms of the actual procedures used by the researcher to measure and/or.
Advertisements

Chapter 4 – Reliability Observed Scores and True Scores Error
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Microsoft Office Word 2013 Expert Microsoft Office Word 2013 Expert Courseware # 3251 Lesson 4: Working with Forms.
Introduction to Multilevel Modeling Using SPSS
(5) Moderators - Coding. Overview General Information to keep in mind:  Coded variables - Objective – which are usually study characteristics that can.
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 9 Processing the Data.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Inference for regression - Simple linear regression
INT 506/706: Total Quality Management Introduction to Design of Experiments.
APPENDIX B Data Preparation and Univariate Statistics How are computer used in data collection and analysis? How are collected data prepared for statistical.
Classroom Assessment A Practical Guide for Educators by Craig A
1 1 Slide © 2016 Cengage Learning. All Rights Reserved. The equation that describes how the dependent variable y is related to the independent variables.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Quantifying Data Advanced Social Research (soci5013)
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Dependencies Complex Data in Meta-analysis. Common Dependencies Independent subgroups within a study (nested in lab?) Multiple outcomes on the same people.
CHAPTER 11 SECTION 2 Inference for Relationships.
PROCESSING, ANALYSIS & INTERPRETATION OF DATA
Chapter 13 Multiple Regression
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 12 Making Sense of Advanced Statistical.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
RESEARCH METHODS Lecture 29. DATA ANALYSIS Data Analysis Data processing and analysis is part of research design – decisions already made. During analysis.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Calculating Inter-coder Reliability
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
TIMOTHY SERVINSKY PROJECT MANAGER CENTER FOR SURVEY RESEARCH Data Preparation: An Introduction to Getting Data Ready for Analysis.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analysis of Experiments
Data Management Seminar, 8-11th July 2008, Hamburg WinDEM- Verification Checks Part I.
Assumptions of Multiple Regression 1. Form of Relationship: –linear vs nonlinear –Main effects vs interaction effects 2. All relevant variables present.
Chapter 13 Understanding research results: statistical inference.
Coding Preparing The Research for Data Entry. Coding (defined) Coding is the process of converting questionnaire responses into a form that a computer.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Data Entry, Coding & Cleaning SPSS Training Thomas Joshua, MS July, 2008.
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
Stats Methods at IC Lecture 3: Regression.
Chapter 8 Introducing Inferential Statistics.
Data Sciences Unit, School of Psychology, Deakin University
Selecting the Best Measure for Your Study
Classroom Assessment A Practical Guide for Educators by Craig A
Constructing Propensity score weighted and matched Samples Stacey L
Meta-analysis: Conceptual and Methodological Introduction
The Question The first step is deciding on the question to be asked:
CHAPTER 2 Research Methods in Industrial/Organizational Psychology
Random Effects & Repeated Measures
Manahel Alafaleq and Lianghuo Fan
Effect size measures for single-case designs: General considerations
Program Design Introduction to Computer Programming By:
Chi square.
Chapter 11: Inference for Distributions of Categorical Data
Main Effects and Interaction Effects
I. Statistical Tests: Why do we use them? What do they involve?
Psych 231: Research Methods in Psychology
PA430 - Data coding March 7/8, 2000.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Chi square.
Indicator 3.05 Interpret marketing information to test hypotheses and/or to resolve issues.
Presentation transcript:

Coding Manual and Process

Illustration, and Introduction of metafor Teacher expectancy data (Raudenbush and Bryk) Reading data into R Acquiring the metafor package Simple analyses using metafor What is metafor actually doing?

The Coding Manual Begin by keeping a list of possible covariates you might be interested in But eventually narrow because meta-analytic sample sizes tend to be small, with insufficient degrees of freedom to support too much data. Begin preparing a draft coding manual Consider a web-based coding manual Saves confusing multiple electronic files that coders give you periodically Allows coders to work at home Automatically saves and forwards data to you.

Components of Coding Manual The goal is codes that are useful in describing and predicting effect sizes. An ID system (to be discussed shortly) Substantive codes, e.g., Treatment subtype Methodological Codes, e.g., Kind of comparison group (TT v TC) Effect size codes (effect size and sample size per group) How to deal with missing data Plausible Guesses Distinguishing “No” from “Unknown” Code missing data as NA for analysis in R Partial example next page, full example CROPS

Structure of Coding Manual In meta-analysis, the effect size is the basic unit being described The structure of the manual depends on whether your studies are group comparisons or correlational.

Typical Structure for Group Comparisons ID Variables that are constant in a study e.g., publication status, year published, methods like use of random assignment Variables associated with Group 1 E.g., characteristics of participants, of treatment, of treatment providers Variables associated with Group 2 All of Group 1 codes, plus an indicator for whether the comparison is Treatment-Treatment or Treatment-Control Variables associated with the outcome measure E.g., self-report or not, training of observers, masking, construct being measured Variables associated with effect size E.g., method of computation, sample size.

Typical Structure for Correlational Studies ID Variables that are constant in a study Variables associated with Variable 1 E.g., what is it measuring, how measured, who measured, etc. Variables associated with Variable 2 Variables associated with effect size

An ID System I keep ID bibliography in Microsoft Word Purpose: to locate particular places in the data set To correct errors (e.g., found when cleaning data) To recode (e.g., to change a particular study) Components: Study ID number Comparison ID number (effect sizes are always on comparisons) Dependent Variable ID number Any other information you want to record To record reasons for excluding studies initially used. Sample ID system in Handouts (Part on next slide)

Training of Coders Crucial to train coders, especially If large numbers of studies (so you can’t do it all yourself) If inferences are complex (e.g., should alternating assignment be coded as random?) If coders are undergraduates (or at that level) Random spot checks on all codes by PI

The Coding Process Do effect size coding separate from all other coding Inter-Rater Reliability (IRR) Decide whether to code some or all studies more than once (advisable if you are not the coder). Some or all studies? If some, at least 10-20? Can use third coder (PI?) to resolve disagreements among the first two coders. Report IRR analysis (kappa; see handout)

The Hierarchical Structure of Meta-Analytic Data In between-group treatment outcome studies, Some data are constant over the whole study (e.g., year of publication, publication status), and so only needs to be coded once Some data vary for each treatment condition (e.g., length of treatment, dosage). Some data vary for each outcome measure (e.g., self- vs other report, reliability) Some data vary for each effect size (e.g., n per group, computation method), and so needs to be coded for every effect size So:

Square vs Hierarchical Data Sets The effect size is the basic unit (each line) Square Data Sets What we commonly think of as a data set Every variable coded for each effect size even if redundant Efficient if number of studies (k) or effect sizes is small. Hierarchical Data Sets Code study level variables once for each study Code treatment variables once for each treatment Code outcome variables once for each outcome Code effect size variables once for each effect size Then use “match files” to duplicate all the missing codes from what would have been the square data set Most efficient when k or number of effect sizes is large, but more complex to do.

Data Entry Depends on square or hierarchical data sets If square, one file If hierarchical, one file for each “level” Option #1: First do paper codes, then enter into computer More room for transcription errors Option #2: Direct data entry into, say, SPSS Data Editor, or an Excel Worksheet with a manual as a guide. Or directly into a web-based manual.

Example of Excel Coding Sheet