Analyzing Intervention Fidelity and Achieved Relative Strength David S. Cordray Vanderbilt University NCER/IES RCT Training Institute,2010.

Slides:



Advertisements
Similar presentations
Richard M. Jacobs, OSA, Ph.D.
Advertisements

Structural Equation Modeling
Review of Identifying Causal Effects Methods of Economic Investigation Lecture 13.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Modeling “The Cause”: Assessing Implementation Fidelity and Achieved Relative Strength in RCTs David S. Cordray Vanderbilt University IES/NCER Summer Research.
Effect Size Overheads1 The Effect Size The effect size (ES) makes meta-analysis possible. The ES encodes the selected research findings on a numeric scale.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Concept of Measurement
ANCOVA Psy 420 Andrew Ainsworth. What is ANCOVA?
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Clustered or Multilevel Data
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
Practical Meta-Analysis -- D. B. Wilson
Today Concepts underlying inferential statistics
Correlational Designs
Chapter 7 Correlational Research Gay, Mills, and Airasian
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Regression and Correlation Methods Judy Zhong Ph.D.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Inference for regression - Simple linear regression
Correlation.
Chapter 15 Correlation and Regression
Topic 6.1 Statistical Analysis. Lesson 1: Mean and Range.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Achieved Relative Intervention Strength: Models and Methods Chris S. Hulleman David S. Cordray Presentation for the SREE Research Conference Washington,
Progressing Toward a Shared Set of Methods and Standards for Developing and Using Measures of Implementation Fidelity Discussant Comments Prepared by Carol.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
Conceptualizing Intervention Fidelity: Implications for Measurement, Design, and Analysis Implementation Research Methods Meeting September 20-21, 2010.
Conceptualizing Intervention Fidelity: Implications for Measurement, Design, and Analysis Implementation: What to Consider At Different Stages in the Research.
Assessing Intervention Fidelity in RCTs: Models, Methods and Modes of Analysis David S. Cordray & Chris Hulleman Vanderbilt University Presentation for.
Testing Hypotheses about Differences among Several Means.
Assumptions of value-added models for estimating school effects sean f reardon stephen w raudenbush april, 2008.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 12 Correlational Designs.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
Background Treatment fidelity in group based parent training: Predicting change in parent and child behaviour Dr. Catrin Eames, Bangor University, UK
Review of Research Methods. Overview of the Research Process I. Develop a research question II. Develop a hypothesis III. Choose a research design IV.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 12 Making Sense of Advanced Statistical.
Instrumental Variables: Introduction Methods of Economic Investigation Lecture 14.
Lecture 10: Correlation and Regression Model.
Empirically Based Characteristics of Effect Sizes used in ANOVA J. Jackson Barnette, PhD Community and Behavioral Health College of Public Health University.
© (2015, 2012, 2008) by Pearson Education, Inc. All Rights Reserved Chapter 11: Correlational Designs Educational Research: Planning, Conducting, and Evaluating.
Limit collection of categorical data Age – – – – – & Above Income ,000 10,001 – 25,000 25,001 – 35,000.
T tests comparing two means t tests comparing two means.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Single-Subject and Correlational Research Bring Schraw et al.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
Developing an evaluation of professional development Webinar #2: Going deeper into planning the design 1.
Progressing Toward a Shared Set of Methods and Standards for Developing and Using Measures of Implementation Fidelity Symposium Chair: Chris S. Hulleman,
Designing An Adaptive Treatment Susan A. Murphy Univ. of Michigan Joint with Linda Collins & Karen Bierman Pennsylvania State Univ.
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Effectiveness of Selected Supplemental Reading Comprehension Interventions: Impacts on a First Cohort of Fifth-Grade Students June 8, 2009 IES Annual Research.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
The Cause…or the “What” of What Works? David S. Cordray Vanderbilt University IES Research Conference Washington, DC June 16, 2006.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Statistical analysis.
Multiple Imputation using SOLAS for Missing Data Analysis
Statistical analysis.
12 Inferential Analysis.
Making Sense of Advanced Statistical Procedures in Research Articles
Chapter Eight: Quantitative Methods
12 Inferential Analysis.
Considering Fidelity as an Element Within Scale Up Initiatives Validation of a Multi-Phase Scale Up Design for a Knowledge-Based Intervention in Science.
Analyzing Intervention Fidelity and Achieved Relative Strength
Presentation transcript:

Analyzing Intervention Fidelity and Achieved Relative Strength David S. Cordray Vanderbilt University NCER/IES RCT Training Institute,2010

Perspectives on Interventionion Fidelity: When is Analysis Needed? Monitoring implementation, retraining, maximize fidelity to the intervention model. Program-specific models (within Tx, only) Interpreting results from ITT analyses; –Cause-Effect congruity; –Lack of Cause-Effect congruity. Analysis of variation in delivery/receipt of intervention components. Compliance-based analyses (TOT, LATE) Mediation – instrumental variables approach

Program Specific Models Analyses conducted within the intervention, only R  A  Outputs  Outcomes No variance, no covariance Component-wise assessment of fidelity Configural judgment about implementation fidelity

Achieved Relative Strength =.15 Infidelity “Infidelity” t C t tx T TX’ T Tx TCTC Treatment Strength Outcome Intervention Exposure True Fidelity Tx Contamination Augmentation of C Intervention Exposure Positive Infidelity Intervention Differentiation Review: Concepts and Definitions Variability

Analysis Type I Congruity of Cause-Effect

Common Cause-Effect Scenarios The Cause The Effect Low High Low Low/Low = Cause-Effect Congruity Low/High = ???? HighHigh/Low = Dampening Process ???? High/High = Cause-Effect Congruity

Cause-Effect Congruity: High/High Example Fantuzzo, King & Heller (1992) studied the effects of reciprocal peer tutoring on mathematics and school adjustment. –2 X 2 factorial design crossing levels of structured peer tutoring and group reward –45 min. 2-3 per week; sessions Fidelity assessments: –Observations (via checklist) of students and staff, rated the adherence of group members to scripted features of each condition; 50% random checks of sessions –Mid-year, knowledge tests to index the level of understanding of students about the intervention components in each of the four conditions.

Fantuzzo et al. Continued Fidelity results: –Adherence (via observations): % across conditions, 95% overall –Student understanding (via 15 item test): 82% SD=11% (range %); ANOVA=ns Reward+structure condition: 84% Control: 86% Effects on mathematics computation: ES= ( )/1.71 = 1.58 Congruity=High/High; no additional analyses needed

Exposure and Achieved Relative Strength Fantuzzo et al. example is: –Relatively rare; –Incorporates intervention differentiation, yielding fidelity indices for all conditions. More commonly, intervention exposure is assessed: –Yielding scales of the degree to which individuals experience the intervention components in both conditions –The achieved relative strength index is used for establishing the differences between conditions on causal components

Average ARS Index Where, = mean for group 1 (t Tx ) = mean for group 2 (t C ) S T = pooled within groups standard deviation n Tx = treatment sample size n C = control sample size n = average cluster size p = Intra-class correlation (ICC) N = total sample size Group DifferenceSample Size Adjustment Clustering Adjustment

A Partial Example of the Meaning of ARSI Randomized Group Assignment Professional Development Differentiated Instruction Improved Student Outcomes

Hours of Professional Development Mean= 28.2 SD=7.04 Mean=61.8 SD=6.14 Control Intervention ARSI: = ( )/6.61 =5.08 U3= 99% Very Large Group Difference, Limited Overlap Between Conditions

Cohen’s U3 Index: Very Large Group Separation Control Mean 50 th percentile Intervention Mean U3=99 th Percentile ARSI=5.08

Hours of Professional Development Mean= 28.2 SD=7.04 Mean=30.8 SD=6.14 Control Intervention ARSI: = ( )/6.61 =0.39 U3= 66% Small Group Differences, Substantial Overlap

Main points…. Analysis of intervention fidelity and achieve relative strength is a natural counterpart to estimating ESs in ITT studies. They provide an interpretive framework for explaining outcome effects. When ES and ARSI are discordant, serve as the basis for additional analysis. Next section focuses on analysis of variation

Analysis II Linking Variation in Treatment Receipt/Delivery to Outcomes

Variation in Treatment Receipt/Delivery Within Groups: Regression Models Using Fidelity/ARS Indicators Rather than relying on the 0,1 coding of groups, fidelity indicators replace the group variable. New question being answered. Value of fidelity indices will depend on their strength of the relationship with the outcome; The greater the group difference, on average, the less informative fidelity indicators will be; and High predictability requires reliable indices

Using Group, Fidelity Indicators, or Both: A Simple Example Randomized Group Assignment Fidelity Indicator= Hours of Professional Development Outcome= Differentiated Instruction Improved Student Outcomes

The “Value Added” of Implementation Fidelity/ARS Data Group Separation U3Predicting Level of Differentiated Instruction R 2 Group R 2 Hours Pro Development Small * (0.28) Large *0.437* (0.22) Very Large *0.549* (0.15)

Intent-to-treat (ITT) vs. Fidelity/ARS Index: An Example Justice, Mashburn, Pence, & Wiggins (2008) examined: –Language-Focused Curriculum (LFC) in 14 classes; –Classes randomly assigned to LFC and control; –Core component of LFC is the use of language stimulation techniques (e.g., open questions, recasts, models); and –Outcome  Growth in expressive language examined (fall to spring)

Justice et al. Continued Implementation fidelity assessed: –3 times using 2 hour observation (45 item check list) 50 min. video sample; and 40 weekly lesson plans. Fidelity score = –weighted sum of frequency of the use of 7 language stimulation techniques (range 0-21); Fidelity = score/21; averaged over observations Results: –LST teachers average fidelity = 0.57 (range ) –Control teachers average fidelity = 0.32 (range ) –ANOVA F=11.83, p =.005; d = ARSI = 1.71

Justice et al. Continued Level 1 Level 2  ITT Level 2  TOT 0,1 Group Fidelity Score Both coefficients were not significant and near zero.

Justice, et al. (2008): Some Important Issues Few teachers exhibited high levels of LST use (core component of LFC) Fidelity overall = 0.45 Justice et al. argue, the large group difference (ARSI=1.71 for fidelity = 0.57 vs. 0.32) may not have been sufficient because the dosage (0.57) was so far below what is needed to affect language development. Other possibilities include: –Reliability of the scaling? –Use of average when trend in observations showed improvement? –Coverage of central constructs? –Functional form of fidelity-outcome linkage?

Analysis III: Compliance-based Models

ITT and Treatment on the Treated (TOT) Average Treatment Effect (ATE) estimate assume full compliance. Intent to treat (ITT) estimate ignores compliance. –Common to have “no-shows” (for whatever reasons). What is the effect of treatment on those receiving treatment (TOT) – the “shows”? Bloom 1984 – proposed an adjustment to the ITT estimate, applies to non-compliance of “no- shows” only: ITT/p T Where P T is the percent Tx recipients

TOT continued Assumes treatment receipt/delivery can be meaningfully dichotomized and there is experimentally induced receipt or non-receipt of treatment. Works well when no shows experience no treatment effect, Assumes randomization does not influence the outcome unless an individual receives the intended treatment. This is plausible when treatments are voluntary, substantial, and without a close substitute. In mandatory treatments assumption is less plausible, randomization can affect behavior and outcomes,

Local Average Treatment Effect (LATE) LATE (Local Average Treatment Effect): – adjust ITT estimate by T and C treatment receipt rates (compliers in T and defiers in C) –ITT is adjusted as: ITT/(p c -p d ) LATE can approximate causal estimates, if a series of assumptions hold. Model can be expressed as Z  D  Y (or random assignment  treatment receipt  outcome)

Some Key LATE Assumptions Defiers do not exist or the average program effect is the same for compliers and defiers (monotonicity assumption). The effect of Z (assignment) on Y (outcome) is thru variable D (Tx receipt) or the exclusion restriction (all other paths between Z and Y are excluded). Strong covariation between Z and D Randomization makes the instrument and predicted D uncorrelated with error (e). SUTVA (stable unit treatment value assumption)

Analysis Type 4: Mediation Model

Mediation Models Random Assignment (Z) Treatment Receipt (D) Outcome (Y) This is an extension of LATE and is assumption laden

Key Points and Issues Fidelity assessment serves two roles: –Average causal difference between conditions; and –Using fidelity measures to assess the effects of variation in implementation on outcomes. Degree of fidelity and Achieved Relative Strength provide fuller picture of the results Modeling fidelity depends on: –Level of implementation fidelity –Outcome variance –Variability in implementation fidelity/ARS l Most applications, fidelity is just another Level 2 or 3 variable. Uncertainty and the need for alternative specifications: –Measure of fidelity –Index of achieved relative strength –Fidelity-outcome model specification (linear, non-linear) Adaptation-fidelity tension

Questions and Discussion