Statistical Analysis of Single Case Design Serial Dependence Is More than Needing Cheerios for Breakfast.

Slides:



Advertisements
Similar presentations
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Advertisements

Statistics : Role in Research. Statistics: A collection of procedures and processes to enable researchers in the unbiased pursuit of Knowledge Statistics.
Effect Size and Meta-Analysis
The Campbell Collaborationwww.campbellcollaboration.org Moderator analyses: Categorical models and Meta-regression Terri Pigott, C2 Methods Editor & co-Chair.
PTP 560 Research Methods Week 4 Thomas Ruediger, PT.
Heterogeneity in Hedges. Fixed Effects Borenstein et al., 2009, pp
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 14 Inferential Data Analysis
Nasih Jaber Ali Scientific and disciplined inquiry is an orderly process, involving: problem Recognition and identification of a topic to.
Single-Subject Designs
Relationships Among Variables
Chemometrics Method comparison
Inferential Statistics
Inferential statistics Hypothesis testing. Questions statistics can help us answer Is the mean score (or variance) for a given population different from.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Overview of Meta-Analytic Data Analysis
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 – Multiple comparisons, non-normality, outliers Marshall.
Daniel Acuña Outline What is it? Statistical significance, sample size, hypothesis support and publication Evidence for publication bias: Due.
Single-Case Research: Standards for Design and Analysis Thomas R. Kratochwill University of Wisconsin-Madison.
Non-Overlap Methods in Single Case Research Methodology Erin E. Barton, PhD, BCBA-D.
Program Evaluation. Program evaluation Methodological techniques of the social sciences social policy public welfare administration.
Single-Case Research Designs: Training Protocols in Visual Analysis Wendy Machalicek University of Oregon Acknowledgement: Rob Horner Tom.
Chapter 12: Introduction to Analysis of Variance
Effect Sizes for Meta-analysis of Single-Subject Designs S. Natasha Beretvas University of Texas at Austin.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Agenda Review of Last week Learn about types of Research designs –How are they different from each other? From other things? Applying what you learned.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Statistical Applications for Meta-Analysis Robert M. Bernard Centre for the Study of Learning and Performance and CanKnow Concordia University December.
Correlational Research Chapter Fifteen Bring Schraw et al.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Current Methodological Issues in Single Case Research David Rindskopf, City University of New York Rob Horner, University of Oregon.
Education 795 Class Notes Data Analyst Pitfalls Difference Scores Effects Sizes Note set 12.
The Campbell Collaborationwww.campbellcollaboration.org C2 Training: May 9 – 10, 2011 Introduction to meta-analysis.
Effect Size Calculation for Meta-Analysis Robert M. Bernard Centre for the Study of Learning and Performance Concordia University February 24, 2010 February.
1 Copyright © 2011 by Saunders, an imprint of Elsevier Inc. Chapter 8 Clarifying Quantitative Research Designs.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Statistical Power The power of a test is the probability of detecting a difference or relationship if such a difference or relationship really exists.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Chapter 14 Repeated Measures and Two Factor Analysis of Variance
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Research Methods Ass. Professor, Community Medicine, Community Medicine Dept, College of Medicine.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Retain H o Refute hypothesis and model MODELS Explanations or Theories OBSERVATIONS Pattern in Space or Time HYPOTHESIS Predictions based on model NULL.
Chapter 10 The t Test for Two Independent Samples
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Handout Seven: Independent-Samples t Test Instructor: Dr. Amery Wu
Overlap Methods derived from Visual Analysis in Single Case Research Methodology In collaboration with Brian Reichow and Mark Wolery.
Single-Subject and Correlational Research Bring Schraw et al.
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Meta-Analysis of Single-Case Designs
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
IES Single-Case Research Institute: Training Visual Analysis Rob Horner University of Oregon
Chapter 11 Meta-Analysis. Meta-analysis  Quantitative means of reanalyzing the results from a large number of research studies in an attempt to synthesize.
An Application of Multilevel Modelling to Meta-Analysis, and Comparison with Traditional Approaches Alison O’Mara & Herb Marsh Department of Education,
Chapter 12 Introduction to Analysis of Variance
Single-Case Effect Size and Meta-Analytic Measures
Analysis for Designs with Assignment of Both Clusters and Individuals
Experiments, Simulations Confidence Intervals
A Meta-Analysis of Video Modeling Interventions that Teach Employment Related Skills to Individuals with Autism Carol Sparber, M.Ed. Intervention Specialist.
Chapter Eight: Quantitative Methods
Gerald - P&R Chapter 7 (to 217) and TEXT Chapters 15 & 16
Review for Exam 2 Some important themes from Chapters 6-9
Randomization: A Missing Component of the Single-Case Research Methodological Standards Joel R. Levin University of Arizona Adapted from Kratochwill, T.
Correlated-Groups and Single-Subject Designs
Single-Case Intervention Research Training Institute
Non-Overlap Measures PND PEM ECL (PEM-T) NAP TauU TauUadj.
Chapter Nine: Using Statistics to Answer Questions
Single-Case Intervention Research Training Institute
Presentation transcript:

Statistical Analysis of Single Case Design Serial Dependence Is More than Needing Cheerios for Breakfast

Goal of Presentation Review concept of effect size Describe issues in using effect size concept for single case design Describe different traditional approaches to calculating effect size for single case design Illustrate one recent approach

Behavioral Intervention Research 3

Putting A Coat on Your Dog Before Going For a Walk

How We Have Gotten To This Meeting Long history of statistical analysis of SCD Criticism of quality of educational research (Shalvelson & Towne, 2002) Establishment of IES – Initial resistance to SCD Influence of professional groups IES willingness to fund research on statistical analysis

Concept of Effect Size of Study Effect size is a statistic quantifying the extent to which sample statistics diverge from the null hypotheses (Thompson, 2006)

Types of ESs for Group Design Glass Δ = (Me – Mc) / SDc Cohen’s d = (Me – Mc)/ Sdpooled Interpretation – Small =.20 – Medium =.50 – Large =.80 R 2 Eta 2 = SSeffect/SStotal

Statistical Analysis Antithetical To Single Case Design (SCD)? Original developers believed that socially important treatment effects have to be large enough to be reliably detected by visual inspection of the data.

Kazdin (2011) proposes Visual inspection less trustworthy when effects at not crystal clear Serial dependence may obscure visual analysis Detection of small effects may lead to understanding that could in turn lead to large effects Statistical analysis may generate ESs that allow one to answer more precise questions Effects for different types of individuals Experimenter effects

Example: PRT and Meta-Analysis (Shadish, 2012) Pivotal Response Training (PRT) for Childhood Autism 18 studies containing 91 SCD’s. For this example, to meet the assumptions of the method, the preliminary analysis: – Used only the 14 studies with at least 3 cases (66 SCDs). – Kept only the first baseline and PRT treatment phases, eliminating studies with no baseline After computing 14 effect sizes (one for each study), he used standard random effects meta- analytic methods to summarize results:

Results Distribution Description N Min ES Max ES Wghtd SD Fixed & Random Effects Model Mean ES -95%CI +95%CI SE Z P Fixed Random Random Effects Variance Component v = Homogeneity Analysis Q df p Random effects v estimated via noniterative method of moments. The results are of the order of magnitude that we commonly see in meta-analyses of between groups studies I 2 = 67.5%

Studies done at UCSB (=0) or elsewhere (=1) Analog ANOVA table (Homogeneity Q) Q df p Between Within Total Q by Group Group Qw df p Effect Size Results Total Mean ES SE -95%CI +95%CI Z P k Total Effect Size Results by Group Group Mean ES SE -95%CI +95%CI Z P k Maximum Likelihood Random Effects Variance Component v = se(v) = Of course, we have no idea why studies done at UCSB produce larger effects: different kinds of patients? different kinds of outcomes? But the analysis does illustrate one way to explore heterogeneity

Search for the Holy Grail of Effect Size Estimators No single approach agreed upon: (40+ have been identified, Swaminathan et al., 2008) Classes of approaches – Computational approaches – Randomization test – Regression approaches – Tau-U (Parker et al., 2011) as combined approach

Computational Approaches Percentage of Nonoverlapping Datapoints (PND) (Scruggs, Mastropieri, & Casto, 1987) Percentage of Zero Data (Campbell, 2004) Improvement Rate Difference (Parker, Vannest, & Brown, 2009)

ABAB 6 Level of Experimental Control No Exp Control Publishable Strong Exp Control

ABAB 7 Level of Experimental Control No Exp Control Publishable Strong Exp Control Evaluate for LEVEL Evaluate for TREND Evaluate for LEVEL Evaluate for TREND

Problem with phases

Randomization Test Edgington (1975, 1980) advocated strongly for use of nonparametric randomization tests. – Involves selection of comparison points in the baseline and treatment conditions – Requires random start day for participants (could be random assignment of participants in MB design, Wampold & Worsham, 1986) Criticized for SDC – Large Type I Error rate (Haardofer & Gagne, 2010) – Not robust to independence assumption and sensitivity low for data series < 30 to 40 datapoints (Manolov & Solanas, 2009)

ABAB 7 Level of Experimental Control No Exp Control Publishable Strong Exp Control Evaluate for LEVEL Evaluate for TREND Evaluate for LEVEL Evaluate for TREND

Regression (Least Squares Approaches) ITSACORR (Crosbie, 1993) – Interrupted time series analysis – Criticized for not being correlated with other methods White, Rusch, Kazdin, & Hartmann (1989) Last day of Treatment Comparison (LDT) – Compares two LDT for baseline and treatment – Power weak because of lengthy predictions

Regression Analyses Mean shift and mean-plus-trend model (Center, Skiba, & Casey; ) Ordinary least squares regression analysis (Allison & Gorman, 1993) – Both approaches attempt to control for trends in baseline when examining the performances in treatment d-Estimator (Shadish, Hedges, Rinscoff, 2012) GLS with removal of autocorrelaiton (Swaminathan, Horner, Rogers, & Sugai, 2012)

Tau-U (Parker, Vannest, Javis, & Sauber, 2011) Mann-Whitney U a nonparametric that compares individual data point in groups (AB comparisons) Kendal’s Tau does these same thing for trend within groups Tau-U – Tests and control for trend in A phase – Test for differences in A and B phases – Test and adjust for tend in B phase

ABAB 7 Level of Experimental Control No Exp Control Publishable Strong Exp Control Evaluate for LEVEL Evaluate for TREND Evaluate for LEVEL Evaluate for TREND

Tau-U Calculator Vannest, K.J., Parker, R.I., & Gonen, O. (2011). Single Case Research: web based calculators for SCR analysis. (Version 1.0) [Web-based application]. College Station, TX: Texas A&M University. Retrieved Sunday 15th July Combines nonoverlap between phases with trend from within intervention phases – Will detect and allow researcher to control for undesirable trend in baseline phase Data are easily entered on free website Generates a d for effects with trend withdrawn when necessary

Themes: An accessible and feasible effect size estimator As end users, SCD researchers need a tool that we can use without having to consult our statisticians – Utility of the hand calculated trend line analysis – Example of feasible tool, but criticized (ITSACORR< Crosbie, 1993). Parker, Vannest, Davis, & Sauber (2011) – Tau-U calculator

Theme: What is an effect—a d that detect treatment or/and level effect If a single effect size is going to be generated for an AB comparison: should the d be reported separately for level (intercept) or trend (slope)? – If so, problematic for meta-analysis ES estimators here appear to provide a combined effect for slope and intercept Parker et al. (2011) incorporate both

Theme: What comparisons get included in the meta-analysis Should we only use the initial AB comparison in ABAB Designs?

Theme: What comparisons get included in the meta-analysis Should we only include points at which functional relationship is established?

Theme: How many effects sizes per study? 30 Study Study 1 Study 2 … Study K Subject Subj 1 Subj 2 Subj 1 Subj 1 Subj 2 Subj 3 Subj4 Moments m mm m m m m m m mm mm m

Challenge: How do you handle zero baselines or treatment phases?

Heterogeneity In SCD: A reality SCD Researchers used a range of different designs and design combinations A look at current designs – Fall, 2011 Issue of Journal of Applied Behavior Analysis

Comparison of SDC and Group Design ESs: The Apples and Oranges Issues Logic of casual inferences different – Groups: Means differences between groups – SCD: Replication of effects either within or across all “participants” – Generally d represents different comparison Data collected to document an effect different – Group designs collect data before treatment and after treatment – SCDs collect data throughout treatment phase, so for treatments that build performance across time, they may appear less efficacious because they are including “acquisition” phase effects in analysis

ABAB 4 Level of Experimental Control No Exp Control Publishable Strong Exp Control

Conclusions I learned a lot Sophistication of analyses is increasing Feasibility of using statistical analysis is improving Can use statistical analysis as supplement to visual inspection (Kazdin, 2011) Statistical analysis may not be for everybody, but it is going to foster acceptability in the larger education research community, and for that reason SCD researchers should consider it.