General Overview and History of Statistical Analysis of Single-Case Intervention Data Joel R. Levin University of Arizona.

Slides:



Advertisements
Similar presentations
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
Advertisements

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
+ Evidence Based Practice University of Utah Presented by Will Backner December 2009 Training School Psychologists to be Experts in Evidence Based Practices.
PTP 560 Research Methods Week 4 Thomas Ruediger, PT.
Chapter Seventeen HYPOTHESIS TESTING
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 11 Experimental and Quasi-experimental.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
BCOR 1020 Business Statistics
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 13 Using Inferential Statistics.
Today Concepts underlying inferential statistics
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
L1 Chapter 11 Experimental and Quasi- experimental Designs Dr. Bill Bauer.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Experimental Design The Gold Standard?.
Xitao Fan, Ph.D. Chair Professor & Dean Faculty of Education University of Macau Designing Monte Carlo Simulation Studies.
AM Recitation 2/10/11.
Difference Two Groups 1. Content Experimental Research Methods: Prospective Randomization, Manipulation Control Research designs Validity Construct Internal.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Applying Science Towards Understanding Behavior in Organizations Chapters 2 & 3.
Single-Case Research: Standards for Design and Analysis Thomas R. Kratochwill University of Wisconsin-Madison.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 11 Experimental Designs.
Statistical Randomization Techniques for Single-Case Intervention Data Statistical Randomization Techniques for Single-Case Intervention Data Joel R. Levin.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Randomization: A Missing Component of the Single-Case Research Methodological Standards Adapted from Kratochwill, T. R., & Levin, J. R. (2010). Enhancing.
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Classroom-Based Applications of Single-Case Designs: Methodological and Statistical Issues Joel R. Levin University of Arizona.
Current Methodological Issues in Single Case Research David Rindskopf, City University of New York Rob Horner, University of Oregon.
Statistical Models for the Analysis of Single-Case Intervention Data Introduction to:  Regression Models  Multilevel Models.
Random Thoughts On Enhancing the Scientific Credibility of Single-Case Intervention Research: Randomization to the Rescue Thomas R. Kratochwill and Joel.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Academic Research Academic Research Dr Kishor Bhanushali M
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Question paper 1997.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Experimental Research Methods in Language Learning Chapter 5 Validity in Experimental Research.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
Randomized Single-Case Intervention Designs Joel R
Statistical Randomization Techniques for Single-Case Intervention Data Statistical Randomization Techniques for Single-Case Intervention Data Joel R. Levin.
Single-Subject and Correlational Research Bring Schraw et al.
Brief Comments on Single-Case Effect Sizes and Meta-Analyses* Joel R. Levin University of Arizona *Based, in part, on Kratochwill, T. R., & Levin, J. R.
Why Consider Single-Case Design for Intervention Research: Reasons and Rationales Tom Kratochwill February 12, 2016 Wisconsin Center for Education.
Chapter 13 Understanding research results: statistical inference.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Statistics & Evidence-Based Practice
Inference about the slope parameter and correlation
DAY 2 Visual Analysis of Single-Case Intervention Data Tom Kratochwill
Single-Case Effect Size and Meta-Analytic Measures
Understanding Results
Statistical Models for the Analysis of Single-Case Intervention Data
Joel R. Levin University of Arizona
Statistical Randomization Techniques for Single-Case Intervention Data and Introduction to ExPRT Statistical Software Joel R. Levin University of.
Masked Visual Analysis (MVA)
Statistical Models for the Analysis of Single-Case Intervention Data
Randomization: A Missing Component of the Single-Case Research Methodological Standards Joel R. Levin University of Arizona Adapted from Kratochwill, T.
Single-Case Intervention Research Training Institute
Understanding Statistical Inferences
Inferential Statistics
Masked Visual Analysis (MVA)
Single-Case Intervention Research Training Institute
Statistical Models for the Analysis of Single-Case Intervention Data
Presentation transcript:

General Overview and History of Statistical Analysis of Single-Case Intervention Data Joel R. Levin University of Arizona

Historical Overview Three Considerations (after Levin, Marascuilo, & Hubert, 1978) 1. Why an investigator would want, or feel the need, to conduct a formal statistical analysis scholarly research vs. client-centered focus investigator’s purpose and desired inferences generalizations from a single-case study Levin, J. R., Marascuilo, L. A., & Hubert, L. J. (1978). N = nonparametric randomization tests. In T. R. Kratochwill (Ed.), Single subject research: Strategies for evaluating change (pp ). New York: Academic Press.

Historical Overview 2. Whether any statistical analysis should be conducted predefined outcome criteria visual analysis options validity issues – external: generalization, sampling – internal: design features, experimental control 3. Which statistical procedure, of many potentially available, is the most appropriate one to use for the particular study/design? distributional assumptions statistical properties the most important criteria to be satisfied Levin, J. R., Marascuilo, L. A., & Hubert, L. J. (1978). N = nonparametric randomization tests. In T. R. Kratochwill (Ed.), Single subject research: Strategies for evaluating change (pp ). New York: Academic Press.

The Statistical Elephant in the Room: The Independence Assumption Of tea tastes (Fisher, 1935) and t tests From Shine (Shine & Bower, 1971) to Gentile, Roden, & Klein (1972) and beyond (e.g., Kratochwill et al., 1974) As reflected by the series autocorrelation (AC) coefficient, or the degree of serial dependence in the data Fisher, R. A. (1935). The design of experiments. New York: Hafner. Gentile, J. R., Roden, A. H., & Klein, R. D. (1972). An analysis of variance model for the intrasubject replication design. Journal of Applied BehaviorAnalysis, 5, Kratochwill, T. R., Alden, K., Demuth, D., Dawson, 0., Panicucci, C., Arntson, P., McMurray, N., Hempstead, J., & Levin, J. (1974). A further consideration in the application of an analysis-of-variance model for the intrasubject replication design. Journal of Applied Behavior Analysis, 7, Shine, L. C., & Bower, S. M. (1971). A one-way analysis of variance for single-subject designs. Educational and Psychological Measurement, 31,

From Levin, J. R., Lall, V. F., & Kratochwill, T. R. (2011). Extensions of a versatile randomization test for assessing single-case intervention effects. Journal of School Psychology, 49,

The Statistical Elephant in the Room: The Independence Assumption Empirical evidence, based on theory (e.g., Crosbie, 1987; Toothaker et al., 1983) But what exactly did these researchers find? And what does that have to do with the price of t in time-series intervention studies? Let’s take a look, in our own mini “Monte Carlo simulation study…

Autocorrelation Implications for an AB Intervention Study: A Simulation Example (With Special Thanks to John Ferron) 20 simulation “studies” were conducted with 30 outcome observations apiece. Each observation was drawn from a normal distribution with a mean of 0 and a standard deviation of 1 (i.e., normally distributed z scores). Each “study” of 30 observations was split into two halves, with the first 15 representing a baseline (A) phase and the second 15 representing a B (intervention) phase.

A Little Food for Thought In each of the 20 AB “studies,” the mean of the 15 A-phase observations and the mean of the 15 B- phase observations are calculated and statistically compared on the basis of a conventional two independent-sample t test based on a Type I error probability (α) per study of.10. Given the information presented so far, what results would you expect in the 20 “studies”?

A Little More Food to Chew On OK, now consider this additional information: In 10 of the “studies,” the 30 observations were generated assuming that they were mutually independent (i.e., as reflected by an autcorrelation of 0). In the other 10 “studies,” the 30 observations were generated assuming that they were serially dependent (as reflected by an autcorrelation of.40). What would you expect of the results now? Or would you prefer to phone a personal friend who is also a professional statistician?

A simulation with ρ = 0: r = -.018, p = ? A Phase Mean = -.27 (47.3) B Phase Mean = -.26 (47.4) (Note: 20 changes of direction in the series)

A simulation with ρ =.40: r =.575, p = ?

In Addition For the 10 AC = 0 simulations, with a nominal α of.10, ? of them resulted in rejecting the “no phase-mean difference” hypothesis ‒ thereby yielding an empirical α of ? whereas For the 10 AC =.40 simulations, with a nominal α of.10, ? of them resulted in rejecting the “no phase-mean difference” hypothesis ‒ thereby yielding an empirical α of ?

The Statistical Elephant in the Room: The Independence Assumption But what about in practice? –point (Busk & Marascuilo, 1988; Sharpley & Alavosius, 1988) –counterpoint (Huitema, 1985, 1988; Shadish & Sullivan, in press) –what’s the point? (Baer, 1988; Parsonson & Baer, 1992) Other autocorrelation issues –how best to estimate it? (e.g., Crosbie, 1993; Hedges, Pustejovsky, & Shadish, 2012; Riviello & Beretvas, 2011)

Some Autocorrelation References Baer, D. M. (1988). An autocorrelated commentary on the need for a different debate. Behavioral Assessment, 10, Busk, P. L., & Marascuilo, L. A. (1988). Autocorrelation in single-subject research: A counterargument to the myth of no autocorrelation. Behavioral Assessment, 10, Crosbie, J. (1987). The inability of the binomial test to control Type I error with single-subject data. Behavioral Assessment, 9, Crosbie, J. (1993). Interrupted time-series analysis with brief single-subject data. Journal of Consulting and Clinical Psychology, 61, Huitema, B. E. (1985). Autocorrelation in applied behavior analysis: A myth. Behavioral Assessment, 7, Huitema, B. E. (1988). Autocorrelation: 10 years of confusion. Behavioral Assessment, 10, Parsonson, B. S., & Baer, D. M. (1992). The visual analysis of data, and current research into the stimuli controlling it. In T. R. Kratochwill & J. R. Levin (Eds.), Single-case research design and analysis: New developments for psychology and education (pp ). Hillsdale, NJ: Eribaum. Riviello, C., & Beretvas, S. N. (2011). Detecting lag-one autocorrelation in interrupted time series experiments with small datasets. Unpublished manuscript, University of Texas, Austin. Shadish, W. R., Rindskopf, D. M., Hedges, L. V., & Sullivan, K. J. (2012). Bayesian estimates of autocorrelations in single-case designs. Behavior Research Methods, 4(4). DOI /s Shadish, W. R., & Sullivan, K. J. (in press). Characteristics of single-case designs used to assess intervention effects in Behavior Research Methods. Sharpley, C. F., & Alavosius, M. P. (1988). Autocorrelation in behavioral data: An alternative perspective. Behavioral Assessment, 10, Toothaker, L., E., Banz, M., Noble, C., Camp, J., & Davis, 0. (1983). N = 1 designs: The failure of ANOVA-based tests. Journal of Educational Statistics, 8,

So Who Cares About Elephants Anyway? Reflect back on our three initiating questions Why? –The hope for improved data-analysis objectivity and reliability, as well as more “valid” conclusions about intervention effects? –But always keep in mind the IOTT! Whether? In light of the investigator’s purposes, is statistical analysis really necessary?

So Who Cares About Elephants Anyway? Which? –Statistical-conclusion validity issues Acceptable Type I error control Acceptable statistical power The potential to yield informative confidence intervals and effect-size measures

Proposed Statistical Analysis Strategies Standard parametric statistical methods (e.g., t test, analysis of variance, conventional regression analysis, binomial test) –problems problems everywhere… Time-series analysis (e.g., ARIMA models) –concepts and basics (McCleary & Welsh, 1992) –problems, and problems within problems (Crosbie, 1993) –current limitations

The Elephant Plods Along to This Day: Some Published Examples Sometimes masquerading in esoteric international costumes: Lancioni et al.’s (2009) two-sample (A-phase vs. B-phase observations) Kolmogorov-Smirnov test Lancioni, G. E. et al. (2009). A learning assessment procedure to re-evaluate three persons with a diagnosis of post-coma vegetative state and pervasive motor impairment. Brain Injury, 23,

The Elephant Plods Along to This Day: Some Published Examples Take a gander at this: Wadnerkar et al.’s (2011) chi square “goodness of fit” test applied to a single case’s pre- and post- intervention categorized (and correlated) eye- gaze frequency counts – complete with computational errors. Wadnerkar, M. B., et al. (2011). A single case study of a family- centred intervention with a young girl with cerebral palsy who is a multimodal communicator. Child: Care, Health, and Development, 38,

Figure 2. Frequency of codes for girl’s multimodal communicative behaviours at pre and post intervention. AAC, augmentative and alternative communication.

The Elephant Plods Along to This Day: Some Published Examples How about an impressive combination of “nonparametric” techniques? Bragard et al.’s (2012) McNemar test of proportions and Wilcoxon test applied to 24 individual pre- and posttest items on a picture- naming task. Bragard, A. et al. (2012). Word-finding intervention for children with specific language impairment: A multiple single-case study. Language, Speech, and Hearing Services in the Schools, 43,

Proposed Statistical Analysis Strategies Adapted regression-based, HLM, and GAM models (e.g., Beretvas & Chung, 2008; Maggin, Swaminathan, Rogers, O’Keeffe, Sugai, & Horner, 2011; Rindskopf & Ferron, 2014; Shadish et al., 2014) –potential advantages, but wait and see –current limitations Nonparametric (permutation-based) analyses (e.g., Edgington & Onghena, 2007; Ferron & Levin, 2014) –potential advantages –current limitations Editorial comment: Tradeoffs between statistical analysis elegance/complexity and parsimony/comprehensibility

Proposed Statistical Analysis Strategies: References Beretvas, S. N., & Chung, H. (2008). An evaluation of modified R 2 -change effect size indices for single-subject experimental designs. Evidence-Based Communication Assessment and Intervention, 2:3, Edgington, E. S., & Onghena, P. (2007). Randomization tests (4th ed.) Boca Raton, FL: Chapman & Hall/CRC. Ferron, J. M., & Levin, J. R. (2014). Single-case permutation and randomization statistical tests: Present status, promising new developments. (pp ). In T. R. Kratochwill & J. R. Levin (Eds.), Single-case intervention research: Methodological and statistical advances. Washington, DC: American Psychological Association. Maggin, D. M., Swaminathan, H., Rogers, H. J., O’Keeffe, B. V., Sugai, G., & Horner, R. H. (2011). A generalized least squares regression approach for computing effect sizes in single-case research: Application examples. Journal of School Psychology, 49, McCleary, R., & Welsh, W. N. (1992). Philosophical and statistical foundations of time- series experiments. In T. R. Kratochwill & J. R. Levin (Eds.), Single-case research design and analysis: New developments for psychology and education (pp ). Hillsdale, NJ: Erlbaum. Rindskopf, D. M., & Ferron, J. M. (2014). Using multilevel models to analyze singe-case design data (pp ). In T. R. Kratochwill & J. R. Levin (Eds.), Single-case intervention research: Methodological and statistical advances. Washington, DC: American Psychological Association. Shadish, W. R., et al. (2014). Analyzing single-case designs: d, G, hierarchical models, Bayesian estimators, and the hopes and fears of researchers about analyses (pp ). In T. R. Kratochwill & J. R. Levin (Eds.), Single-case intervention research: Methodological and statistical advances. Washington, DC: American Psychological Association.

“True” Single-Case Applications and Classroom-Based Applications Differences in the two types of application Definitely not the same species Conceptual and methodological issues ─ potential confounders in each ─ number of time periods and number of observations per time period ─ between- and within-case attrition concerns

Potential Design Credibility Criteria for Classroom- Based Single-Case Research? Sample-Size Requirements –Number of classrooms, number of students within classrooms, number and duration of sessions/time periods Replication Requirements –Intervention replication within classrooms –Intervention replication across classrooms/schools –How many of each? Randomization Requirements –Intervention randomization within classrooms –Intervention randomization across classrooms/schools –Intervention start-point randomization –Combinations of these Control and Consideration of Extraneous Variables –Attention to teacher/classroom effects, within-school contagion effects, school selection, history, maturation, attrition, etc.

“True” Single-Case Applications and Classroom-Based Applications Statistical issues ─ consideration of the autocorrelation “elephant” in each, vis-à-vis statistical properties ─ this and other statistical-conclusion validity issues are currently being studied for different statistical- analysis strategies applied to various single-case designs