National Accessible Reading Assessment Projects DARA Research and Next Steps Cara Cahalan-Laitusis & Linda Cook Educational Testing Service.

Slides:



Advertisements
Similar presentations
Our World Begins with Reading Robert McCabe Vice President and Chief Education Officer Lexia Learning Systems, Inc.
Advertisements

Project VIABLE: Behavioral Specificity and Wording Impact on DBR Accuracy Teresa J. LeBel 1, Amy M. Briesch 1, Stephen P. Kilgus 1, T. Chris Riley-Tillman.
Value Added in CPS. What is value added? A measure of the contribution of schooling to student performance Uses statistical techniques to isolate the.
Using Growth Models to improve quality of school accountability systems October 22, 2010.
Using an iTouch to Increase Sight Word Accuracy Mary Beth Pummel, William R. Jenson, Daniel Olympia, Lora Tuesday Heathfield, Kristi Hunziker Department.
National Accessible Reading Assessment Projects Defining Reading Proficiency for Accessible Large Scale Assessments Principles and Issues Paper American.
Designing Accessible Reading Assessments National Accessible Reading Assessment Projects General Advisory Committee December 7, 2007 Overview of DARA Project.
Copyright © 2004 Educational Testing Service Listening. Learning. Leading. Using Differential Item Functioning to Analyze a State English-language Arts.
National Accessible Reading Assessment Projects Examining Background Variables of Students with Disabilities that Affect Reading Jamal Abedi, CRESST/University.
Copyright © 2006 Educational Testing Service Listening. Learning. Leading. Using Differential Item Functioning to Investigate the Impact of Accommodations.
Impact of Read Aloud on Test of Reading Comprehension Cara Cahalan-Laitusis Educational Testing Service.
National Accessible Reading Assessment Projects Defining Reading Proficiency for Accessible Large Scale Assessments Discussion of the Principles and Issues.
Partnership for Accessible Reading Assessment Sampling Issues and Accessible Assessments Christopher Johnstone National Center on Educational Outcomes.
Copyright © 2004 Educational Testing Service Listening. Learning. Leading. Using DIF to Examine the Validity and Fairness of Assessments for Students With.
Examining Differential Boost from Read Aloud on a Test of Reading Comprehension at Grades 4 and 8 Cara Cahalan-Laitusis, Linda Cook, Fred Cline, and Teresa.
PARA Project Overview, Results, Next Steps Martha Thurlow and Deborah Dillon National Accessible Reading Assessment Projects.
Partnership for Accessible Reading Assessment Analysis of Current Characteristics of State Reading Assessments Christopher Johnstone National Center on.
Listening. Learning. Leading. Using Factor Analysis to Investigate the Impact of Accommodations on the Scores of Students with Disabilities on English-Language.
Partnership for Accessible Reading Assessment Partnership for Accessible Reading Assessment (PARA) Research Martha Thurlow National Center on Educational.
Examining Differential Boost from Read Aloud on a Test of Reading Comprehension at Grades 4 and 8 Cara Cahalan-Laitusis Linda Cook Fred Cline Teresa King.
Designing Accessible Reading Assessments Examining Test Items for Differential Distractor Functioning Among Students with Learning Disabilities Kyndra.
National Accessible Reading Assessment Projects National Accessible Reading Assessment Projects General Advisory Committee December 8, 2006 Overview of.
Principal Investigators: Martha Thurlow & Deborah Dillon Introduction Assumptions & Research Questions Acknowledgments 1. What characteristics of current.
The Journey Toward Accessible Assessments Karen Barton CTB/McGraw-Hill Validity & Accommodations:
DIF Analysis Galina Larina of March, 2012 University of Ostrava.
Assessment Procedures for Counselors and Helping Professionals, 7e © 2010 Pearson Education, Inc. All rights reserved. Chapter 5 Reliability.
Reading Fluency Intervention Strategies and Techniques 1. Does repeated reading alone show students gaining at least 10% reading comprehension skills of.
General Information --- What is the purpose of the test? For what population is the designed? Is this population relevant to the people who will take your.
Designing Accessible Reading Assessments Reading Aloud Tests of Reading Review of Research from the Designing Accessible Reading Assessments Projects Cara.
Session One. Types of research articles Theoretical Empirical.
N C E O National Center on Educational Outcomes Accommodation Decisions: Policy, Training, and Monitoring as Critical Aspects of an Objective Approach.
Chapter 10 - Part 1 Factorial Experiments.
Needs Analysis Instructor: Dr. Mavis Shang
Today Concepts underlying inferential statistics
Research Methods in MIS
Designing Accessible Reading Assessments Research on Making Large Scale Assessments More Accessible for Students with Disabilities Institute of Education.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Descriptive and Causal Research Designs
Identifying the gaps in state assessment systems CCSSO Large-Scale Assessment Conference Nashville June 19, 2007 Sue Bechard Office of Inclusive Educational.
LECTURE 06B BEGINS HERE THIS IS WHERE MATERIAL FOR EXAM 3 BEGINS.
Technical Adequacy Session One Part Three.
Standardization and Test Development Nisrin Alqatarneh MSc. Occupational therapy.
National Accessible Reading Assessment Projects Research on Making Large-Scale Reading Assessments More Accessible for Students with Disabilities June.
Cara Cahalan-Laitusis Operational Data or Experimental Design? A Variety of Approaches to Examining the Validity of Test Accommodations.
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
Chapter 8 Causal-Comparative Research Gay, Mills, and Airasian
Automated Scoring is a Policy and Psychometric Decision Christina Schneider The National Center for the Improvement of Educational Assessment
Modified Achievement Tests for Students with Disabilities: Design Strategies and Experimental Results Stephen N. Elliott Vanderbilt University CCSSO’s.
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
Modifying Achievement Test Items: A Theory-Guided & Data-Based Approach Stephen N. Elliott Learning Sciences Institute and Department of Special Education.
Human Resources Research Organization (HumRRO) 66 Canal Center Plaza, Suite 700 Alexandria, Virginia | Phone: | Fax:
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 12 Making Sense of Advanced Statistical.
Designing Accessible Reading Assessments A Multistage Adaptive and Accessible Reading Assessment for Accountability Cara Cahalan Laitusis ETS.
Assessing Learners with Special Needs: An Applied Approach, 6e © 2009 Pearson Education, Inc. All rights reserved. Chapter 5: Introduction to Norm- Referenced.
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
Designing Accessible Reading Assessments Putting Accessibility to the Test: Design of a Multistage Reading Assessment for Accountability Cara Cahalan Laitusis.
Nurhayati, M.Pd Indraprasta University Jakarta.  Validity : Does it measure what it is supposed to measure?  Reliability: How the representative is.
Study of Device Comparability within the PARCC Field Test.
Reliability performance on language tests is also affected by factors other than communicative language ability. (1) test method facets They are systematic.
Chapter Eight: Quantitative Methods
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
DECISION-MAKING FOR RESULTS HSES- Data Team Training.
Inferential Statistics Psych 231: Research Methods in Psychology.
Validity and Reliability
Reliability & Validity
Making Sense of Advanced Statistical Procedures in Research Articles
Experimental Design.
Chapter Eight: Quantitative Methods
CAASPP Results 2015 to 2016 Santa Clara Assessment and Accountability Network May 26, 2017 Eric E, Zilbert Administrator, Psychometrics, Evaluation.
Presentation transcript:

National Accessible Reading Assessment Projects DARA Research and Next Steps Cara Cahalan-Laitusis & Linda Cook Educational Testing Service

National Accessible Reading Assessment Projects Presentation Experimental Study of Read Aloud Psychometric Research Research Plans for Year 3 –Psychometric analysis of experimental data –Tailored Test Design –Cognitive labs –IEP Decision Making for read aloud

National Accessible Reading Assessment Projects Differential Boost from Read Aloud (Non-disabled vs. RLD) 1.Is there a Differential Boost from read aloud? 2.How well do test scores (standard, audio, and fluency) predict variance in teacher ratings of reading comprehension? 3.Are teachers able to predict which students will benefit from read aloud?

National Accessible Reading Assessment Projects Prior Research No Differential Boost –Kosciokek & Ysseldyke (2000)- Small sample size (n=31) –Meloy, Deville, and Frisbie (2002) – Between subjects design (n=260, 76% non-disabled, randomly assigned to audio or standard) –McKevitt & Elliott (2003)-Small sample size (n=39) Differential Boost –Crawford and Tindal (2004)-(n=338, 78% non-disabled) –Fletcher, et. al (2006)-Between subjects design (randomly assigned to audio or standard). Sample included 91 Dyslexic (poor decoder) and 91 average decoders

National Accessible Reading Assessment Projects Data Collected GMRT Forms S and T –Extra Time –Extra Time with Read Aloud via CD 2 Fluency Measures –WJ Reading Fluency –Test of Silent Word Reading Fluency 2 Decoding Measures (4 th grade only) –WJ Letter Word ID –WJ Word Recognition Demographic and Survey Data

National Accessible Reading Assessment Projects Sample th Graders –522 Students with RLD –648 Students without a disability th Graders –394 Students with RLD –461 Students without a disability

National Accessible Reading Assessment Projects Design Group Session 1Session 2 FormAccommodationFormAccommodation 1SStandardTAudio 2S TStandard 3T SAudio 4T SStandard

National Accessible Reading Assessment Projects Means for Grade 4 Non-LDRLD Test/ConditionNMeanSDNMeanSD WJ Letter Word ID WJ Word Attack TOSWRF WJ Fluency Audio Standard Boost

National Accessible Reading Assessment Projects Means for Grade 8 Non-LDRLD Test/ConditionNMeanSDNMeanSD TOSWRF WJ Fluency Audio Standard Boost

National Accessible Reading Assessment Projects Scores by RLD and Grade

National Accessible Reading Assessment Projects 1.Is there a Differential Boost from read aloud? Repeated Measures ANOVA and ANCOVA Dependent Variables: –GMRT Standard –GMRT Audio Independent Variables: –Disability Status (RLD vs. NLD) –Form/Order (STSA, STAS, TSSA, TSAS) Covariate: Decoding and Fluency Measures

National Accessible Reading Assessment Projects ANOVA Findings Yes, students with reading-based learning disabilities have larger gains (on average) from read aloud than students without disabilities –Finding consistent at both grades 4 and 8, but boost is larger at grade 4 –Controlling for Decoding and/or Fluency as a covariate did not alter findings

National Accessible Reading Assessment Projects Multiple regression analyses to determine how much variance in teachers rating of reading comprehension (5-point scale) were predicted by three test scores: –Standard –Audio –Fluency 2.How well do test scores predict reading comprehension?

National Accessible Reading Assessment Projects Regression Findings Audio score does not significantly predict variance in Teacher Ratings of Reading Comprehension (beyond standard and fluency) for Grade 8 RLD Audio score adds to prediction of reading comprehension (beyond standard and fluency scores) for three groups (NLD grade 4, NLD grade 8, and RLD grade 4), but incremental change is small

National Accessible Reading Assessment Projects Analyses: Analysis of variance in boost by teacher predictions Cross-tabulations of teacher ratings by degree of boost (more than on SEM, less than one SEM, neither) 3.Are teachers able to predict which students will benefit from read aloud?

National Accessible Reading Assessment Projects Accuracy of Teacher Prediction For this study each student took a reading comprehension test that was read aloud by a CD player and another reading comprehension test that they read to themselves. Which test do you predict the student did better on? Test read aloud by CD player Test the student read to themselves No difference

National Accessible Reading Assessment Projects Findings from Teacher Predictions ANOVA indicated that on average teachers were able to predict score gain from audio at grade 4 but not grade 8 At the individual level teachers accurately predicted if a student would benefit from the audio version about 35% of the time and were completely wrong about 5% of the time

National Accessible Reading Assessment Projects DARA Psychometric Research Purpose of psychometric research: To help us understand how an examinee's disability or the accommodations he or she receives impacts the psychometric properties of a reading test

National Accessible Reading Assessment Projects Results of This Years Psychometric Analyses Psychometric Analyses –Factor analyses –Differential item functioning analyses Populations –Students with learning disabilities who took the test with and without accommodations Test –Grade 4 and grade 8 English-language arts (ELA) assessment Focus –Determine if the test measures the same constructs for Examinees without disabilities Examinees with disabilities who took the test with and without accommodations

National Accessible Reading Assessment Projects

STAR ELA Grade 4 and Grade 8 Summary Statistics Group Grade 4 Total GroupsGrade 8 Total Groups NMeanSDNMeanSD (1) Students without disabilities 298, , (2) LD, without accommodations 9, , (3) LD, 504/IEP accommodations 4, , (4) LD, read- aloud accommodation 1,

National Accessible Reading Assessment Projects Factor Analyses of ELA Assessment Exploratory analyses (separately in each group) – how many factors Confirmatory (multi-group) –Establish base-line model –Confirm number of factors needed to describe data across all groups

National Accessible Reading Assessment Projects Differential Item Functioning (DIF) Analyses The purpose of this study was to examine differential item functioning on the same English- Language Arts assessment that was used for the factor analyses DIF is a statistical observation that involves matching test takers from different groups on the characteristic measured [by the test] and then looking at performance differences on an item. (Sireci, 2006)

National Accessible Reading Assessment Projects Method Mantel-Haenszel Categorization3 Levels –A Negligible DIF –B Slight to Moderate DIF –C Moderate to Large DIF Directions of DIF Flags - Favors reference group + Favors focal group

National Accessible Reading Assessment Projects Comparisons Made in the Study

National Accessible Reading Assessment Projects DIF Categories ELA Grade 4 LD Without Accommodations Easy Difficult Favors Students Without Disabilities Favors LD Students

National Accessible Reading Assessment Projects DIF Categories ELA Grade 4 LD With Accommodations (IEP/504) Easy Difficult Favors Students Without Disabilities Favors LD (IEP/504)

National Accessible Reading Assessment Projects DIF Categories ELA Grade 4 LD With Accommodations (Read-Aloud) Easy Difficult Favors Students Without Disabilities Favors LD (Read-Aloud)

National Accessible Reading Assessment Projects DIF Categories ELA Grade 8 LD Without Accommodations Easy Difficult Favors Students Without Disabilities Favors LD Students

National Accessible Reading Assessment Projects DIF Categories ELA Grade 8 LD With Accommodations (IEP/504) Easy Difficult Favors Students Without Disabilities Favors LD (IEP/504)

National Accessible Reading Assessment Projects DIF Categories ELA Grade 8 LD With Accommodations (Read-Aloud) Easy Difficult Favors Students Without Disabilities Favors LD (Read-Aloud)

National Accessible Reading Assessment Projects Interpreting the Results of the DIF Study Grade 4 –1 C DIF item, 8 B DIF items Grade 8 –1 C DIF item, 6 B DIF items Majority of flagged items were reading items that favored students who took test with read-aloud accommodation Consistent with Factor Analysis Results

National Accessible Reading Assessment Projects Next Steps Psychometric Research Examination of Tailored Testing Cognitive Labs IEP decision making

National Accessible Reading Assessment Projects Psychometric Research

National Accessible Reading Assessment Projects Plans for Next Years Psychometric Analyses Psychometric analyses –Factor Analyses Differential item functioning analyses Populations –Students with learning disabilities who took the test with and without an audio accommodation Test –Gates-McGinitie Reading Test Focus –Aid in interpretation of results of differential boost study –Increase understanding of impact of disability and audio accommodation on reading test scores

National Accessible Reading Assessment Projects Factor Analyses We Plan to Carry Out Aid in interpretation of results of differential boost study –Compare factor structures for students without disabilities who took test with and without accommodation –Compare factor structures for students with disabilities who took test with and without accommodation

National Accessible Reading Assessment Projects Factor Analyses We Plan to Carry Out Increase understanding of impact of disability and accommodation on reading test scores –Compare factor structures of test given to examinees with and without disabilities under standard conditions –Compare factor structure of test given to examinees with disabilities who take test with accommodations and examinees without disabilities who take test without accommodations

National Accessible Reading Assessment Projects Purpose of Doing DIF and DDF Analyses on Data From the Differential Boost Study Aid in interpretation of results of differential boost study Increase understanding of impact of disability and accommodation on reading test scores

National Accessible Reading Assessment Projects Possible Comparisons for DIF Analyses

National Accessible Reading Assessment Projects Procedures for Analyzing Data Differential Item Functioning: Mantel- Haenszel Differential Distractor Analysis: Standardized Distractor Analysis

National Accessible Reading Assessment Projects Two Staged Tailored Testing

National Accessible Reading Assessment Projects Operational Data GradeDisabilityFormat Percent Below Chance Test TakersItems 4LD Audio Other Standard None Standard LD Audio Other Standard None Standard1.50.0

National Accessible Reading Assessment Projects GMRT Data Percent Below Chance GradeDisabilityFormat Test Takers Items 4RLD Audio Standard None Audio Standard RLD Audio Standard None Audio Standard1.1

National Accessible Reading Assessment Projects DARA Tailored Testing Model Two (or three) stages of testing Students subtests on stage 2 are determined by performance on routing test administered in stage 1 Ideally computer administered but can be paper administered Some parts could be individually administered (e.g., decoding) if only a few students are routed into a decoding measure and this format reduces the number of students receiving individualize testing accommodations (e.g., read aloud by human)

National Accessible Reading Assessment Projects Reading Comprehension Routing Test Reading Fluency Decoding and Extended Comprehension Test with Audio Extended Comprehension Test with Audio Extended Reading Comprehension Test

National Accessible Reading Assessment Projects Advantages of Model Score is more reliable estimate since items are targeted to students ability level Students may feel less frustrated if they can do some of the items on the routing test Teacher receives more information on low performing students strengths and weaknesses Fundamental Skills and Comprehension are not confounded for students with poor fundamental skills (some LD) or poor comprehension (some LD and ELL) Growth can be more accurately measured in students working significantly below (or above) grade level

National Accessible Reading Assessment Projects Disadvantages of Model Requires computer administration or teacher scoring of items after stage 1 Students who are routed to fluency test may be embarrassed Routing decision is made before test is scaled or standard setting is completed Design could route more that 2% of students to modified test

National Accessible Reading Assessment Projects Questions for Year 3 How many items (and of what difficulty) are needed for an accurate routing test? Can we equate the audio extended and standard extended using the routing test? What portion of students would be routed to fluency measure and what portion would be routed to decoding?

National Accessible Reading Assessment Projects Questions for Year 3 (continued) Are the 2 alternate routes highly correlated with the standard administration? What is the impact audio, fluency, and decoding scores on total test score. –If student is not a fluent reader should the total test score be non-proficient? Is the routing test accurate for all students? –Do some students do better on hard items? –Do some students having trouble with the first few items on the test?

National Accessible Reading Assessment Projects Questions for Year 3 (continued) How should we weight different measures and what impact will this have on subpopulations? Could we compose a tailored test from a states current operational item pool? –If not how many additional items would be required and at what difficulty level?

National Accessible Reading Assessment Projects Cognitive Labs

National Accessible Reading Assessment Projects Background Cognitive labs using the think aloud method on reading comprehension questions Build off the findings of last years large scale differential boost study –Gates MacGinitie Reading Comprehension Test –Use items found in preliminary findings of the DIF analysis of the GMRT data

National Accessible Reading Assessment Projects Cognitive Labs Advantages Beneficial to learn about components of mental processes of reading (Afflerbach & Johnston, 1984, Alavi, 2005; Pressley & Afflerbach, 1995) Beneficial in the development of assessments (Caspar, Lessler, & Willis, 1999; Desimone & LeFloch, 2004; Willis, 2005) Open flexible procedure can be catered to the specific situation and activity (Davison, Vogel & Coffman, 1997) May use a small sample size Procedure has been successfully conducted with children as young as 3 rd grade (Laing & Kamhi, 2002; Paulsen & Levine, 1999; Trambasso & Magliano, 1996)

National Accessible Reading Assessment Projects Cognitive Labs Disadvantages Thinking aloud is an unnatural step which may affect or interfere ones normal mental processes Students with disabilities may have difficulty with the procedure (Johnstone, Miller, & Thompson, in press) Responses have the potential to be incomplete or incorrect –Lack of desire/motivation –Embarrassment –Inability to understand the task

National Accessible Reading Assessment Projects Purpose of Study This study is being conducted to serve the following purposes: 1.How do students with and without reading-based learning disabilities differ as they approach a reading comprehension assessment? 2.Is this type of information gathering and data quality worthwhile to conduct in future large scale studies considering: –Age of students –Students with disabilities

National Accessible Reading Assessment Projects Research Questions 1.In what way do students with reading- based disabilities respond differently to reading comprehension questions compared to students without disabilities? 2.What errors occur while reading the passage/reading the items?

National Accessible Reading Assessment Projects IEP Decision Making

National Accessible Reading Assessment Projects IEP Decision Making What factors contribute to boost? –Low standard score –WJ Reading Measures –Teacher Predictions –Student Preference

National Accessible Reading Assessment Projects Analyses planned Regression analyses to predict boost for RLD students using –WJ scores –Standard score –Use of read aloud in class or on tests –Teacher predictions –NJ ASK from prior year