Technology Enhanced Items — Signal or Noise?

Presentation transcript:

Technology Enhanced Items — Signal or Noise?  Wednesday, June 28, 2017 | 8:30 am–10:00 am JW Grand Ballroom | Salon 2 (JW Marriott Austin)

Presenters
Catherine Welch, Univ of Iowa/ITP
Jon S. Twing, Pearson
Wayne Camara, ACT
Stephen T. Murphy, Measured Progress
Discussant: Joyce Zurkowski, CDOE
Moderator: Douglas F. Becker, HMH

TEIs in RFPs
The vendor’s response must reflect familiarity with computer-based testing and the use of a variety of item types, including technology enhanced items (TEI) to assess students’ higher order cognitive skills as well as their knowledge of core ideas and concepts. (RFP 2017-073 DOE New Hampshire Statewide Assessments)
STATE desires use of multiple item types and technology enhanced items to capitalize on efficiency while ensuring that NDSA assessments are aligned to the full breadth, depth, and cognitive complexity of STATE’s content standards. (North Dakota State Assessment RFP 201-2017-175)
Technology: Assessments will be delivered primarily online, include effective technology-enhanced items, and a facile, intuitive test management system. (Nebraska Statewide Assessments RFI11162016)

Previous Studies
Wan & Henly (AME 2012): TEIs and MCs in a statewide science achievement test. Item types: Figural Response, Constructed Response.
Crabtree & Welch (NCME 2016): TEIs and MCs in an end-of-course Algebra exam.
Bukhari, Boughton, & Kim (NCME 2016): TEIs and MCs in grade 8 math and grade 7 ELA in PARCC and SBAC. Item types: Matching, Equation and Expression Entry, Select and Order, Multiple Correct Response; Multiple Correct Response, Evidence-Based Selected Response.

Previous Studies
Statistics of interest (all IRT based): item characteristic curve (ICC), average item information, information efficiency.
Conclusions:
TEIs are more difficult than MCs.
TEIs are more discriminating than MCs.
TEIs provide more information at the high ability range.
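To make the three conclusions concrete, here is a minimal sketch of an item characteristic curve and item information function. It assumes a two-parameter logistic (2PL) model, which the cited studies may or may not have used, and the item parameters are hypothetical values chosen only so that the TEI is harder (larger b) and more discriminating (larger a) than the MC item.

```python
import math

def icc_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model (assumed here)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = icc_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

# Hypothetical parameters, not taken from any of the studies.
mc_item = {"a": 0.8, "b": -0.5}   # easier, less discriminating
tei_item = {"a": 1.4, "b": 1.0}   # harder, more discriminating

for theta in (-2.0, 0.0, 2.0):
    info_mc = item_information_2pl(theta, **mc_item)
    info_tei = item_information_2pl(theta, **tei_item)
    print(f"theta={theta:+.1f}  MC info={info_mc:.3f}  TEI info={info_tei:.3f}")
```

With these illustrative parameters the TEI carries more information than the MC item at high ability and less at low ability, which is the pattern the studies describe.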

TEIs at other conferences
Tech-Enhanced Items: How Can They Provide Better Measurement?
This presentation will critically evaluate a number of TEI formats and how they might be scored, from both a classical and item response theory (IRT) perspective. This discussion will be couched in the consideration of actual data delivered in K-12 educational assessments. One notable aspect is that substantial information can be reaped from incorrect responses, which are generally not considered except for advanced IRT models like partial credit or nominal response models. Even these do not always consider all available data. For example, a straightforward multiple-response item with six possible options and two correct answers might be scored as 0-1-2 points, which is better than simply 0-1, but not considering which of the four incorrect options were selected ignores any possible information that might be present there. Of course, this typically happens with multiple-choice items too, as the nominal response model is virtually never used in practice. In other cases, such as PARCC two-part items, IRT modeling frankly falls apart.
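The six-option, two-key example in the abstract can be sketched as follows. The option labels, the answer key, and the scoring rule (one point per keyed option selected, with at most two selections allowed) are assumptions for illustration; the abstract does not say how the 0-1-2 score is assigned.

```python
OPTIONS = ("A", "B", "C", "D", "E", "F")   # hypothetical option labels
KEY = frozenset({"B", "E"})                # hypothetical answer key

def _validate(selected):
    """Return the response as a set, rejecting unknown or excess selections."""
    chosen = set(selected)
    if not chosen <= set(OPTIONS):
        raise ValueError("unknown option selected")
    if len(chosen) > len(KEY):
        raise ValueError("at most two options may be selected")
    return chosen

def score_dichotomous(selected):
    """0-1 scoring: full credit only when the response matches the key exactly."""
    return 1 if _validate(selected) == KEY else 0

def score_partial(selected):
    """0-1-2 scoring: one point per keyed option selected (assumed rule)."""
    return len(_validate(selected) & KEY)

def distractors_chosen(selected):
    """The incorrect options selected, which both scoring rules above discard."""
    return sorted(_validate(selected) - KEY)

response = ["B", "C"]
print(score_dichotomous(response), score_partial(response), distractors_chosen(response))
```

The last function returns exactly the data the abstract says is ignored: two examinees who both earn 1 point may have picked different distractors.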

TEIs at other conferences
Investigation of Psychometric Properties of Technology-Enhanced Items Considering Content Characteristics
Rong Jin, Stephen Murphy, Sid Sharairi -- Houghton Mifflin Harcourt Publishing
Recent research explored technology-enhanced (TE) items only by subject. This study dives into DOK, master domain, and grade level to explore over six hundred TE items in twelve formats from a K-8 large-scale mathematics assessment, and compares the measurement properties between multiple-choice and TE items, and between TE formats.
Presented in a session mostly dealing with CAT and stopping rules.

Domain and Item Type
Mean: TEI > MC in each domain. The range for TEI and MC is grade related. Low: CC, OA, NBT, MD (grades K-5). High: EE, NS, SP, RP (grade 6+).
Standard deviation: MC < TEI in each domain. Large (40+) for TEI in domains G, NS, SP, and especially EE.
Generally speaking, the average median response time of TE items is higher than that of MC items in every domain, especially in domain EE. Domains associated with grade 6 and above (EE, NS, SP, RP) usually have high TE response times. Domains covering grades K-5 (CC for kindergarten; OA, NBT, MD for K-5) usually have low TE response times. The standard deviation of response time is usually higher for TE than for MC in most domains, especially in domains EE, G, NS, SP, and RP (grade 6 and above).

Conclusions
MC vs. TEI
On average, TEIs are more difficult, more discriminating, and more time consuming than MC items in each DOK level or domain.
On average, as DOK increases, both MC items and TEIs become more challenging, less discriminating, and more time consuming, but to different extents.
On average, both MC items and TEIs in domains available in grade 6+ are more challenging and more time consuming than those in domains in grades K-5.
TEI Types
Easy: Number Line, Ordering, and Matching.
Hard: Multiple Select, Matrix, Graph, Enter Math.
Time consuming: Enter Math and Graph.
