Psychometric Issues in the Use of Testing Accommodations Chapter 4 David Goh.

Testing Accommodations and Psychometric Soundness
Federal and state laws require that nonbiased procedures be used in assessment.
But does using testing accommodations call the technical soundness of the tests into question?
– Reliability: accuracy, consistency, and stability of scores
– Validity: effectiveness

Standardization
Standardization implies uniformity of procedures in administering a test and scoring the results.
This poses a problem for diverse learners whose physical, sensory, linguistic, cultural, or psychological characteristics differ from those of the general population.
In theory, all accommodations compromise standardization, so their effects on validity and reliability need to be examined.

Reliability
Refers to the consistency and stability of scores.
High reliability produces consistent results; low reliability produces inconsistent results.
Example: when ELLs use a bilingual interpreter, the interpretations may vary.
Accommodations and modifications may increase measurement error and decrease reliability.
Methods of examining reliability include comparing the scores of the same test takers on different administrations, with different sets of items, or with different scorers or examiners.
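The first method listed above, comparing the same test takers across two administrations, is commonly summarized as a correlation between the two sets of scores. The following is a minimal sketch of that idea, not a procedure taken from the chapter: the function name `pearson_r` and all score values are hypothetical and chosen only for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares around each mean.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("scores must vary within each administration")
    return cov / sqrt(ss_x * ss_y)


# Hypothetical scores for the same five test takers on two administrations.
first_administration = [10, 12, 15, 18, 20]
second_administration = [11, 13, 14, 19, 21]

# A value near 1 suggests the two administrations rank test takers similarly.
test_retest_r = pearson_r(first_administration, second_administration)
print(round(test_retest_r, 3))
```

The same function could be applied to scores from two sets of items or from two scorers; only the pairing of the score lists changes.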

Validity
Refers to effectiveness: does the test measure what it is intended to measure?
Used appropriately, accommodations can increase validity.
Used inappropriately, accommodations may invalidate the results.
Types of validity evidence: test content, response processes, internal structure, relations to other variables, and consequences of testing.

Effect of Accommodations
There is limited evidence on the effect of test accommodations on reliability and validity.
Studies have shown that testing accommodations on the SAT and GRE did not significantly affect reliability and validity.
– However, these findings are not generalizable.
– Some accommodations, such as extended time, have been found to increase student performance.

Flagging Test Scores
A very controversial issue on large-scale tests.
– Flagging reveals that the examinee has a disability, prompts invalid inferences about the score and prejudice, and identifies persons with disabilities in violation of their legally protected privacy.
If comparability can be demonstrated between standard and nonstandard administrations, there is no need to flag.
– However, if test results are not comparable, the testing accommodations need to be noted and explained.
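One very rough first look at comparability is the standardized mean difference between scores from standard and nonstandard administrations. The sketch below is an illustration under stated assumptions, not the chapter's method: the function name and the score lists are hypothetical. A small difference alone would not demonstrate comparability, since a fuller study would also examine reliability, internal structure, and item-level functioning.

```python
from math import sqrt
from statistics import mean, variance


def standardized_mean_difference(group_a, group_b):
    """Difference in group means divided by the pooled standard deviation."""
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least 2 scores")
    # Pool the sample variances, weighting by degrees of freedom.
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    if pooled_var == 0:
        raise ValueError("scores must vary in at least one group")
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)


# Hypothetical scores, invented for illustration only.
standard_scores = [48, 52, 55, 60, 65]
accommodated_scores = [47, 53, 54, 61, 63]

d = standardized_mean_difference(standard_scores, accommodated_scores)
print(round(d, 3))
```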