Reliability in Testing Is the test or assessment tool consistent and dependable? Student-related reliability Rater reliability Test administration reliability.

Presentation transcript:

Reliability in Testing
Is the test or assessment tool consistent and dependable?
- Student-related reliability
- Rater reliability
- Test administration reliability
- Test reliability
(Brown, 2004, pp. 20-22)

Student-related reliability
- Give students consistent materials and time for test preparation
- Allow plenty of time for studying
- Keep test times and conditions consistent (for example, always on Wednesdays)

Rater reliability
For subjective scoring in high-stakes tests:
- Have more than one rater
- Use a rubric and hold training/norming sessions
- Have periodic outside oversight
- Keep tests anonymous
For low-stakes tests (quizzes):
- Use a rubric
- Read through all tests before rating
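When more than one rater scores the same scripts, one way to check how consistently they apply the rubric is to compare their scores directly. The sketch below is only an illustration, not part of the original slides: it computes the share of exact matches and Cohen's kappa (a standard chance-corrected agreement statistic) for two raters. The rubric bands and the ten scores per rater are hypothetical.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Share of scripts on which both raters gave the same rubric band."""
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)


def cohens_kappa(rater_a, rater_b):
    """Observed agreement corrected for the agreement expected by chance."""
    n = len(rater_a)
    observed = percent_agreement(rater_a, rater_b)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement: probability that both raters pick the same band
    # if each assigned bands independently at their own observed rates.
    expected = sum(counts_a[band] * counts_b[band] for band in counts_a) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical rubric bands (1-4) given by two raters to ten anonymous scripts.
rater_a = [3, 2, 4, 3, 1, 2, 3, 4, 2, 3]
rater_b = [3, 2, 3, 3, 1, 2, 4, 4, 2, 2]

agreement = percent_agreement(rater_a, rater_b)
kappa = cohens_kappa(rater_a, rater_b)
print(f"Exact agreement: {agreement:.2f}")
print(f"Cohen's kappa:   {kappa:.2f}")
```

A low value after a norming session would suggest the rubric descriptors need to be discussed again before live scoring; what counts as "low" is a judgment call for the program, not something this sketch decides.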

Test administration reliability
Consistent rules for all classes and teachers:
- Dictionary use
- Notes/books
- Strict time limits

How to measure test reliability?
One way is the test-retest method:
- The same group of students takes the same test twice. (Drawbacks: motivation and washback.)
What else could we do to test whether the same students (or very similar students) score the same on the same test twice?

Split half method
- Split the test into two halves that are equal in tasks and difficulty.
- Then measure the scores for each half.
- This is a good way to look at reliability, but it requires that the two halves really are equal.

Split half method: Your task
What can you tell about the reliability for the following split sections on my test?

Student     Section 1   Section 2
Joanna      88          95
Jenny       90          87
Asuka       97          95
David       92          88
Annick      74          78
Mercedes    84          89
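One possible way to approach the task is to correlate the two sets of section scores. The sketch below is an illustration rather than the presenter's own answer. It assumes each four-digit number in the task is a pair of two-digit section scores, computes the Pearson correlation between the halves, and then applies the Spearman-Brown formula, a standard adjustment (not mentioned in the slides) that estimates reliability for the full-length test from the half-test correlation.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equally long lists of scores."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def spearman_brown(r_half):
    """Estimate full-test reliability from the correlation between two halves."""
    return 2 * r_half / (1 + r_half)


# Section scores from the task, read as (section 1, section 2) pairs.
# Splitting each number into two two-digit scores is an assumption.
scores = {
    "Joanna": (88, 95),
    "Jenny": (90, 87),
    "Asuka": (97, 95),
    "David": (92, 88),
    "Annick": (74, 78),
    "Mercedes": (84, 89),
}

section_1 = [pair[0] for pair in scores.values()]
section_2 = [pair[1] for pair in scores.values()]

r_half = pearson_r(section_1, section_2)
r_full = spearman_brown(r_half)
print(f"Correlation between halves: {r_half:.2f}")
print(f"Spearman-Brown estimate:    {r_full:.2f}")
```

With these six students the correlation between the halves comes out at roughly 0.80 and the adjusted estimate at roughly 0.89. Six students is a very small sample, so the figures are best read as a rough indication; they also say nothing about whether the two sections were truly equal in tasks and difficulty.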

Checklist for Reliability (1)
- Have many independent items
- Delete items that do not discriminate between weaker and stronger students
- Limit choices: restrict student responses
- Keep items clear and unambiguous
- Provide clear instructions and examples
- Keep type large enough and cleanly copied
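The second checklist point, finding items that do not separate weaker from stronger students, can be illustrated with a simple upper-lower discrimination index: the proportion of the top-scoring group who answered an item correctly minus the proportion of the bottom-scoring group who did. This is one common approach, offered as a sketch and not as the method the slides prescribe. The response data, the one-third group size, and the 0.2 cut-off are all hypothetical choices.

```python
def discrimination_index(responses, item, group_fraction=1 / 3):
    """Upper-lower discrimination index for one item.

    responses: one list per student, with 1 for a correct answer and 0 otherwise.
    item: zero-based position of the item within each student's list.
    """
    ranked = sorted(responses, key=sum, reverse=True)  # strongest students first
    k = max(1, round(len(ranked) * group_fraction))    # size of each comparison group
    upper, lower = ranked[:k], ranked[-k:]
    p_upper = sum(student[item] for student in upper) / k
    p_lower = sum(student[item] for student in lower) / k
    return p_upper - p_lower


# Hypothetical results for six students on a five-item quiz.
responses = [
    [1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0],
    [1, 1, 0, 1, 0],
    [1, 0, 1, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 0, 0, 1, 0],
]

indices = [discrimination_index(responses, i) for i in range(5)]

# Hypothetical cut-off: flag items whose index falls below 0.2 for review.
flagged = [i for i, d in enumerate(indices) if d < 0.2]
print("Discrimination indices:", indices)
print("Items to review (zero-based):", flagged)
```

In this made-up data the first item is answered correctly by everyone and the last item is answered equally often by strong and weak students, so both are flagged. A flagged item is a candidate for review, not automatically for deletion: an item everyone gets right may still be worth keeping for content coverage.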

Checklist for Reliability (2)
- Provide practice with testing format
- Keep administration uniform
- Have clear answers for all items (as far as possible)
- Provide detailed answer key
- Train raters as necessary
- Determine acceptable responses before scoring starts
- Keep test-takers anonymous
John Bunting (2004), presentation in the course Testing, Assessment and Teaching: A Program for EFL Teachers at UABC. Facultad de Idiomas, UABC.