Deborah Schnipke, PhD Virtual Psychometrics

Rasch vs. IRT

Why this talk?
  Make people aware of the controversy
  Present the Rasch point of view
  Clear up misconceptions

Disclaimers
  I put this talk together today
  I'm used to Rasch vs. 3PL
    There the difference is more extreme (the 3rd parameter really messes up measurement properties, compared to Rasch)
  I speak Ohio-style (even faster than New York)

The Controversy
  Different philosophy/different point of view
  Difference is very esoteric/theoretical
    Hard to make this point in an article that is about something else
  Raschies:
    Very convinced of their position (can be hard to talk to)
    Use different notation (can be confusing)
    Small minority
    Larger following in Europe historically, but this is changing (IRT becoming more common in Europe too)
  IRTers:
    Very convinced of their position
    Majority of researchers/practitioners (in US, except for Chicago)

Rasch ≠ 1PL IRT
  Same model, different mindset
  Theory based, rather than data based
  Different set of diagnostic tools (different way of viewing data/results)
    Item maps
    Fit statistics that work
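For reference, the textbook logistic forms behind these labels can be written as below. The symbols are conventional IRT notation assumed here, not taken from the slides: θ_n is the ability of person n, b_i the difficulty of item i, a_i the item discrimination, and c_i the lower asymptote ("guessing") parameter. Rasch software often uses different letters for the same quantities.

```latex
\begin{align}
\text{Rasch / 1PL:}\quad P(X_{ni}=1) &= \frac{\exp(\theta_n - b_i)}{1 + \exp(\theta_n - b_i)} \\
\text{2PL:}\quad P(X_{ni}=1) &= \frac{\exp\bigl(a_i(\theta_n - b_i)\bigr)}{1 + \exp\bigl(a_i(\theta_n - b_i)\bigr)} \\
\text{3PL:}\quad P(X_{ni}=1) &= c_i + (1 - c_i)\,\frac{\exp\bigl(a_i(\theta_n - b_i)\bigr)}{1 + \exp\bigl(a_i(\theta_n - b_i)\bigr)}
\end{align}
```

Setting a_i = 1 and c_i = 0 for every item reduces the 3PL form to the first line, which is the sense in which Rasch and 1PL are the "same model".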

IRT Mindset
  Want to measure the items as well as possible
  Use as many parameters as needed

Rasch Mindset
  Want to create a measurement scale that has interval-level properties (not "more interval-level")
  Use the model that leads to desirable measurement properties
  Adding a discrimination parameter breaks some of the scale properties
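One scale property that a discrimination parameter can break is the invariant ordering of items: with equal slopes, the easier of two items stays easier for every person, while unequal slopes let the item curves cross. The following is a minimal sketch of that effect, not the speaker's own example; the item values and the helper names `p_correct` and `easier_item` are hypothetical.

```python
import math


def p_correct(theta, b, a=1.0):
    """Probability of a correct response under a logistic item model.

    a = 1.0 gives the Rasch form; any other value gives a 2PL-style item.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def easier_item(theta, item1, item2):
    """Return 0 or 1: which item has the higher success probability at theta."""
    p1 = p_correct(theta, *item1)
    p2 = p_correct(theta, *item2)
    return 0 if p1 > p2 else 1


# Hypothetical items given as (difficulty b, discrimination a).
rasch_items = [(-0.5, 1.0), (0.5, 1.0)]   # equal slopes
twopl_items = [(-0.5, 0.5), (0.5, 2.0)]   # unequal slopes

thetas = [-3.0, -1.0, 0.0, 1.0, 3.0]
rasch_order = [easier_item(t, *rasch_items) for t in thetas]
twopl_order = [easier_item(t, *twopl_items) for t in thetas]

print("Rasch: easier item at each theta:", rasch_order)
print("2PL:   easier item at each theta:", twopl_order)
```

With these assumed values, the Rasch pair keeps the same ordering at every ability level, while the 2PL pair switches order between low and high ability, so "which item is harder" no longer has a single answer.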

Misconceptions
  Only reason to use Rasch is that the math is easier
  Rasch is not as good because it doesn't have as many parameters
  Rasch is only a small step away from raw scores
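One way to read the third misconception: under the Rasch model the raw score determines the person measure, but the conversion from raw score to logits is not linear, so equal raw-score gaps do not mean equal gaps on the measurement scale. The sketch below illustrates this under stated assumptions: the item difficulties are made up and treated as known, and `theta_for_raw_score` is a hypothetical helper that solves the Rasch likelihood equation (expected score equals raw score) by bisection.

```python
import math


def expected_score(theta, difficulties):
    """Expected raw score at ability theta under the Rasch model."""
    return sum(1.0 / (1.0 + math.exp(-(theta - b))) for b in difficulties)


def theta_for_raw_score(raw, difficulties, lo=-10.0, hi=10.0, tol=1e-9):
    """Find theta whose expected score equals raw (non-extreme scores only)."""
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if expected_score(mid, difficulties) < raw:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Hypothetical, symmetric item difficulties in logits (assumed known).
difficulties = [-2.0, -1.0, -0.5, 0.0, 0.0, 0.5, 1.0, 2.0]

# Scores of 0 and 8 have no finite estimate, so only 1..7 are converted.
measures = {r: theta_for_raw_score(r, difficulties)
            for r in range(1, len(difficulties))}

gap_low = measures[2] - measures[1]   # one raw-score point near the bottom
gap_mid = measures[5] - measures[4]   # one raw-score point near the middle

for r, m in measures.items():
    print(f"raw score {r}: {m:+.2f} logits")
print(f"gap 1->2: {gap_low:.2f} logits, gap 4->5: {gap_mid:.2f} logits")
```

In this toy test, one extra raw-score point near the bottom of the scale corresponds to a larger change in logits than one extra point near the middle, which is the stretching of the score scale that raw scores alone do not show.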

Body Size Example
  IRT approach: measure everything that seems relevant
    E.g., weight, height, density, water displacement, etc.
  Rasch approach: define exactly what you mean by size & measure that (1 variable)

Orange Juice Example
  Example: want to make 16 oz of juice from oranges
  Classical test theory: count the oranges
  IRT: describe oranges as well as possible & summarize with theta
    Size/diameter, juiciness, weight
  Rasch: measure what will lead to results you want (weight of oranges)

Which is Better? (OJ example)
  Raw scores: number of oranges
  Rasch: weight of oranges
  IRT: size? of oranges (juice-producing-ness?)
  To make 16 oz of juice, need
    5-12 oranges, usually, maybe more, maybe less
      Classical test theory (raw score) approach
    4 lbs of oranges
      Rasch: direct interpretation
    IRT: not quite sure what the interpretation is
      Scoring is done by an algorithm

Clinical Example
  Paper: "Use of Rasch person-item map in exploratory data analysis: A clinical perspective" from Journal of Rehabilitation Research & Development
  Study of visual impairment
  Rate 48 daily activities ("not difficult" to "impossible")
  See item map & fit statistics (on website)

Links
  Clinical example found at http://www.vard.org/jour/04/41/2/stelmack.html
  Free demo of Rasch software: www.winsteps.com/ministep.htm

Summary
  Differences between Rasch & IRT are usually small
  Rasch: probably better theoretically
  IRT: easier to get published in journals
    More people are IRTers & they are the reviewers/editors
  Both are much better than raw scores
  Small samples: use Rasch