Chapter 2 Norms and Reliability
The essential objective of test standardization is to determine the distribution of raw scores in the norm group so that the test developer can publish derived scores known as norms.
Raw Scores
In the case of criterion-referenced tests, norms are not essential. There are many different kinds of norms, but they share one characteristic: each incorporates a statistical summary of a large body of scores.
Essential Statistical Concepts
Frequency distributions
Measures of central tendency
Measures of variability
The normal distribution
Skewness
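For reference, the two summary statistics relied on most heavily below, written in population form (some texts divide by N - 1 for the standard deviation), with raw scores X_i, i = 1, ..., N:

\[ M = \frac{\sum_i X_i}{N}, \qquad SD = \sqrt{\frac{\sum_i (X_i - M)^2}{N}} \]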
Raw Score Transformations
Percentiles and percentile ranks
Standard scores
T scores and other standardized scores
Normalizing standard scores
Stanines, stens, and the C scale
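The core standard-score conversions listed above, where X is a raw score from a distribution with mean M and standard deviation SD:

\[ z = \frac{X - M}{SD}, \qquad T = 10z + 50 \]

Stanines, stens, and C-scale scores rescale the same standardized metric onto coarser, single-digit scales.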
Selecting a Norm Group
Age and grade norms
Local and subgroup norms
Expectancy tables
Criterion-Referenced Tests
Criterion-referenced tests are used to compare examinees’ accomplishments to a predefined performance standard. The focus is on what the test taker can do rather than on comparisons to the performance levels of others.
Concepts of Reliability
Classical test theory and the sources of measurement error
Sources of measurement error
Measurement error and reliability
The reliability coefficient
The correlation coefficient
The correlation coefficient as a reliability coefficient
Reliability as temporal stability
Reliability as internal consistency
Item response theory
The new rules of measurement
Special circumstances in the estimation of reliability
The interpretation of reliability coefficients
Reliability and the standard error of measurement
Classical test theory and the sources of measurement error
Test scores result from the influence of two factors:
1. Factors that contribute to consistency: the stable attributes of the examinee that the test attempts to measure.
2. Factors that contribute to inconsistency: characteristics of the person, test, or situation that are irrelevant to the attribute being measured but nonetheless affect scores.
Classical test theory expresses this decomposition as shown below.
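A minimal statement of the classical model implied by this two-factor view, using the conventional symbols (X = observed score, T = true score, E = measurement error):

\[ X = T + E \]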
Sources of measurement error
Item selection
Test administration
Test scoring
Systematic measurement error
Measurement error and reliability
Assumptions about unsystematic measurement error:
1. Measurement errors act as random influences.
2. The mean error of measurement is zero.
3. Measurement errors are not correlated with true scores.
4. Measurement errors are not correlated with errors on other tests.
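Because errors are assumed to be random and uncorrelated with true scores, observed-score variance partitions into true-score variance and error variance, a result the reliability coefficient below depends on:

\[ \sigma_X^2 = \sigma_T^2 + \sigma_E^2 \]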
The reliability coefficient
The ratio of true score variance to the total variance of test scores.
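In symbols, with r_XX denoting the reliability coefficient:

\[ r_{XX} = \frac{\sigma_T^2}{\sigma_X^2} = 1 - \frac{\sigma_E^2}{\sigma_X^2} \]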
The correlation coefficient as a reliability coefficient
If test results are highly consistent, then the scores of persons taking the test on two occasions will be strongly correlated; the Pearson correlation used for this purpose is shown below. Reliability can be estimated in two broad ways: as temporal stability and as internal consistency.
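As a reminder, the Pearson correlation between two sets of scores X and Y (with means X-bar and Y-bar) is:

\[ r = \frac{\sum (X - \bar{X})(Y - \bar{Y})}{\sqrt{\sum (X - \bar{X})^2}\,\sqrt{\sum (Y - \bar{Y})^2}} \]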
Reliability as temporal stability
Test-retest reliability
Alternate-forms reliability
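A minimal computational sketch of a test-retest estimate, assuming paired scores from two administrations are available as NumPy arrays (the data below are hypothetical):

```python
import numpy as np

# Hypothetical scores for the same eight examinees on two occasions.
occasion_1 = np.array([12, 15, 9, 20, 17, 11, 14, 18])
occasion_2 = np.array([13, 14, 10, 19, 18, 10, 15, 17])

# Test-retest reliability is estimated as the Pearson correlation
# between scores from the two administrations.
r_test_retest = np.corrcoef(occasion_1, occasion_2)[0, 1]
print(f"Test-retest reliability estimate: {r_test_retest:.2f}")
```

The same computation applies to alternate-forms reliability, with the second array holding scores from the parallel form rather than a repeated administration.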
Reliability as internal consistency
Split-half reliability
The Spearman-Brown formula
Coefficient alpha
The Kuder-Richardson estimate of reliability
Interscorer reliability
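For reference, the standard forms of these internal-consistency estimates, with r_hh the correlation between half-test scores, N the number of items, sigma_i^2 the variance of item i, sigma_X^2 the variance of total scores, and p_i, q_i the proportions of examinees passing and failing item i:

\[ r_{SB} = \frac{2\,r_{hh}}{1 + r_{hh}}, \qquad \alpha = \frac{N}{N - 1}\left(1 - \frac{\sum_i \sigma_i^2}{\sigma_X^2}\right), \qquad \mathrm{KR\text{-}20} = \frac{N}{N - 1}\left(1 - \frac{\sum_i p_i q_i}{\sigma_X^2}\right) \]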
Item response theory
Item response function (item characteristic curve)
Information functions
Invariance in IRT
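As one common parameterization (not necessarily the one used in the chapter), the three-parameter logistic item response function gives the probability of a correct response at ability level theta, with discrimination a, difficulty b, and guessing parameter c:

\[ P(\theta) = c + \frac{1 - c}{1 + e^{-a(\theta - b)}} \]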
Special circumstances in the estimation of reliability
Unstable characteristics
Speed and power tests
Restriction of range
Reliability of criterion-referenced tests
Reliability and the standard error of measurement
Computing the standard error of measurement
The SEM and individual test scores
The SEM and differences between scores
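The standard formulas, with SD the standard deviation of test scores, r_XX the reliability of the test, and r_11, r_22 the reliabilities of two tests expressed on a common scale with the same SD:

\[ SEM = SD\sqrt{1 - r_{XX}}, \qquad SE_{\mathrm{diff}} = SD\sqrt{2 - r_{11} - r_{22}} \]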