McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.

Slides:

Advertisements

Similar presentations

Measurement, Evaluation, Assessment and Statistics

Advertisements

© McGraw-Hill Higher Education. All rights reserved. Chapter 3 Reliability and Objectivity.

© 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Validity and Reliability Chapter Eight.

VALIDITY AND RELIABILITY

Chapter 5 Measurement, Reliability and Validity.

Part II Sigma Freud & Descriptive Statistics

Part II Sigma Freud & Descriptive Statistics

QUANTITATIVE DATA ANALYSIS

Concept of Measurement

Lesson Fourteen Interpreting Scores. Contents Five Questions about Test Scores 1. The general pattern of the set of scores  How do scores run or what.

Reliability and Validity

FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,

FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,

Chapter 7 Correlational Research Gay, Mills, and Airasian

Measurement Concepts & Interpretation. Scores on tests can be interpreted: By comparing a client to a peer in the norm group to determine how different.

Technical Issues Two concerns Validity Reliability

Measurement and Data Quality

Validity and Reliability

Understanding Research Results

@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.

Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.

Chapter 3 Statistical Concepts.

Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.

Instrumentation.

Foundations of Educational Measurement

Data Analysis. Quantitative data: Reliability & Validity Reliability: the degree of consistency with which it measures the attribute it is supposed to.

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.

Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.

Chapter 11 Descriptive Statistics Gay, Mills, and Airasian

Descriptive Statistics

Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.

UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.

METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.

© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.

Descriptive Statistics

Chapter 4: Test administration. z scores Standard score expressed in terms of standard deviation units which indicates distance raw score is from mean.

Reliability & Validity

Counseling Research: Quantitative, Qualitative, and Mixed Methods, 1e © 2010 Pearson Education, Inc. All rights reserved. Basic Statistical Concepts Sang.

EDU 8603 Day 6. What do the following numbers mean?

Chapter 2 Statistical Concepts Robert J. Drummond and Karyn Dayle Jones Assessment Procedures for Counselors and Helping Professionals, 6 th edition Copyright.

An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.

Data Collection and Reliability All this data, but can I really count on it??

Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.

Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.

Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.

Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.

Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.

Chapter 13 Descriptive Data Analysis. Statistics  Science is empirical in that knowledge is acquired by observation  Data collection requires that we.

Chapter 6, part I: Educational Measurement EDUC 502 October 10, 2005.

Chapter 2: Behavioral Variability and Research Variability and Research 1. Behavioral science involves the study of variability in behavior how and why.

Presented By Dr / Said Said Elshama  Distinguish between validity and reliability.  Describe different evidences of validity.  Describe methods of.

Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Assessing Measurement Quality in Quantitative Studies.

SOCW 671: #5 Measurement Levels, Reliability, Validity, & Classic Measurement Theory.

Chapter Eight: Using Statistics to Answer Questions.

Chapter 6: Analyzing and Interpreting Quantitative Data

Psychometrics. Goals of statistics Describe what is happening now –DESCRIPTIVE STATISTICS Determine what is probably happening or what might happen in.

©2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.

Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.

Chapter 6 - Standardized Measurement and Assessment

Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.

Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.

© 2009 Pearson Prentice Hall, Salkind. Chapter 5 Measurement, Reliability and Validity.

Chapter 12 Understanding Research Results: Description and Correlation

Associated with quantitative studies

CHAPTER 5 MEASUREMENT CONCEPTS © 2007 The McGraw-Hill Companies, Inc.

Reliability & Validity

Introduction to Statistics

Basic Statistical Terms

Chapter Nine: Using Statistics to Answer Questions

Chapter 8 VALIDITY AND RELIABILITY

Presentation transcript:

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals for the Consumer Woolfolk / Perry Child and Adolescent Development © 2012 Pearson Education, Inc. All rights reserved. Sixth Edition

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Foundations of Educational Measurement Chapter 5

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 3 Discussion Topics Educational measurement Descriptive statistics Frequency Distributions Central tendency Variation Correlation Validity of measurement Reliability of measurement

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 4 Educational Measurement Measurement: assignment of numbers to differentiate values of a variable Evaluation: procedures for collecting information and using it to make decisions for which some value is placed on the results Assessment - multiple meanings Measurement of a variable Evaluation Diagnosis of individual difficulties Procedures to gather information on student performance

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 5 Educational Measurement Purpose of measurement for research Obtain information about the variables being studied Provide a standard format for recording observations, performances, or other responses of participants Provide for a quantitative summary of the results from many participants

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 6 Educational Measurement Four measurement scales Nominal – categories Race, gender, types of schools (e.g., public, private, parochial) Ordinal - ordered categories Finishing position in a race, grade levels Interval - equal intervals between numbers on the scale Test scores, achievement levels Ratio - equal intervals and an absolute zero (0) Height, weight, time

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 7 Descriptive Statistics Statistics: procedures that summarize and analyze quantitative data Descriptive statistics: statistical procedures that summarize a set of numbers in terms of central tendency, variation, or relationships Important for understanding what the data tells the researcher

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 8 Descriptive Statistics Frequency distributions An organization of the data set indicating the number of times (i.e., frequency) each score was present Types Frequency table Frequency polygon Histogram

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 9 Descriptive Statistics Frequency distributions Shapes (see Figure 5.2) Normal - scores are equally distributed around the middle Positively skewed - the set of scores is characterized by a large number of low scores and a small number of high scores Negatively skewed - the set of scores is characterized by a large number of high scores and a small number of low scores Outlier scores – scores that distort findings because they are so different from the other scores in the sample

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 10 Descriptive Statistics Central tendency What is the typical score? Three measures Mode: the most frequently occurring score Median: the score above and below which one- half of the scores occur Mean – The arithmetic average of all scores – Statistical properties make it very useful – Concerns related to outlying scores

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 11 Descriptive Statistics Variability How different are the scores? Two types Range: the difference between the highest and lowest scores Standard deviation – The average distance of the scores from the mean – The relationship to the normal distribution ±1 SD 68% of all scores in a distribution ±2 SD 97% of all scores in a distribution Use of percentile ranks - the percentage of scores at or below a specified score

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 12 Descriptive Statistics

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 13 Descriptive Statistics Relationship How do two sets of scores relate to one another? Correlation A measure of the relationship between two variables – Strength to 1.00 – Direction - positive (+) or negative (-) Scatterplots – graphic depictions of correlations – Interactive scatterplots Interactive scatterplots

Interpreting Descriptive Statistics McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 14

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 15 Validity of Measurement Validity: the extent to which inferences are appropriate, meaningful, and useful Refers to the interpretation of the results A matter of degree Specific to a particular use or interpretation A unitary concept Involves an overall evaluative judgment

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 16 Validity of Measurement Three sources of validity evidence Test content - evidence of the extent to which items on a test are representative of the larger domain of content or items from which they are drawn Internal structure - evidence of the extent to which the relationships between items and parts of the instrument are consistent with those reflected in the theoretical basis of the instrument or its intended use

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 17 Validity of Measurement Three sources of validity evidence Relationships with other variables - evidence of the extent to which scores from an instrument are related to similar as well as different traits Convergent evidence - scores correlate with measures of the same thing being measured Discriminate evidence - scores do not correlate with measures of something different than that being measured Predictability - the extent to which test scores predict performance on a criterion variable

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 18 Validity of Measurement Importance of validity to research If the research results are to have any value, validity of the measurement of a variable must exist Use of established and “new” instruments and the implications for establishing validity Importance of establishing validity prior to data collection (e.g., pilot tests)

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 19 Validity of Measurement Importance of validity to research Validity as a matter of degree (i.e., the extent to which...) Judged on the basis of available evidence Varying levels of validity evidence are reported in articles

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 20 Reliability of Measurement Reliability The extent to which scores are free from error Error is measured by consistency Sources of error Test construction and administration – Ambiguous questions, confusing directions, changes in scoring, interrupted testing, etc. Participants’ characteristics – Test anxiety, lack of motivation, fatigue, guessing, etc.

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 21 Reliability of Measurement Reliability Measurement Reliability coefficients range from 0.00 to 1.00 regardless of the formula used to calculate them 0.00 indicates no reliability or consistency 1.00 indicates total reliability or consistency

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 22 Reliability of Measurement Five types of reliability evidence Stability (i.e., test-retest) Testing the same subject using the same test on two occasions Limitation - carryover effects from the first to second administration of the test Equivalence (i.e., parallel form) Testing the same subject with two parallel (i.e., equal) forms of the same test taken at the same time Limitation - difficulty in creating parallel forms

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 23 Reliability of Measurement Equivalence and stability Testing the same participants with two forms of the same test taken at different times Limitation - difficulty in creating parallel forms Internal consistency Testing the same subject with one test and “artificially” splitting the test into two halves Limitations - must have a minimum of ten (10) questions

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 24 Reliability of Measurement Internal consistency (continued) Two forms – KR 20 Dichotomously scored (i.e., right or wrong) items Typical of cognitive measures – Cronbach alpha Non-dichotomously scored (e.g., strongly agree, agree, disagree, strongly disagree) items Typical of non-cognitive measures

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 25 Reliability of Measurement Agreement Used when traditional estimates such as stability, equivalence, equivalence and stability, or internal consistency are not applicable Typically some form of agreement is used (e.g., raters agreeing with one another)

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 26 Reliability of Measurement Agreement (continued) Situations in which this estimate is used – Observational measures - agreement between raters making the same observation – Insufficient numbers of test items on an instrument - agreement across the percentage of responses that are the same for several participants – Data with highly skewed distributions - percentage of agreement in the number of participants

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 27 Reliability of Measurement Importance of reliability If the results are to have any value, reliability of the measurement of a variable must exist Established prior to conducting the research (e.g., pilot study) Necessary but not sufficient condition for validity (i.e., to be valid, an instrument must be reliable, but a reliable instrument is not necessarily valid)

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 28 Reliability of Measurement Conditions affecting reliability Length of the test (i.e., longer tests are typically more reliable) Participants Greater reliability with heterogeneous samples Scores for older participants are typically more reliable than those for younger children Trait being measured (i.e., cognitive traits are more reliable than affective characteristics)

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 29 Reliability of Measurement Enhancing reliability Standardized administration procedures (e.g., directions, conditions, etc.) Appropriate reading level Reasonable length of the testing period Counterbalancing the order of testing if several tests are being given

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. 30 Validity and Reliability For a discussion of validity and reliability see the American Educational Research Association’s recently revised Standards for Educational and Psychological Testing