Indexes, Scales, and Typologies

Slides:

Advertisements

Similar presentations

Allyn & Bacon 2003 Social Work Research Methods: Qualitative and Quantitative Approaches Topic 7: Basics of Measurement Examine Measurement.

Advertisements

Research Methodology Chapter 8 & 9.

Agenda Levels of measurement Measurement reliability Measurement validity Some examples Need for Cognition Horn-honking.

Edouard Manet: The Bar at the Folies Bergere, 1882

Scales and Indices Scales and Indices combine several categories in a question or several questions into a “composite measure” that represents a larger.

Measurement Concepts Operational Definition: is the definition of a variable in terms of the actual procedures used by the researcher to measure and/or.

Developing a Questionnaire

Conceptualization and Measurement

The Research Consumer Evaluates Measurement Reliability and Validity

Research Methodology Lecture No : 11 (Goodness Of Measures)

MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT

Measurement Reliability and Validity

Designing Research Concepts, Hypotheses, and Measurement

Indexes, Scales and Typologies. Content validity Achieved by including all the dimensions of a concept Most non-demographic variables require more than.

CH. 9 MEASUREMENT: SCALING, RELIABILITY, VALIDITY

1 Single Indicator & Composite Measures UAPP 702: Research Design for Urban & Public Policy Based on notes by Steven W. Peuquet. Ph.D.

RESEARCH METHODS Lecture 18

Non Comparative Scaling Techniques

Research Ethics Levels of Measurement. Ethical Issues Include: Anonymity – researcher does not know who participated or is not able to match the response.

1 Measurement PROCESS AND PRODUCT. 2 MEASUREMENT The assignment of numerals to phenomena according to rules.

Index and Scale Similarities: Both are ordinal measures of variables. Both rank order units of analysis in terms of specific variables. Both are measurements.

1 Measurement Measurement Rules. 2 Measurement Components CONCEPTUALIZATION CONCEPTUALIZATION NOMINAL DEFINITION NOMINAL DEFINITION OPERATIONAL DEFINITION.

Chapter 6 Indexes, Scales, and Typologies. Index and Scale  Index  Constructed by accumulating scores assigned to individual attributes.  Scale  Constructed.

Measurement of Abstract Concepts Edgar Degas: Madame Valpincon with Chrysantehmums, 1865.

Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Internal Consistency Reliability Analysis PowerPoint.

Measuring Social Life Ch. 5, pp

Reliability, Validity, & Scaling

Scales and Indices While trying to capture the complexity of a phenomenon We try to seek multiple indicators, regardless of the methodology we use: Qualitative.

MEASUREMENT OF VARIABLES: OPERATIONAL DEFINITION AND SCALES

What is a Research Question? The central idea of what you wish to focus on in your research The issue you wish to proof Limited, answerable and closed.

Foundations of Educational Measurement

McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.

CHAPTER 6, INDEXES, SCALES, AND TYPOLOGIES

1 Cronbach’s Alpha It is very common in psychological research to collect multiple measures of the same construct. For example, in a questionnaire designed.

6. Conceptualization & Measurement

Advanced Research Methods Indices, Scales and Typologies By David Warren Kirsch.

Validity and Reliability Edgar Degas: Portraits in a New Orleans Cotton Office, 1873.

Chapter Thirteen Measurement Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.

Measurement Validity.

Research: Conceptualization and Measurement Conceptualization Steps in measuring a variable Operational definitions Confounding Criteria for measurement.

Research: Conceptualization and Measurement Conceptualization Steps in measuring a variable Operational definitions Confounding Criteria for measurement.

The Practice of Social Research Chapter 6 – Indexes, Scales, and Typologies.

Chapter 6 Indexes, Scales, And Typologies Key Terms.

Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.

Scales and Indices While trying to capture the complexity of a phenomenon We try to seek multiple indicators, regardless of the methodology we use: Qualitative.

Chapter 6 Indexes, Scales, And Typologies. Chapter Outline Indexes versus Scales Index Construction Scale Construction.

Chapter Twelve Copyright © 2006 McGraw-Hill/Irwin Attitude Scale Measurements Used In Survey Research.

Measurement Chapter 6. Measuring Variables Measurement Classifying units of analysis by categories to represent variable concepts.

Indices and Scales To construct composite measures of variables, we need indices and scales Social science studies deal with many composite measures Many.

DESIGNING GOOD SURVEYS Laura P. Naumann Assistant Professor of Psychology Nevada State College.

RELIABILITY AND VALIDITY Dr. Rehab F. Gwada. Control of Measurement Reliabilityvalidity.

Indexes and Scales Why use a “composite” measure of a concept? ▫ There is often no clear “single” indicator ▫ Increase the range of variation ▫ Make data.

Reliability Analysis.

Chapter 2 Theoretical statement:

Indexes, Scales, and Typologies

CHAPTER 6, INDEXES, SCALES, AND TYPOLOGIES

Part Two: Measurement Edgar Degas,

Social Research Methods MAN-10 Erlan Bakiev, Ph. D.

Journalism 614: Reliability and Validity

Research strategies & Methods of data collection

Chapter Nine Measurement and Scaling: Noncomparative Scaling

Measuring Social Life: How Many? How Much? What Type?

Chapter 6 Indexes, Scales, And Typologies

RESEARCH METHODS Lecture 18

Indexes, Scales, and Typologies

INDEXES, SCALES, & TYPOLOGIES

Reliability Analysis.

Research strategies & Methods of data collection

Chapter 6 Indexes, Scales, and Typologies

Indexes and Scales Why use a “composite” measure of a concept?

Presentation transcript:

Indexes, Scales, and Typologies Edgar Degas: The Absinthe Drinker (detail), 1875-76

Indexes versus Scales Index A composite measure based upon the summed responses to dichotomous indicators. This composite might be the sum of responses to the indicator items or some other calculation, such as the mean. Each indicator is given equal weight in measuring the construct.

Indexes versus Scales Index (Continued) An “exercise” index, for example, might be the total score (0-5) of whether one participated in five different physical activities in the past week. In this example, we want to know only whether one participated in the physical activity at all during the past week.

Indexes versus Scales Scale A composite measure based upon multiple continuous-level indicators. This composite might be the sum of responses to the indicator items or some other calculation, such as the mean. Responses to each indicator vary in their strength and therefore in their contribution to the total score for the construct.

Indexes versus Scales Scale (Continued) An “exercise” scale, for example, might be the total times (0-35) one engaged in five different physical activities last week. In this example, we want to know how many times the person engaged in each of the five activities on each day of the week.

Notes Regarding the Term “Scale” Confusing Language Unfortunately, in social science literature, the term “scale” is used to refer to the metric by which responses are recorded (i.e., a Likert scale, a Guttman scale, etc.) and to a measure that is constructed from multiple questions. These notes might clarify the difference in the use of the term “scale.”

Notes Regarding the Term “Scale” “Scale” as a Metric Suppose we wanted to measure a concrete object, such as the length of a table. We might measure its length in inches by using a ruler. In this sense, the ruler is the measuring instrument and its length is measured on a “scale” of inches. The use of the word “scale” in this context refers to the metric in which we measure the table: inches. We could use some other metric to measure the table, such as centimeters.

Notes Regarding the Term “Scale” “Scale” as a Metric (Continued) Suppose instead that we wanted to measure an abstract concept, such as marital satisfaction. We might measure marital satisfaction by simply asking the married person: “Are you satisfied with your marriage? And we might record the responses to this question on a Likert scale, wherein 1 = very unsatisfied, 2 = unsatisfied, 3 = neither satisfied nor unsatisfied, 4 = satisfied, 5 = very satisfied. In this context, the question, “Are you satisfied with your marriage?” is the measuring instrument and satisfaction is measured using the metric: 1-5.

Notes Regarding the Term “Scale” “Scale” as a Metric (continued) We might have used some other approach, such as a semantic differential, to measure marital satisfaction with the instrument “Are you satisfied with your marriage?” We have not created a “scale” to measure marital satisfaction because we are using just one question to measure it: “Are you satisfied with your marriage?” And we cannot assess reliability because we have just one measure of marital satisfaction: “Are you satisfied with your marriage?”

Notes Regarding the Term “Scale” “Scale” as a Metric (continued) The use of the word “scale” to refer to the metric of measurement (i.e., a Likert scale) is unfortunate because it creates confusion. The confusion is heightened because of the very common use of the word “scale” in journal articles to refer to a measurement metric. But there you have it!

Notes Regarding the Term “Scale” “Scale” as a Constructed Variable Suppose we use a series of three questions to measure marital satisfaction: Overall I am happy with my marriage. I am satisfied with my marriage. My marriage is a source of happiness to me. Suppose that we record the answers to each of these questions using a Likert scale, wherein 1 = strongly disagree, 2 = disagree, 3 = neither agree nor disagree, 4 = agree, and 5 = strongly agree.

Notes Regarding the Term “Scale” “Scale” as a Constructed Variable (continued) Suppose for each person we interview, we calculate the mean score on each of the three questions about marital satisfaction. For example, suppose that Mary answered the three questions, respectively, with the numbers: 5, 4, and 3. Mary’s average score for the three questions equals 4. Thus, her score on the three question marital satisfaction scale equals 4.

Notes Regarding the Term “Scale” “Scale” as a Constructed Variable (continued) In this sense, we have created a “scale.” And because we have more than one measurement that is included within the marital satisfaction scale (i.e., three questions), we can calculate the reliability of this scale with Cronbach’s alpha.

Indexes versus Scales Weighting Both indexes and scales can be unweighted or weighted. In an unweighted index or scale, each item is treated equally in the calculation of the measure. In a weighted index or scale, some items are given more weight than are others. We might decide, for example, that “running” should be given more weight than “stretching” in our measure of “exercise.”

Index Construction Item Selection Face Validity: The extent to which items seem to correspond to the definition of the construct (see also: content, logical). Dimensionality: Items intended to measure a construct should be exclusive to that construct. There are special exceptions to this criterion, such as in multi-trait, multi-method models. Variance: Items should have a strong correlation with the construct being measured.

Index Construction Examination of Empirical Relationships Bivariate Relationships: This direct relationship between the item and the construct indicates the extent to which the item measures the construct. Multivariate Relationships: Sometimes, once a bivariate relationship is calculated for one item and a construct, the relationship between a second item and the construct is seen as a weak one.

Index Construction Handling Missing Data Missing data always present empirical, conceptual, and even ethical problems. There are no perfect solutions to the problem. One might: Omit cases with missing data on any items. Insert the mean of all cases to items with missing data. Use specialized statistical procedures to estimate a value for the missing data.

Index Construction Index Validation Item Analysis: Examination of the empirical contribution of each item to measuring the construct (see also: internal validity). External Validation: Examination of the extent to which the construct is correlated with related constructs (see also: empirical validity). Bad Index or Bad Validators?: A lack of external validation might occur because the validating construct is not measured well.

Scale Construction Similarities to Index Construction As with the construction of indexes, scale construction must address issues of item selection, examination of the data, scoring, weighting, missing data, and validation. Central to scale construction is the development of items with a range of responses that can accurately reflect different levels of correspondence with the construct being measured.

Types of Scales Thurstone Scale This technique assesses the extent of agreement among a group of judges about the content validity of proposed items for a scale. For example, one might ask a group of persons to judge how closely 25 different items come to measuring self-esteem. Then, one might select the 10 items that received the highest average scores for having content validity with self-esteem.

Types of Scales Thurstone Scale (Continued) This technique can help find the best questions to ask to measure an abstract concept. The technique does not specify how a question or set of questions should be formatted on a questionnaire.

Types of Scales Likert Scale This technique assesses the extent of the subject’s agreement with items that have been judged (by some method) to have content validity with the construct being measured. For example, the researcher might ask the respondent to 1) strongly disagree; 2) disagree; 3) neither agree nor disagree; 4) agree; or 5) strongly agree with the statement, “I am a person of worth” as a means of measuring self-esteem.

Types of Scales Likert Scale (Continued) This technique can be used to ask many questions in a short amount of space (mailed survey) or time (telephone survey). The technique is intuitively appealing to most persons. The technique provides continuous-level data. The technique can become tiresome if used too extensively on a questionnaire.

Types of Scales Semantic Differential Scale This technique assesses the extent of the subject’s agreement with items, where the response for each item is shown on a continuum. Example: Skipping class in Sociology 302 is... Good for me. 1___2___3___4___5 Bad for me.

Types of Scales Semantic Differential (Continued) This technique can be used to ask many questions in a short amount of space (mailed survey) or time (telephone survey). The technique is intuitively appealing to most persons. The technique provides continuous-level data. The technique can become tiresome if used too extensively on a questionnaire. It sometimes can be difficult to label the end-points of a semantic scale.

Types of Scales Guttman Scale This technique assesses the extent of the subject’s agreement with items, where the items are meant to represent a continuum.For example, one might ask these questions: Do you drink alcohol? Do you smoke marijuana? Do you use cocaine? One might anticipate that all persons who answer “yes” to #3 would also answer “yes” to #1 and #2, and so forth.

Types of Scales Guttman Scale (Continued) This technique can be used to ask many questions in a short amount of space (mailed survey) or time (telephone survey). The technique is intuitively appealing to most persons. The technique provides continuous-level and ranked data. The items have to form a continuum that is accepted by respondents and the community of scholars.

Typologies Definition A nominal-level variable that summarizes two or more variables. For example, one might create a typology of males: The “man’s man”: outdoor-type, strong, beer drinker. The “womanizer”: suave, good-looking, wine drinker. And so forth...

Typologies Critique As with stereotypes, typologies do not accurately reflect anyone and provide oversimplifications of everyone. One should use typologies mainly to organize one’s thinking as part of exploratory research. It is extremely difficult to analyze a typology as a dependent variable because too much variation exists within each category.

Validity and Reliability Introduction Validity: The extent to which a measuring device measures what it is intended to measure. Reliability: The extent to which a measuring device provides consistent values for the same phenomenon being measured. These terms are described in detail at this accompanying web site: Validity and Reliability.