Surveys and Attitude Measurement The reason surveys seem to be everywhere is that they are tremendously flexible— you can ask people about anything, and.

Slides:



Advertisements
Similar presentations
Chapter 16 Inferential Statistics
Advertisements

Mean, Proportion, CLT Bootstrap
Survey Methodology Reliability and Validity EPID 626 Lecture 12.
The Research Consumer Evaluates Measurement Reliability and Validity
RELIABILITY Reliability refers to the consistency of a test or measurement. Reliability studies Test-retest reliability Equipment and/or procedures Intra-
Psychological Testing Principle Types of Psychological Tests  Mental ability tests Intelligence – general Aptitude – specific  Personality scales Measure.
1Reliability Introduction to Communication Research School of Communication Studies James Madison University Dr. Michael Smilowitz.
Research Methodology Lecture No : 11 (Goodness Of Measures)
 A description of the ways a research will observe and measure a variable, so called because it specifies the operations that will be taken into account.
Part II Sigma Freud & Descriptive Statistics
Chapter 7 Statistical Data Treatment and Evaluation
Measurement Reliability and Validity
Some terminology When the relation between variables are expressed in this manner, we call the relevant equation(s) mathematical models The intercept and.
Chapter 14 Comparing two groups Dr Richard Bußmann.
Reliability and Validity of Research Instruments
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Beginning the Research Design
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
A quick introduction to the analysis of questionnaire data John Richardson.
Conny’s Office Hours will now be by APPOINTMENT ONLY. Please her at if you would like to meet with.
Today Today: More Chapter 3 Reading: –Please read Chapter 3 –Suggested Problems: 3.2, 3.9, 3.12, 3.20, 3.23, 3.24, 3R5, 3R9.
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Stat Notes 4 Chapter 3.5 Chapter 3.7.
Sample size. Ch 132 Sample Size Formula Standard sample size formula for estimating a percentage:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
Determining the Size of
Test Validity S-005. Validity of measurement Reliability refers to consistency –Are we getting something stable over time? –Internally consistent? Validity.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.
Reliability, Validity, & Scaling
Measurement in Exercise and Sport Psychology Research EPHE 348.
Instrumentation.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Measurement and Data Quality.
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 11 Part 3 Measurement Concepts MEASUREMENT.
Technical Adequacy Session One Part Three.
Psychometrics William P. Wattles, Ph.D. Francis Marion University.
Test item analysis: When are statistics a good thing? Andrew Martin Purdue Pesticide Programs.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Lecture 14 Sections 7.1 – 7.2 Objectives:
Study of the day Misattribution of arousal (Dutton & Aron, 1974)
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Reliability & Validity
1 MARKETING RESEARCH Week 5 Session A IBMS Term 2,
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Validity In our last class, we began to discuss some of the ways in which we can assess the quality of our measurements. We discussed the concept of reliability.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
Validity Validity: A generic term used to define the degree to which the test measures what it claims to measure.
Chapter 2: Behavioral Variability and Research Variability and Research 1. Behavioral science involves the study of variability in behavior how and why.
Research Methodology and Methods of Social Inquiry Nov 8, 2011 Assessing Measurement Reliability & Validity.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Assessing Measurement Quality in Quantitative Studies.
JS Mrunalini Lecturer RAKMHSU Data Collection Considerations: Validity, Reliability, Generalizability, and Ethics.
SOCW 671: #5 Measurement Levels, Reliability, Validity, & Classic Measurement Theory.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Experimental Research Methods in Language Learning Chapter 12 Reliability and Reliability Analysis.
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
ES 07 These slides can be found at optimized for Windows)
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
Chapter 6 - Standardized Measurement and Assessment
As a data user, it is imperative that you understand how the data has been generated and processed…
This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
5. Evaluation of measuring tools: reliability Psychometrics. 2011/12. Group A (English)
CRITICALLY APPRAISING EVIDENCE Lisa Broughton, PhD, RN, CCRN.
Estimating standard error using bootstrap
Reliability & Validity
Evaluation of measuring tools: reliability
Presentation transcript:

Surveys and Attitude Measurement The reason surveys seem to be everywhere is that they are tremendously flexible— you can ask people about anything, and most people will give you answers on just about anything. They aren’t perfect measuring instruments, of course, because people can shade what they choose to tell you, and survey methodologists spend a great deal of time and angst constructing surveys that are as minimally prone to problems as possible and by building in checks when they can. The topic of survey methodology is so extensive that it cannot be covered comprehensively here, but it is also a topic more appropriate for Marketing Research texts (such as that by Iacobucci and Churchill!) so we will assume some basic familiarity with survey design and sampling as we proceed. For this book on models, let us pick up on how the goodness of surveys is tested. No single item is perfect—but the hope is that an aggregate of several items will be more solid and provide a reasonably precise and accurate measure of customer sentiment (random errors on one item should cancel out those on another, etc.). Specifically, measurement theory says that a person’s answer to a question, X, is a function of their true opinion, T, plus some error ε, where the error is random and roughly normally distributed (so the positives cancel the negatives): X = T + ε.

Reliability Psychometricians speak of a scale’s reliability and validity. Reliability may be assessed in several ways. “Test-retest” reliability is a measure of the consistency of responses on the same survey from a sample at two points in time, and “alternate forms” reliability measures the consistency of responses from a sample on two surveys that are similar but not identical. These tests of reliability are rarely conducted in the real world because marketers feel lucky enough when they get customers to answer a survey once—to get them to do so again is usually unrealistic. More frequently, reliability is conceptualized and measured as “internal consistency,” that is, if part of a survey presents 5 questions asking customers about their service provider, then those 5 items should be fairly highly inter-correlated among themselves. If another section of the survey has 4 questions that ask customers about their perceptions of the fairness of the brand’s prices, those items should be correlated among themselves as well. (Whether the 5 questions about the service provider and the 4 questions about pricing are correlated is a completely different question that we’ll address later.)

Reliability The extent to which a set of items hang together is captured in two different indices. First, we can compute “item-total correlations.” As the name suggests, we would compute the correlation between question 1 and the average of questions 2, 3, 4, and 5 (we extract question 1 temporarily otherwise it would spuriously inflate the extent to which question 1 was correlated with the “total” given that the total would include the item itself). We then proceed to compute the correlation between item 2 and the average of items 1, 3, 4, and 5, and so on. Low item-total correlations are indicative of items that are candidates for extracting before subsequent modeling—it would seem that item doesn’t measure what the others are capturing, thus to include it would be including noise.

Reliability A second index of internal consistency is called Coefficient Alpha, defined as, where p is the number of items in the scale (5 for our example), is the variance of one of the items in the scale and is the variance of the entire scale (thus,, a sum over the variances of all p items and all of their covariances—remember that a covariance, σ ij is like a correlation coefficient in that it measures the extent of a linear relationship between items i and j, but the correlation coefficient is scaled by the two items’ standard deviation, which results in that nice property that -1.0 < r ij < +1.0; the covariance, σ ij = r ij /(σ i σ j ), thus is like summing over the whole p  p correlation matrix, but is actually the sum over the whole covariance matrix). Coefficient Alpha is closest to 1.0 (high reliability) if the items all hang together—then the sum of ’s (in the numerator) will be small compared to in the denominator ( sums those individual item variances but also all the covariances (like correlations), so it should be larger). If that ratio is small, then 1 minus the ratio (the quantity in parentheses) will be large, and a large Coefficient Alpha can be achieved. For more information, go online. It is good practice to have multi-item scales on surveys, but in the real world, replicate items often get lopped off in the interest of keeping surveys short.

Validity Content Construct Criterion-Related

Coefficient Alpha References Duhachek, Adam, Anne T. Coughlan and Dawn Iacobucci (2005), “Results on the Standard Error of the Coefficient Alpha Index of Reliability,” Marketing Science, 24 (2), Duhachek, Adam and Dawn Iacobucci (2004), “Alpha’s Standard Error (ASE): An Accurate and Precise Confidence Interval Estimate,” Journal of Applied Psychology, 89 (5), Iacobucci, Dawn and Adam Duhachek (2003), “Advancing Alpha: Measuring Reliability with Confidence,” Journal of Consumer Psychology, 13 (4),