Sarah E. Stegall, Darrin L. Rogers, Emanuel Cervantes.

Slides:



Advertisements
Similar presentations
Division of Student Assessment Prairie State Achievement Examination (PSAE) Illinois State Board of Education 100 North First Street, E-216 Springfield,
Advertisements

Method Participants 184 five-year-old (M age=5.63, SD=0.22) kindergarten students from 30 classrooms in central Illinois Teacher ratings The second edition.
EVAULATION OF THE NSCRG SCHOOL SAMPLE Donsig Jang and Xiaojing Lin Third International Conference on Establishment Surveys Montreal, Canada, June 21, 2007.
Presenter Name(s) Issue date National Student.
METHOD PARTICIPANTS 393 undergraduates completed the study, (M age = 20; 75% female; 50% Asian, 21% Caucasian). MATERIALS AND PROCEDURE All subjects completed.
Meta-Cognition, Motivation, and Affect PSY504 Spring term, 2011 April 20, 2011.
Comparison of 2 Population Means Goal: To compare 2 populations/treatments wrt a numeric outcome Sampling Design: Independent Samples (Parallel Groups)
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
23/11/2007Asian School of Business, Trivandrum Business Research Scales and Questionnaire.
Inferential Statistics
EMR 6500: Survey Research Dr. Chris L. S. Coryn Kristin A. Hobson Spring 2013.
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Do Economic and Demographic Characteristics Differ between Web and Mail Respondents to the 2005 Census of Agriculture Content Test? By Nancy J. Dickey.
Correlational and Differential Research
Chapter Eight & Chapter Nine
The Research Consumer Evaluates Measurement Reliability and Validity
QUESTIONNAIRES ORANGE BOOK CHAPTER 9. WHAT DO QUESTIONNAIRES GATHER? BEHAVIOR ATTITUDES/BELIEFS/OPINIONS CHARACTERISTICS (AGE / MARITAL STATUS / EDUCATION.
Personality and Life Satisfaction: A Facet-Level Analysis Ulrich Schimmack Shigehiro Oishi R. Michael Furr David C. Funder.
4 Conducting Marketing Research 1. What is Marketing Research? Marketing research is the systematic design, collection, analysis, and reporting of data.
CH. 9 MEASUREMENT: SCALING, RELIABILITY, VALIDITY
Sampling Basics 2011, 9, 13. Last Class: Measurement  Scale of measurement –Nominal scale –Ordinal scale –Interval-ratio scale  Reliability: Free of.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 5 Making Systematic Observations.
MR2300: MARKETING RESEARCH PAUL TILLEY Unit 10: Basic Data Analysis.
Chapter10 Measurement in Marketing Research. The Measurement Process Empirical System (MKT Phenomena) Abstract System (Construct) Number System measurement.
Research Methods in MIS
Designing Questionnaires for Research By Janna McColgan EDUC 491/EDUC 403 Research Spring 2012.
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 9 Surveys, Questionnaires, and Polls Most commonly used quantitative method Used for obtaining.
Business and Management Research
PhD Research Seminar Series: Writing the Method Section Dr. K. A. Korb University of Jos.
© 2013 Cengage Learning. Outline  Types of Cross-Cultural Research  Method validation studies  Indigenous cultural studies  Cross-cultural comparisons.
Measurement in Exercise and Sport Psychology Research EPHE 348.
Effectiveness of a Virtual Laboratory as a preparatory resource for Distance Education chemistry students Presenter: Yun-Ting,Wong Advisor: Ming-Puu,Chen.
Student Engagement Survey Results and Analysis June 2011.
APA Method Section.  Things to keep in mind...  Purpose: Replication Assess reliability & validity Make design clear Keep it precise, concise, and clear.
Final Study Guide Research Design. Experimental Research.
Implementing Basic Differences Tests
Chapter 2 Research Methods in Social Psychology. Chapter Outline  Characteristics of Empirical Research  Research Methods  Research in Diverse Populations.
Study of the day Misattribution of arousal (Dutton & Aron, 1974)
Students’ and Faculty’s Perceptions of Assessment at Qassim College of Medicine Abdullah Alghasham - M. Nour-El-Din – Issam Barrimah Acknowledgment: This.
Quantitative SOTL Research Methods Krista Trinder, College of Medicine Brad Wuetherick, GMCTE October 28, 2010.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
Chapter 4 – Research Methods in Clinical Psych Copyright © 2014 John Wiley & Sons, Inc. All rights reserved.
Examination of Public Perceptions of Four Types of Child Sexual Abuse Prevention Programs Brandon Kopp Raymond Miltenberger.
Learning Objective Chapter 9 The Concept of Measurement and Attitude Scales Copyright © 2000 South-Western College Publishing Co. CHAPTER nine The Concept.
Thurstone Scaling.
Measurement and Scaling
Measurement Theory in Marketing Research. Measurement What is measurement?  Assignment of numerals to objects to represent quantities of attributes Don’t.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 9 Surveys, Questionnaires, and Polls Most commonly used quantitative method –Used.
Chapter 6: Analyzing and Interpreting Quantitative Data
The effects of Peer Pressure, Living Standards and Gender on Underage Drinking Psychologist- Kanari zukoshi.
3.04 Interpret marketing information to test hypotheses and/ or solve issues Marketing Management.
A COMPARISON OF DIFFERENT METHODS OF DETECTING INATTENTIVE RESPONDENTS Avi Fleischer M.S.
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
Likert Scaling. Work quickly and effectively under pressure 49 Organize the work when directions are not specific. 39 Decide how to manage multiple tasks.
Accuracy, Reliability, and Validity of Freesurfer Measurements David H. Salat
Consumer Behavior, Ninth Edition Schiffman & Kanuk Copyright 2007 by Prentice Hall Chapter 2 Consumer Research.
In what ways are our superstitions and beliefs in the supernatural different to Chinese people?  The following is an example to help with your Tangjia.
Examining the Effect of Crime Prevention Signage Through Social Normative Theory and Attitude Structures Bruce Biggs and Meghan E. Norris Results  Neighborhood.
The Effect of the Conscientious Responders Scale on Random Responding Rates in Psychological Questionnaires Noor Shubear, Zdravko Marjanovic, Lisa Bajkov,
Depending on the Mean and the Midpoint, Random Responders Cause Type 1 Error Zdravko Marjanovic, Noor Shubear, Tsz Yin Fung, & Lisa Bajkov Thompson Rivers.
CHAPTER 4: HOW PSYCHOLOGISTS USE THE SCIENTIFIC METHOD.
Authors *Dr. Asma Parveen *Uzaina
Validity and Reliability
پرسشنامه کارگاه.
5. Reliability and Validity
Business and Management Research
Evaluating Research Is this valid research?.
Methodology Week 5.
Research Methods.
Causal Comparative Research Design
Presentation transcript:

Sarah E. Stegall, Darrin L. Rogers, Emanuel Cervantes

The Semantic Inconsistency Scale (SIS), a no- cost tool for measuring random responding in questionnaire research, was developed and validated in two independent samples. It shows strong initial evidence of validity, able to not only detect computer-generated random responses but also invalid responding caused by more realistic conditions.

Invalid responding can threaten validity and interpretation (Huang, 2012; however, see Costa & McCrae, 1997 for alternative views). Random responding (RR; Archer & Smith, 2008) scales measure participants consistency of to pairs of items with similaror oppositemeanings (e.g., MMPI2 scale VRIN; Butcher et al., 2001; PAI scale INC; Morey, 2007).

Methods previously used to develop and evaluate RR scales include: Comparing responses from participants instructed to answer questionnaires randomly with subjects given standard instructions (Berry et al., 1991; Cramer, 1995; Galen & Berry, 1996) and Comparing real responses with computer-generated random responses (Charter & Lopez, 2003) Real-world More ecologically valid manipulations more externally valid Not been used so far

Commercially-marketed assessments only Semantic inconsistency scale (SIS) Public-domain measure of RR for use with questionnaires

Participants: 482 undergraduate students 75% female, 25% male 95% Hispanic Data Collection Phases Phase 1 (February-July, 2012): N=286, 75% female. Phase 2 (August-December, 2012): N=196, 81% female.

Procedures & Materials: Anonymous online survey Big Five Inventory (BFI; John & Srivastava, 1999) SIS item pool I see myself as someone who…

30 pairs of items From International Personality Item Pool (Goldberg et al., 2006)

Judged to be semantically related Very similar in meaning Apparently opposite in meaning

Degree of inconsistency in responses RR Needs a push to get started Finds it difficult to get down to work. Spends time thinking about past mistakes Doesnt worry about things that have already happened. I see myself as someone who… Strongly Disagree Disagree Neither Agree Nor Disagree Agree Strongly Agree Strongly Disagree Disagree Neither Agree Nor Disagree Agree Strongly Agree *Note: reverse coded* Difference of 1 Difference of 3 Similar items Opposite items

Experimental Manipulation: Quick condition (Q or quick) Subtly encouraged to complete the task quickly In-test messages emphasized importance of students time Control condition (A or accurate) Instructed to complete the survey accurately In-test messages emphasized accuracy

Phase 1 Selection and validation of final item pairs Maximized correlations Resulting 22-item (11-pair) SIS Phase 2 SIS scale assessed using responses SIS score = mean discrepancy in SIS pairs (possible range: 0-4)

Q vs. A comparison Survey completion time Attention to survey content Real vs. random responses All results calculated on Phase 2 sample only (unless otherwise specified)

Median SIS scores Q > A (Wilcoxon test z=2.179, p<.05). Figure 1. Trimmed (20%) means for SIS scores in condition A (accurate) versus Q (quick).

Correlation SIS scores Time to complete the full survey Spearmans rho = -.13 (p =.06)

Multiple choice questions content of survey items they had just seen & responded to Number of questions answered incorrectly Prediction: positive correlation with SIS No association Spearmans rho =.04 (p >.05)

SIS discrimination between 100% random responding (computer-generated) Actual participant responses Compare Phase 2 responses to 100,000 records of randomly-generated responses. Score SIS on everything Real scores < Random-response scores (t=31.56, p<.001; Figure 2).

Figure 2. Distribution of true Phase 2 SIS scores (blue) versus randomly-generated profiles (red).

SIS sensitivity of discrimination between True Phase 2 records Equal number of randomly-generated records Receiver-Operator Characteristic (ROC) analysis Area under the curve (AUC) discrimination ability of the test AUC =.95 (excellent discrimination ability)

Figure 3. ROC analysis for Phase 2 responses vs. (100%) randomly- generated response records.

1. Dataset split in half randomly Control group: original (real) responses Random group: X% of responses replaced with random Randomly-selected X% of responses X goes from 1% to 100% (i.e., do this process 100 times) Control Group Original (real) Responses Random Group X% replaced with random 1% < X < 100%

2. SIS scored & AUC calculated SIS discrimination between Control & Random groups

3. Result: SIS discrimination between real and partial (from 1 to 100%) random responding 4. We repeated this entire process 100 times, to even out random selection

Each run: AUCs comparing real responses to real + partial random 0% to 100% random Mean of 100 AUCs at each point

Figure 4. AUCs for 100 runs of SIS discrimination between original profiles and partially (1% through 100%) random profiles. Light blue lines are AUCs for 100 individual runs; dark blue line indicates mean AUC at each point.

Semantic Inconsistency Scale (SIS) Phase 1: Scale development (22-item/11-pair) Phase 2: Validated Identification of random responding Excellent with 100% random responses Fair performance even with protocols having less than 20% random responding. Discriminate between Quick & Accurate: Participants primed and instructed to answer hastily Participants given regular instructions

Perform as well as (if not better than) comparable tests Easily inserted into a variety of psychological and personality tests Modification of item stems or formats may allow use with an even wider range.

Limitations and Future Directions: Not appropriate for all test varieties Very short research Clinical protocols Random responding is not always a problem Depends on clinical/research situation SIS might help you know whether it is

SIS = Robust and valid measure of random responding FREE: Creative Commons licensed

Archer, R. P., & Smith, S. R. (2008). Personality assessment. CRC Press. Berry, D. R., Wetter, M. W., Baer, R. A., Widiger, T. A., Sumpter, J. C., Reynolds, S. K., & Hallam, R. A. (1991). Detection of random responding on the MMPI-2: Utility of F, back F, and VRIN scales. Psychological Assessment: A Journal Of Consulting And Clinical Psychology, 3(3), doi: / Butcher, J. N., Graham, J. R., Ben-Porath,Y. S., Tellegen, A., Dahlstrom,W. G.,&Kaemmer, B. (2001). MMPI-2 (Minnesota Multiphasic PersonalityInventory-2): Manual for administration and scoring (2nd ed.). Minneapolis, MN: University of Minnesota Press. Charter, R. A., & Lopez, M. N. (2003). MMPI 2: Confidence intervals for random responding to the F, F Back, and VRIN scales. Journal of clinical psychology, 59(9), Costa Jr., P. T., & McCrae, R. R. (1997). Stability and Change in Personality Assessment: The Revised NEO Personality Inventory in the Year Journal Of Personality Assessment, 68(1), 86. Cramer, K. M. (1995). Comparing three new MMPI-2 randomness indices in a novel procedure for random profile derivation. Journal of personality assessment, 65(3), Gallen, R. T., & Berry, D. R. (1996). Detection of random responding in MMPI-2 protocols. Assessment, 3(2), Goldberg, L. R., Johnson, J. A., Eber, H. W., Hogan, R., Ashton, M. C., Cloninger, C. R., & Gough, H. G. (2006). The international personality item pool and the future of public-domain personality measures. Journal of Research in Personality, 40(1), Huang, J. L., Curran, P. G., Keeney, J., Poposki, E. M., & DeShon, R. P. (2012). Detecting and deterring insufficient effort responding to surveys. Journal of Business and Psychology, 27(1), John, O. P., & Srivastava, S. (1999). The Big Five trait taxonomy: History, measurement, and theoretical perspectives. Handbook of personality: Theory and research, 2, Morey, L. C. (2007). Personality assessment inventory (PAI).

Sarah Stegall: Darrin Rogers: Emanuel Cervantes: