Instructor: Vincent Duffy, Ph.D. Associate Professor of IE Lecture 11 – Questionnaire Design & Evaluation Thurs. Feb. 22, 2007 IE 486 Work Analysis & Design.

Slides:



Advertisements
Similar presentations
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Advertisements

Reliability and Validity
Reliability & Validity.  Limits all inferences that can be drawn from later tests  If reliable and valid scale, can have confidence in findings  If.
Statistical Issues in Research Planning and Evaluation
INFERENTIAL STATISTICS. Descriptive statistics is used simply to describe what's going on in the data. Inferential statistics helps us reach conclusions.
Copyright © Allyn & Bacon (2007) Data and the Nature of Measurement Graziano and Raulin Research Methods: Chapter 4 This multimedia product and its contents.
IE 486 Work Analysis & Design II Exam 2 Review Dr. Vincent G. Duffy Friday, March 23, 2007.
Measurement in Survey Research Developing Questionnaire Items with Respect to Content and Analysis.
Data Analysis Statistics. Inferential statistics.
IE 486 Work Analysis & Design II Instructor: Vincent Duffy, Ph.D. School of Industrial Eng. & Ag.& Bio Eng. Lab 3 – Evaluation of Questionnaire Data Friday,
Instructor: Vincent Duffy, Ph.D. Associate Professor Lecture 10: Research, Design & Evaluation Tues. Feb. 20, 2007 IE 486 Work Analysis & Design II.
A quick introduction to the analysis of questionnaire data John Richardson.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Data Analysis Statistics. Inferential statistics.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Internal Consistency Reliability Analysis PowerPoint.
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Indexes, Scales, and Typologies
Measurement and Data Quality
Reliability, Validity, & Scaling
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
CHAPTER 4 Research in Psychology: Methods & Design
Statistics for Education Research Lecture 10 Reliability & Validity Instructor: Dr. Tung-hsien He
Instrumentation.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
User Study Evaluation Human-Computer Interaction.
Study of the day Misattribution of arousal (Dutton & Aron, 1974)
Tests and Measurements Intersession 2006.
Quantitative SOTL Research Methods Krista Trinder, College of Medicine Brad Wuetherick, GMCTE October 28, 2010.
Descriptive Research: Quantitative Method Descriptive Analysis –Limits generalization to the particular group of individuals observed. –No conclusions.
1 MARKETING RESEARCH Week 5 Session A IBMS Term 2,
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Descriptive Research Study Investigation of Positive and Negative Affect of UniJos PhD Students toward their PhD Research Project Dr. K. A. Korb University.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter Eight: Using Statistics to Answer Questions.
Chapter 6: Analyzing and Interpreting Quantitative Data
Limit collection of categorical data Age – – – – – & Above Income ,000 10,001 – 25,000 25,001 – 35,000.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 12 Multiple.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
RESEARCH METHODS IN INDUSTRIAL PSYCHOLOGY & ORGANIZATION Pertemuan Matakuliah: D Sosiologi dan Psikologi Industri Tahun: Sep-2009.
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
Chapter 6 - Standardized Measurement and Assessment
PART 2 SPSS (the Statistical Package for the Social Sciences)
Chapter 13 Understanding research results: statistical inference.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
FACTOR ANALYSIS & SPSS. First, let’s check the reliability of the scale Go to Analyze, Scale and Reliability analysis.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Appendix I A Refresher on some Statistical Terms and Tests.
Stats Methods at IC Lecture 3: Regression.
The Basics of Social Science Research Methods
FACTOR ANALYSIS & SPSS.
Reliability Analysis.
Improving Student Engagement Through Audience Response Systems
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Evaluation of measuring tools: validity
Understanding Results
STEM Fair Graphs & Statistical Analysis
Reliability Analysis.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
15.1 The Role of Statistics in the Research Process
PSY 250 Hunter College Spring 2018
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Instructor: Vincent Duffy, Ph.D. Associate Professor of IE Lecture 11 – Questionnaire Design & Evaluation Thurs. Feb. 22, 2007 IE 486 Work Analysis & Design II

Administrative Briefly review QOTD answer sheet from L10 Questionnaire design & analysis –(in preparation for Lab 3 beginning Friday) Lab 2 due in class next Tuesday 27 th

QOTD- Lecture 11 Q.1. Briefly discuss what is meant by criteria, measures & dependability. Q.2 What methods are appropriate for evaluating questionnaire data? Q.3 Should the same analyses be used for nominal/category data and continuous data?

Questionnaire design QOTD –included in slides Methods in Work Analysis & Design –Dependability of measures Evaluating questionnaire data Factor analysis Internal consistency –Questionnaires examples & scales

Q.1 Briefly discuss what is meant by criteria, measures & dependability This is related to methods of data collection & evaluation A criterion is an evaluation standard that can be used as a surrogate or correlate of outcome measures…such as system effectiveness, human performance and attitudes. –Eg. For driving performance we previously considered stopping distance and time to lane change (or lane deviations)

Q.1 Briefly discuss what is meant by criteria, measures & dependability Important aspects of criteria include: –Relative strengths/weaknesses of the data collection methods –Balance between costs of methods & available resources (eg. Consider motion capture test bed). –Costs include money, time, personnel & expertise Relevance

Q.1 Briefly discuss what is meant by criteria, measures & dependability Relative strengths/weaknesses of the data collection methods (dependability) Today, aspects of criteria & methods of evaluation Dependability is related to the validity & reliability Brief intro to dependability and measures of validity & reliability – more in lab tomorrow Important aspects of criteria include: –Relevance, Linearity and Homogeneity

Q.1 Briefly discuss what is meant by criteria, measures & dependability Relevance –Evaluation criteria must contribute to overall system mission Eg. Speed, response time, errors, comfort, acceptance Linearity – usually assumed –However, for industrial performance vs. intelligence, initially, no relationship was shown Additional analysis showed up to IQ of 90, r=0.46; between 90 and 110 r=0.04. For IQ>110, r=-0.52 (piecewise linear; see graph) –(non linear) IQ initially contributes to performance, then no impact, then too high IQ appears related to boredom see also Salvendy & Carayon (1997) supplementary reading for more detail Performance IQ

Q.1 Briefly discuss what is meant by criteria, measures & dependability Homogeneity – performance vs. time of day –Note how performance changes with time there appears to be warm up & slow down for the same operator The lack of consistency in performance over time of day raises issues of how/when to best collect data

Methods – evaluating questionnaire data Q.2 Which statistical technique can be used to evaluate questionnaire data?

Methods – evaluating questionnaire data Q.2 Which statistical technique can be used to evaluate questionnaire data? –It will depend on the objective of the analysis...

Methods – evaluating questionnaire data Which statistical technique becomes especially appropriate when questionnaire responses are measured on a nominal scale? First, what is meant by nominal scale? –Quality categorized as high, medium, low (discrete data) –Machine breakdowns due to mechanical failure electrical failure or operator misuse –For analysis of discrete data you can use a Chi-Square analysis

Methods – evaluating questionnaire data Q.3 Which statistical technique becomes especially appropriate when questionnaire responses are measured on a nominal scale?

Methods – evaluating questionnaire data Q.3 Which statistical technique becomes especially appropriate when questionnaire responses are measured on a nominal scale? A Chi Squared analysis For example – you can test whether color choice is related to gender FemaleMaleTotal Green chose green 38.9%22.2%61.1% chose green 64.6%36.4%64.6% of those who chose green were female Blue chose blue 16.7%22.2%38.9% chose blue 42.9%57.1%57.1% of those who chose blue were male Is there a significant difference in color choice, depending on gender? A Chi square statistic (shown after analysis of the original data set in SAS) shows  2 =7.5, p=0.006 (p<0.05). Hence, we would conclude yes.

Methods – evaluating questionnaire data Q.3 How might a response scale be designed to maximize the chance the responses (on a questionnaire) will be on a ratio measurement scale? –Use continuous, numerical, and anchored scales and do pilot testing – the average of questions/items can give a continuous measure. –A factor analysis will likely be done before an ANOVA (test of differences). -Factor analysis tries to find a factor (can think of it as a new variable) that will provide the highest set of correlations with the original variables (squares of these variables) producing the largest eigenvalue.

Examples of Scales Eg. Range from Very important – Very unimportant –With Neither important or unimportant in the middle – could have ‘moderately’ important in between. Or …Strongly agree, agree, undecided, disagree, strongly disagree Very much, much, fair, a little, not at all Much worse than usual, worse than usual, about the same, better than usual, much better than usual Excellent, very good, good, fair, poor All, most, a good bit, most, some, little, none Always, very often, fairly often, sometimes, almost never, never

The Hackman & Oldham Job Satisfaction Survey (1975) How satisfied are you with this aspect of your job? __1.The amount of job security I have. __2. The amount of pay and fringe benefits I receive __3. The amount of personal growth and development I get in doing my job. __4. The people I talk to and work with on my job. __5. The degree of respect and fair treatment I receive from my boss. __6. The feeling of worthwhile accomplishment I get from doing my job. __7. The chance I get to know other people while on the job. 1 Extremely Dissatisfied 2 Dissatisfied 3 Slightly Dissatisfied 4 Neutral 5 Slightly Satisfied 6 Satisfied 7 Extremely Satisfied

Methods – evaluating questionnaire data -Check eigenvalues of each factor (or variable) before deciding how many variables to include/consider - Eigenvalues should be greater than 1 for each factor that you include -Check percent of variance explained by each factor by taking the eigenvalue divided by the # of items -For example, if the eigenvalue (from linear algebra) is 3.66 with 6 questions (max. 6 units in the eigenvalue) – -then 3.66/6 or 0.61 (61%) of total variance in the questionnaire (whole questionnaire) is explained by the first factor. -You would expect a sum of at least 50% of the variance to be explained by the factors you have chosen to represent important variables.

Methods – evaluating questionnaire data -Check each factor before deciding how many variables to include/consider - Eigenvalues should be greater than 1 for each factor that you include -Check percent of variance explained by taking eigenvalue divided by the # of items (eg. If eigenvalue is 3.66 with 6 questions (max. 6 units in the eigenvalue –eg. 3.66/6 or 0.61 (61%) of total variance in the questionnaire (whole questionnaire) is explained by the first factor. An illustration of a principal components factor analysis with varimax rotation – item coefficients are shown below…(look for coefficients > 0.4) The values shown below are after rotation - and are easier to interpret Factor 1Factor 2 Ques. 1Feel blue Ques. 2People stare at me Ques. 3People follow me Ques. 4Basically happy Ques. 5People want to hurt me Ques. 6Enjoy going to parties Note: an item/question that loads on two different factors (such as question 5) would likely be dropped from further consideration; these factor analyses tend to be more useful when n is large (eg. n>100 participants)

Methods – Dependability - evaluating questionnaire data (cont.) -Factor analysis tries to find a factor (you can think of it as a new variable) that will provide the highest set of correlations with the original variables (the squares of these variables) producing the largest eigenvalue.

Structured questionnaire –Anchor –Avoid ambiguity –Avoid leading questions –Measure component –Measure whole –Sensitivity of scale –Halo effect –Dependability

Internal Consistency – testing reliability of the measures Internal consistency is the extent to which tests or procedures assess the same construct. It is a measure of the precision between the observers or of the measuring instruments used in a study. Cronbach’s Alpha measures how well a set of items (or variables) measure a single latent construct. Cronbach’s Alpha can be used as a measure of internal consistency. Cronbach’s Alpha as a measure of Internal Consistency

General steps toward analysis using a questionnaire Step 1: Conduct the survey using the structured questionnaire Step 2: Analyze the collected data Step 3: Make recommendation; report, presentation of results

Examples of inferences that can be drawn from questionnaire data 1) Which generic features (across manufacturers and models) are liked most, disliked most, or having greater difficulties in usage? 2) Which manufacturer and which model of the manufacturer is preferred regarding to each feature by the survey customers? 3) Which generic features (within manufacturers) are liked most, disliked most, or having greater difficulties in usage?

Can a measure be valid if it is not reliable? A measure can not be valid if it is not reliable. –If we can not measure it consistently, it is then hard to imagine that it can be correct. –Shown quantitatively… In lab more on…Reliability of Predictors and Criterion

Reliability of Predictors and Criterion –R 0 = Observed correlation (Validity) between the predictor and criterion –R T = “True” correlation (Validity) between the predictor and criterion The “True” correlation is one previously reported – possibly in the literature. –R p = Reliability of predictor –R c = Reliability of criterion Criterion related validity

–Suppose there are four tests in a battery with reliabilities of R 1 =0.60, R 2 =0.70, R 3 =0.78 and R 4 =0.92 (eg. These could be internal consistency reliabilities); –and three criteria are utilized with the following reliabilities: r 1 =0.45, r 2 =0.60, and r 3 =0.75. In other words, these reliabilities (r) are believed to be the relationships between two (or more) measures intended to be the measuring same criterion –Then: (part of Lab exercise tomorrow) Impact of Reliability on Validity

Cronbach’s Alpha as a measure of Internal Consistency k is the number of items in the group. s 2 res is the variance of residual components, which can not be controlled. s 2 p is the variance component for person. * The alpha coefficient is interpreted as the ratio of true score variance to observed score variance.

Cronbach’s Alpha as a measure of Internal Consistency For example, suppose 13 people were asked to rate a pair of questions on a 7-point scale. The pair of questions look different but they are testing the same item. For example –How much do you like the weather today? –How do you feel about the weather today?

People Observed Score (for each question) Total (Yi.) p p235 8 p333 6 p443 7 p55510 p653 8 p76612 p853 8 p95510 p p p p Total ( Y.j ) (Y..) Layout of Data Sheet calculation as part of lab exercise tomorrow

Questionnaire Design Supplemental material in R.W. Bailey, Human Performance Engineering 3 rd Ed. pp (Appendix). –And Webpage as shown: