Download presentation
Presentation is loading. Please wait.
Published byReynard Little Modified over 6 years ago
1
Using questionnaires in research: The statistical aspects
Martin Kidd
2
Outline Statistical concepts Planning the project/questionnaire design
Data entry Statistical analysis
3
Statistical concepts Types of data Likert scale Variables & cases
Latent variables & items Latent variable structure Correlation Reliability (Cronbach Alpha) Confirmatory Factor Analysis (CFA) Exploratory Factor Analysis (EFA)
4
Types of data Categorical data Continuous data
Fixed number of outcomes Eg: Faculty – Sciences, Agriculture, Arts No order in the outcomes Divide sample into groups Continuous data Measurements (age, height, weight etc)
5
Types of data Ordinal data Order is important
Outcomes take on discrete values Eg Income: , , Likert scales: 0-disagree, to 5-agree Mostly treated as continuous, some times as categorical
6
Likert scale Way of quantifying people’s opinions, feelings experiences etc Most common: 1-5 scale or 1-7 Words assigned to each scale: 1 Completely disagree 2 Disagree 3 Neutral 4 Agree 5 Fully agree
7
Variables & cases Each item on a questionnaire is a variable
Other variables can be derived from items, eg: BMI=weight/height2 Each respondent included in the survey is a case
8
Latent variables Variable that cannot be directly measured:
Service quality Economic confidence index Family resilience Emotional intelligence It is measured indirectly through measurable variables like items on a questionnaire
9
Latent variables Other terms used: Score for latent variable:
Constructs Scales & sub scales Dimensions & sub-dimensions Score for latent variable: Sum or avg of measured items
10
Latent variable structure
More than one latent variable evaluated Structure: Indicates which items measure which variable How are the latent variables interrelated
11
Latent variable structure
LV1 LV2 Item1 Item2 Item3 Item4 Item5 Item6 Item7 Item8 Item9
12
Correlation Relationship between 2 continuous/ordinal variables
Take on values between -1,+1
13
Correlation r+1: Positive relationship
14
Correlation r-1: Negative relationship
15
Correlation r0: No relationship
16
Reliability – Cronbach alpha
How reliable is a set of items in measuring a latent variable If a person is supposed to score high on the latent variable, then the scores for the items should also be high and vice versa. If not, then the items are probably measuring something else
17
Reliability – Cronbach alpha
Thus there should be a high degree of correlation between the items Cronbach alpha: Summary measure of the correlation between the items Upper bound of 1 (perfect correlation) 0.7 a guideline for good reliability
18
Reliability – Cronbach alpha
Issues to keep in mind: The more items, the higher the reliability Will not indicate bi-modality: LV LV1 LV2 item1 item2 item3 item4 item5 item6
19
Confirmatory factor analysis (CFA)
Determines whether a set of data supports a pre-specified latent variable structure Emphasis here is on pre-specified From the underlying theory, a latent model is drawn up Use data to determine whether this theoretical model holds in practice
20
Confirmatory factor analysis
1 Item1 1 2 2 Item2 LV1 3 3 Item3 4 4 Item4 5 Item5 5 6 Item6 6 LV2 7 7 Item7 8 8 Item8 9 9 Item9
21
Confirmatory factor analysis
Analysis based on covariance matrix of measured items Goodness of fit: How well can the realised covariance matrix be reproduced by the CFA model If goodness of fit not good: Assumptions violated Model not correctly specified
22
Confirmatory factor analysis
Goodness of fit ok: Investigate individual parameter estimates Usually many parameters to be estimated You need lots of data!
23
Exploratory factor analysis (EFA)
Latent structure is derived from the data No prior structure needed, only the number of factors(latent variables) need to be specified There are ways of determining the optimal number of factors as guided by the data
24
Exploratory factor analysis
Explained variance: How much of the variance in the original data was captured by the factors Factor loadings: which items define (or load on) which factors Factor should be interpretable, otherwise its meaningless
25
Exploratory factor analysis
Example:
26
Planning the project Aim of project:
Drives all other activities What statistical analyses are necessary? Content of the questionnaire dependent on the aim
27
Questionnaire design Is the purpose to design a measuring instrument for future use? Must make provision for repeated surveys to test & re-test questionnaire changes Is it going to be used only once? Use existing instruments! Be aware of possible danger of weak reliability
28
Questionnaire design Think about questionnaire validity:
Content validity Reliability Discriminant validity Etc. Too many people ask these questions after the survey
29
Questionnaire design Types of responses for questions:
List of choices (categorical, ordinal) Only one option can be selected Be as complete as possible Include “other” category only when necessary Number of options – Too many options might fragment the data Example: Gender: male/female Divide responses into 2 groups
30
Questionnaire design Types of responses Multiple selections
More than one option can be selected In the analysis, each option becomes a variable Which of the following treatments have you had: Speech therapy Physiotherapy Psychotherapy
31
Questionnaire design Types of responses Continuous data Open ended
If an accurate number can be filled in, don’t use categories Eg age: fill in exact age, not age categories Open ended If list of choices type, try not to leave open ended Time consuming to analyse
32
Questionnaire design Types of responses Likert scales:
How many options? 4pt, 5pt, 7pt scale? Be careful of the wording Make sure its ordinal !!! 1 Completely disagree 2 Disagree 3 No opinion 4 Agree 5 Completely agree 1 Completely disagree 2 Disagree 3 Neutral 4 Agree 5 Completely agree
33
Questionnaire design Types of responses Likert scales:
Be careful with the coding: 1 Completely disagree 2 Disagree 3 Neutral 4 Agree 5 Completely agree 6 Not applicable Usually treated as missing
34
Data entry Mostly done in Excel A column for each item/variable
A row for each respondent First row (only first row) reserved for variable names Data for 1st respondent in row 2 Short concise names for variables
35
Data entry First column usually a respondent number
For possible future back reference Leave missing values blank Categorizing open ended questions: Be consistent with spelling Agric & Agriscience will be 2 different faculties Be careful with the spacebar: Leading and trailing spaces can cause problems Use Excel autofilter to clean up data Use Excel freeze panes
36
Statistical analysis Statistical analysis can be divided into two phases: Preparatory (measurement) phase Reliability analysis Factor analysis Confirmatory factor analysis Main analysis phase Relationships between latent variables & other measured variables
37
Main analysis phase Correlation analysis Regression analysis
Cross tabulation ANOVA Etc. Quality dependent on outcomes of preparatory phase
38
Data Analysis (measurement phase): Cronbach alpha
When do we do reliability analysis (calculate Cronbach alpha): When there are constructs underlying the items and you know what they are When you are not worried about bi-modality When you don’t have enough responses for CFA
39
Data Analysis (measurement phase) : Cronbach alpha
Cronbach alpha: What to watch out for: Many items inflate the Cronbach alpha Does not indicate bi-modality Negatively phrased questions need to be reversed scored When using existing measurement instrument, be sure to have correct instructions Alpha is too low: What now??
40
Data Analysis (measurement phase) : Cronbach alpha
Alpha too low: When can we use “alpha if deleted” column?
41
Data Analysis (measurement phase) : Cronbach alpha
When can we use “alpha if deleted” column? Strictly speaking only when you are designing a measurement instrument The adapted questionnaire can be re-tested on an independent set of data Otherwise you are being lead by the current data without any means of verification
42
Data Analysis (measurement phase) : Cronbach alpha
Question remains: What do we do when alpha is low? Results from the main analysis could be degraded Eg correlations will be less pronounced You might not find significant differences between groups where there should be
43
Data Analysis (measurement phase) : CFA
When do we do CFA: When a theoretical latent structure has been postulated When you have enough data When you are worried about bi-modality When you are in a position to adapt and re-test on independent data
44
Data Analysis (measurement phase) : CFA
Things to watch out for: Similar issues to Cronbach alpha What do we do when the fit indices are not good? There are limited guidelines on how the CFA results can be used to update the latent model Fact remains, you must be in a position to verify on independent data
45
Data Analysis (measurement phase) : EFA
When do we do EFA: When there is no latent structure When the items are generated independently of the researcher Eg focus group discussions Does it make sense to do EFA on a questionnaire designed from the basis of a theoretical structure? I don’t think so.
46
Data Analysis (measurement phase) : EFA
However it appears to be common practice How do the new latent structure (derived from the data) tie in with the original aim of the project? The new latent structure has to be verified on independent data Or justified from a theoretical point of view
47
Data Analysis (measurement phase) : EFA
Can one calculate Cronbach alphas or do CFA on EFA results? Not using the same set of data: Results will be too optimistic
48
Data Analysis (measurement phase) : EFA
Why are results optimistic: Lets say we have items X1..Xp Let S1 to Sk represent all possible latent structures Lets say we have 2 independent data sets EFA done on 1st Let 1i be the Cronbach alpha of Si for the 1st data set and 2i for the 2nd data set
49
Data Analysis (measurement phase) : EFA
Data set 1(EFA) Data set 2 S1 11 S1 21 S2 12 S2 22 EFA result Si 1i Max alpha Si 2i (2i< 1i) Sj 1j Sj 2j Max alpha Sk 1k Sk 2k
50
Summary Careful planning and thought
Try and sort out analysis issues before designing the questionnaire and doing the survey I have hammered on independent data: If possible, plan for more than one survey Divide data into train/test data Thank you for your attention
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.