Download presentation
Presentation is loading. Please wait.
Published byStephanie Murphy Modified over 9 years ago
1
Introduction to Statistical Methods By Tom Methven Digital slides and tools available at: www.macs.hw.ac.uk/~mjc/teaching/ResearchMethods
2
Moving Bell-curves
3
Designing the Experiment 1. Define exactly what you want to measure 2. Pick which statistical test to use, first 3. Decide on your experimental design
4
Worked Example Two different piano designs, iPiano and Wii Piano! This example will be built up through the notes to show how to go about designing/analysing a simple experiment.
5
Level Of Measurement (Non- Parametric) Nominal – Labelled results or characteristics – Can be used for equivalence and set membership – Can show commonality but leaves it undefined E.g. List of names, football players’ numbers Ordinal – Results in some sort of rank order – Results do not show differences between places! – No arithmetic is valid, you cannot say 3 rd place is the same as 1 st and 2 nd combined, for example E.g. Race results, top 10 films
6
Level Of Measurement (Parametric) Interval – Results placed along a regular scale, with an arbitrary zero-point – Multiplication/Division is meaningless but the ratio of differences is valid as differences are consistent E.g. Degrees Fahrenheit Ratio – Results placed along a regular scale with an absolute zero-point – Allows for statements such as 6 units is twice as much as 3 units E.g. Kelvin and most SI units
7
Statistic Basics ModeThe most common score MedianThe central score MeanThe ‘average’ of all the scores E.g. For the results 9,2,5,3,6,9,5,6,4,2,6 Mode: 6 Median: 5 Mean: 5.18
8
Worked Example Results Using time to successfully play a simple tune as the measure. Therefore, we’re using a ratio scaling, so we can use parametric tests. Results: Interface 1Interface 2 Person 14.284.38 Person 22.784.99 Person 37.634.30 Person 47.934.27 Person 57.195.50 Person 65.735.22 Person 78.404.09 Person 85.884.46 Person 95.604.00 Person 104.894.90 Mean:6.034.61
9
Randomisation and Ordering Effects Performing an experiment gives experience which could improve subjects’ ‘scores’ For a two-part test, an equal number of people should try each condition first When you have many conditions or trials, it is easiest to show then in a random order
10
Latin Squares A way of counter-balancing condition order for small amounts of conditions E.g. For three possible conditions: You can also get ‘balanced’ Latin squares for experiments with even number of conditions. Order of conditions or trials Group 1ABC Group 2BCA Group 3CAB
11
Accuracy of the Mean Variance: Standard Deviation: Standard Error:
12
Degrees of Freedom For sample populations, often ‘N – 1’ is used rather than N. This is the simplest calculation of D.O.F., but it can get very complex. We assume that the sample mean is the same as the population mean. Therefore, it is related to how many values are free to vary without altering the required mean value. (Rugby example)
13
Student’s T-Test Used for comparing the means of two sample groups using the variance Several types, dependant on the samples: – ‘Dependant/Paired’ – Using this one! – ‘Independent Equal Variance’ – ‘Independent Unequal Variance’ Assumes populations are normally distributed
14
Student’s T-Test Create a ‘null hypothesis’: – E.g. There is no difference in learning between iPiano and Wii Piano Create an ‘alternative hypothesis’: – One-tailed: Wii Piano is easier to learn – Two-tailed: Wii Piano is easier or harder to learn
15
Dependent T-Test Used to compare the results of two groups = Average difference = Expected difference (0 for null hypothesis) = Standard deviation of differences = Sample Size
16
Worked Example T Result = 1.420756421 = 1.985348881 = 10 t-value = 2.26
17
Interpreting T-Value The T-Test gives a t-value, which can be converted to a p-value If the p-value < 0.05 there is less than a 5% chance the null hypothesis is true Therefore we conclude that the change of condition caused the observed effect Our p-value was 0.025, so we can accept our alternate hypothesis: ‘Wii Piano is easier to learn’
18
Effect Size Tells us how important the result is in practical terms. Gives an ‘r-value’. – r = 0.10 (small effect) – 1% of total variance – r = 0.30 (medium effect) – 9% of total variance – r = 0.50 (large effect) – 25% of the variance – r DOES NOT work on a linear scale, so these can only be used as good guidelines. It is becoming more common to include the r-value as well as the p-value as it is a well understood and widely used check.
19
[letter]-values Recap: t-value: Result of the t-test – Normally converted into a ‘p-value’ for easy comparison between statistical tests. Used to calculate ‘r-value’ p-value: Is it statistically significantly? – If ‘p-value’ is less than 0.05, then you can discard the null hypothesis and accept your alternate hypothesis. r-value: Is the effect substantial in reality? – Does the result mean anything in real terms? HOW MUCH does our experiment explain the change in means?
20
Final Results p-value = 0.025 r-value = 0.60 Degrees of freedom = 9 “The results show that Wii Piano allows users to play a set tune successfully significantly faster than iPiano (p = 0.025). In addition, the effect size was large (r = 0.6), showing the result was substantial in real terms.”
21
Error Bars Error bars: Plot standard error
22
Excel Example TTEST in Excel will give a ‘p-value’ directly
23
Summing Up Dependant t-test when using a single group – For example, when using 10 friends to test two interfaces Avoid ordering effects – Randomise order and get an even number to do each test first Use ‘TTEST’ in Excel to get p-value easily – Easier than calculating it yourself, but harder to get r-value! Check p < 0.05 and quote the value and result – Always make sure to quote the actual p-value in your results as well as saying whether the result was statistically significant
24
Recommended Reading
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.