Some statistics questions answered:

Slides:



Advertisements
Similar presentations
Deciding which statistical test to use:. Tests covered on this course: (a) Nonparametric tests: Frequency data - Chi-Square test of association between.
Advertisements

The end (of the beginning):. Questions to ask in deciding which test is appropriate: 1. How has each participant contributed to the data? (Individual.
Hypothesis testing 5th - 9th December 2011, Rome.
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
PSY 307 – Statistics for the Behavioral Sciences
Some statistics questions answered:
The t-test:. Answers the question: is the difference between the two conditions in my experiment "real" or due to chance? Two versions: (a) “Dependent-means.
Inferential Stats for Two-Group Designs. Inferential Statistics Used to infer conclusions about the population based on data collected from sample Do.
Lecture 9: One Way ANOVA Between Subjects
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Today Concepts underlying inferential statistics
The Research Skills exam: The four horsemen of the apocalypse: pestilence, war, famine and the RS1 exam.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
ANOVA Analysis of Variance.  Basics of parametric statistics  ANOVA – Analysis of Variance  T-Test and ANOVA in SPSS  Lunch  T-test in SPSS  ANOVA.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
General Linear Model 2 Intro to ANOVA.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Chapter 13 Understanding research results: statistical inference.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Quantitative methods and R – (2) LING115 December 2, 2009.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Dr Hidayathulla Shaikh. Objectives At the end of the lecture student should be able to – Discuss normal curve Classify parametric and non parametric tests.
More statistics questions answered: "The brain? I think it's in here"
Independent-Samples t test
Comparing Multiple Groups:
Exploring Group Differences
Inferential Statistics
Data measurement, probability and Spearman’s Rho
Dependent-Samples t-Test
Learning Objectives: 1. Understand the use of significance levels. 2
INF397C Introduction to Research in Information Studies Spring, Day 12
Non-Parametric Tests 12/1.
Using the t-distribution
Non-Parametric Tests 12/1.
Practice & Communication of Science
Hypothesis Testing Review
Non-Parametric Tests 12/6.
Statistics.
CHOOSING A STATISTICAL TEST
Non-Parametric Tests.
Data measurement, probability and statistical tests
Spearman’s rho Chi-square (χ2)
Elementary Statistics
Internal Validity – Control through
Comparing Multiple Groups: Analysis of Variance ANOVA (1-way)
Kin 304 Inferential Statistics
Statistical Analysis Determining the Significance of Data
Introduction to ANOVA.
Non-parametric tests, part A:
Chapter 13 Group Differences
Non – Parametric Test Dr. Anshul Singh Thapa.
Data measurement, probability and statistical tests
Parametric versus Nonparametric (Chi-square)
Analysis of Variance: repeated measures
The Research Skills exam:
Experiments with More Than Two Groups
UNIT-4.
Inferential Statistical Tests
InferentIal StatIstIcs
Inferential testing.
Examine Relationships
Presentation transcript:

Some statistics questions answered: "Well, am I significantly more popular than Nick Clegg?"

Requests: What's the difference between the SD and SEM? What are "degrees of freedom"? How to calculate t-tests. When do you use an ANOVA? How to use tables to assess statistical significance.

Standard deviation and standard error: Standard deviation (SD): a measure of how much scores are spread out around their mean. The bigger the SD, the less representative the mean is of the set of scores from which it was calculated. 1. Find the mean. 2. Find difference between each score and the mean. 3. Square the differences. 4. Add them up. 5. Divide by number of scores. 6. Take the square root of the result. In practice - use the "standard deviation" function on your calculator!

Standard deviation and standard error: Standard error of the mean (SE): a measure of how much sample means are spread out around the population mean. The bigger the SE, the less confident we can be that the sample mean truly reflects the population mean. 1. Find the standard deviation. 2. Divide it by the square root of the number of scores. means of different samples actual population mean means of different samples actual population mean successive attempts

What are "degrees of freedom"?: Simple answer: the number of scores that are free to vary. 3 4 2 5 14 1 7 11 9 8 Top row total = 14: once you know the first three scores, fourth score is fixed - it's not free to vary. Hence Chi-Square d.f. are (number of rows - 1) * (number of columns -1) Here, d.f. = (4-1) * (5-1) = 12

Degrees of freedom for different tests: Pearson's r: number of pairs of scores -2 Repeated measures t-test : number of participants - 1 Independent measures t-test: (no. of participants in group A - 1) + (no. of participants in group B - 1) Kruskal-Wallis: d.f. = number of groups - 1 Friedman's: d.f. = number of conditions - 1 Mann-Whitney: N1 and N2 = number of participants in each group Wilcoxon: N = number of participants.

Degrees of freedom for different tests: One-way independent measures ANOVA: Total variation d.f. = no. scores - 1 Between groups variation d.f. = no. groups -1 Within-groups variation d.f. = no. scores in group A -1 plus no. scores in group B -1, plus no. scores in group C - 1, etc. e.g. ANOVA with 5 groups and 20 participants in each: Source SS df MS F Between groups 4 (= no. of groups - 1) Within groups 95 (= 5 x 19) Total 99 (= total no. scores – 1) The value of F for (4,95) degrees of freedom is what would be reported from this.

One-way repeated measures ANOVA: Total variation d.f. = no. scores - 1 Between subjects variation d.f. = number of subjects -1 Within-subjects variation d.f. = no. subjects Within-subjects variation due to experimental manipulations d.f. = number of conditions - 1 Within-subjects variation due to random factors ("error") d.f. = total within-subjects d.f. minus experimental d.f. Source SS d.f. Mean Squares F-ratio Between subjects 5 Within subjects 12 Experimental 2 Error 10 Total 17

Degrees of freedom for different tests: e.g. a one-way repeated measures ANOVA, with 3 conditions and 6 participants: Source SS d.f. MS F Between subjects 5 (= number of participants -1) Within subjects 12 (here = 6 * (3-1) = 6*2 ) Experimental 2 (= no. conditions -1) Error 10 (here = (6-1) *(3-1) = 5*2 ) Total 17 (= total no. scores – 1) The value of F for (2,10) degrees of freedom is what would be reported from this.

Tests whose output you should be able to understand: Tests you need to be able to calculate by hand (for section 2 of the exam): Chi-Square test of association , Chi-Square goodness of fit Pearson's r , Spearman's rho Wilcoxon , Mann-Whitney Friedman's , Kruskal-Wallis Repeated-measures t-test , Independent-measures t-test Tests whose output you should be able to understand: One-way independent-measures ANOVA One-way repeated-measures ANOVA

Independent-measures t-test: The t value represents the size of the difference between two means (essentially a z-score for small sample sizes). The bigger the value of t, the more confident we can be that the difference between the means is "real", i.e. that it has not occurred just by chance. t is the difference between two means, divided by an estimate of how much this difference is likely to vary from occasion to occasion (estimated standard error of the difference between means). Repeated-measures t-test: Same logic, except that we can capitalise on the fact that the same people did both conditions (and hence random variation in performance is likely to be less).

Independent-measures t-test: t is the difference between two means, divided by an estimate of how much this difference is likely to vary from occasion to occasion (estimated standard error of the difference between means). mean A mean B male height female height probability of occurrence value of t ©McMaster University 12

Effects of food additives on children's activity levels: Group A: eat tartrazine-containing nosh. Group B: same nosh without tartrazine. DV: time spent running around. A: additive B: no additive 11 5 15 6 4 14 7 18 mean: 14.00 5.57 SD: 2.68 0.98 Σ X1 = 84 Σ X2 =39 Data OK for a parametric test? 1. Normally distributed? Here, too few scores to tell! 2. Interval or ratio? Ratio (time). 3. Homogeneity of variance? Yes - SD is roughly similar proportion of the mean (Levene's test p = .07).

Effects of food additives on children's activity levels: Group A: tartrazine: mean = 14.00. Group B: no tartrazine: mean = 5.57. top line of equation:

Effects of food additives on children's activity levels: Group A: tartrazine. Group B: no tartrazine. DV: time spent running around. bottom line of equation: Quicker ways of calculating these... = 36.00 =5.714

Effects of food additives on children's activity levels: Group A: tartrazine. Group B: no tartrazine. Two-tailed test (just predicting "an effect" of additives): t = 7.79 with 11 d.f., p < .0001

My obtained value of t (9) = -3 My obtained value of t (9) = -3.2, which is smaller than the critical value of 2.26 (because -3.2 is smaller than zero). Does this mean it is not significant? upper t-critical value 2.26 lower -2.26 0.025 Ignore the sign of t when using the table: whether t is positive or negative merely depends on which group you called X, and which you called Y.

When do you use a t-test, and when do you use a correlation? Depends on whether you have an experimental or correlational design: t-test: Two groups or conditions (two levels of one IV) Looking for differences between them on a single DV. Does alcohol affect driving performance? IV "alcohol dosage" (sober versus inebriated). DV number of crashes. Correlation: Two different DVs (alcohol consumption and number of crashes). Looking for a relationship between them - as alcohol consumption increases, so too should number of crashes.

When do you use Analysis of Variance? One-way independent measures ANOVA: Three or more groups of participants (three or more levels of one IV) Looking for differences between the means for these conditions on a single DV. Does alcohol affect driving performance? IV "alcohol dosage" (sober versus two pints versus four pints). DV number of crashes. Three or more conditions, all done by the same participants (three or more levels of one IV) DV number of crashes: each person produces three scores.