Comparing Two Means Prof. Andy Field.

Slides:



Advertisements
Similar presentations
Comparing Two Means Dr. Andy Field.
Advertisements

Comparing Two Means.
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Hypothesis Testing Steps in Hypothesis Testing:
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Quantitative Data Analysis: Hypothesis Testing
Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
Objectives (BPS chapter 24)
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
T-Tests.
t-Tests Overview of t-Tests How a t-Test Works How a t-Test Works Single-Sample t Single-Sample t Independent Samples t Independent Samples t Paired.
T-Tests.
 Once you know the correlation coefficient for your sample, you might want to determine whether this correlation occurred by chance.  Or does the relationship.
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
T-tests Part 2 PS1006 Lecture 3
Comparing Means: Independent-samples t-test Lesson 14 Population APopulation B Sample 1Sample 2 OR.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
Lecture 9: One Way ANOVA Between Subjects
Comparing Means: Independent-samples t-test Lesson 13 Population APopulation B Sample 1Sample 2 OR.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Today Concepts underlying inferential statistics
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Between Group & Within Subjects Designs Mann-Whitney Test.
5-3 Inference on the Means of Two Populations, Variances Unknown
Simple Linear Regression Analysis
Relationships Among Variables
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Active Learning Lecture Slides
Chapter 9 Two-Sample Tests Part II: Introduction to Hypothesis Testing Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social & Behavioral.
AM Recitation 2/10/11.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Lecture 14 Testing a Hypothesis about Two Independent Means.
Basic Statistics. Basics Of Measurement Sampling Distribution of the Mean: The set of all possible means of samples of a given size taken from a population.
T tests comparing two means t tests comparing two means.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Hypothesis Testing Using the Two-Sample t-Test
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Intro: “BASIC” STATS CPSY 501 Advanced stats requires successful completion of a first course in psych stats (a grade of C+ or above) as a prerequisite.
Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.
CHAPTER 7 COMPARING TWO MEANS Difference between two groups.
Experimental Design and Statistics. Scientific Method
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Chapter 10 The t Test for Two Independent Samples
Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.
T tests comparing two means t tests comparing two means.
Comparing Two Means: One-sample & Paired-sample t-tests Lesson 13.
Nonparametric Tests with Ordinal Data Chapter 18.
T tests comparing two means t tests comparing two means.
Chapter 13 Understanding research results: statistical inference.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
Inferential Statistics Assoc. Prof. Dr. Şehnaz Şahinkarakaş.
Chapter 9 Introduction to the t Statistic
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Inferential Statistics
Comparing Two Means Prof. Andy Field.
Comparing Two Means t-tests: Rationale for the tests t-tests as a GLM
Comparing several means: ANOVA (GLM 1)
What are their purposes? What kinds?
InferentIal StatIstIcs
Type I and Type II Errors
Presentation transcript:

Comparing Two Means Prof. Andy Field

Aims T-tests Rationale for the tests Interpretation Reporting results Dependent (aka paired, matched) Independent Rationale for the tests Assumptions Interpretation Reporting results Calculating an Effect Size Categorical predictors in the linear model.

Experiments The simplest form of experiment that can be done is one with only one independent variable that is manipulated in only two ways and only one outcome is measured. More often than not the manipulation of the independent variable involves having an experimental condition and a control. E.g., Is the movie Scream 2 scarier than the original Scream? We could measure heart rates (which indicate anxiety) during both films and compare them. This situation can be analysed with a t-test

T-test Dependent t-test Independent t-test Significance testing Compares two means based on related data. E.g., Data from the same people measured at different times. Data from ‘matched’ samples. Independent t-test Compares two means based on independent data E.g., data from different groups of people Significance testing Testing the significance of Pearson’s correlation coefficient Testing the significance of b in regression.

Rationale to Experiments Group 1 Group 2 Lecturing Skills Variance created by our manipulation Removal of brain (systematic variance) Variance created by unknown factors E.g. Differences in ability (unsystematic variance) Slide 5

Rational for the t-test Two samples of data are collected and the sample means calculated. These means might differ by either a little or a lot. If the samples come from the same population, then we expect their means to be roughly equal. Although it is possible for their means to differ by chance alone, we would expect large differences between sample means to occur very infrequently. We compare the difference between the sample means that we collected to the difference between the sample means that we would expect to obtain if there were no effect (i.e. if the null hypothesis were true). We use the standard error as a gauge of the variability between sample means. If the difference between the samples we have collected is larger than what we would expect based on the standard error then we can assume one of two: There is no effect and sample means in our population fluctuate a lot and we have, by chance, collected two samples that are atypical of the population from which they came. The two samples come from different populations but are typical of their respective parent population. In this scenario, the difference between samples represents a genuine difference between the samples (and so the null hypothesis is incorrect). As the observed difference between the sample means gets larger, the more confident we become that the second explanation is correct (i.e. that the null hypothesis should be rejected). If the null hypothesis is incorrect, then we gain confidence that the two sample means differ because of the different experimental manipulation imposed on each sample.

Rationale to the t-test = observed difference between sample means − expected difference between population means (if null hypothesis is true) estimate of the standard error of the difference between two sample means

The Independent t-test

The Dependent t-test

Assumptions of the t-test Both the independent t-test and the dependent t-test are parametric tests based on the normal distribution. Therefore, they assume: The sampling distribution is normally distributed. In the dependent t­-test this means that the sampling distribution of the differences between scores should be normal, not the scores themselves. Data are measured at least at the interval level. The independent t-test, because it is used to test different groups of people, also assumes: Variances in these populations are roughly equal (homogeneity of variance). Scores in different treatment conditions are independent (because they come from different people).

Independent t-test Example Are invisible people mischievous? 24 Participants Manipulation Placed participants in an enclosed community riddled with hidden cameras. 12 participants were given an invisibility cloak. 12 participants were not given an invisibility cloak. Outcome measured how many mischievous acts participants performed in a week.

Independent t-test using SPSS

Independent t-test Output

Bootstrapping Output

Calculating the Effect Size

Reporting the independent t-test On average, participants given a cloak of invisibility engaged in more acts of mischief (M = 5, SE = 0.48), than those not given a cloak (M = 3.75, SE = 0.55). This difference, 1.25, BCa 95% CI [2.606, 0.043], was not significant t(22) = −1.71, p = .101; however, it did represent a medium-sized effect d = .65.

Matched-samples t-test Example Are invisible people mischievous? 24 Participants Manipulation Placed participants in an enclosed community riddled with hidden cameras. For first week participants normal behaviour was observed. For the second week, participants were given an invisibility cloak. Outcome measured how many mischievous acts participants performed in week 1 and week 2.

Paired- samples t-test Output

Paired- samples t-test Output Continued

Calculating an Effect Size

Reporting the paired-samples t-test On average, participants given a cloak of invisibility engaged in more acts of mischief (M = 5, SE = 0.48), than those not given a cloak (M = 3.75, SE = 0.55). This difference, 1.25, BCa 95% CI [−1.67, −0.83], was significant t(11) = −3.80, p = .003 and represented a medium-sized effect d = .65.

Categorical predictors in the linear model

Cloak group The group variable = 0 Intercept = mean of baseline group

No Cloak Group The group variable = 1 b1 = Difference between means

Output from a Regression

When Assumptions are Broken Dependent t-test Mann-Whitney Test Wilcoxon rank-sum test Independent t-test Wilcoxon Signed-Rank Test Robust Tests: Bootstrapping Trimmed means