Group Comparisons What is the probability that group mean differences occurred by chance? With Excel (or any statistics program) we computed p values to.

Slides:



Advertisements
Similar presentations
Statistical vs. Practical Significance
Advertisements

INFERENTIAL STATISTICS. Descriptive statistics is used simply to describe what's going on in the data. Inferential statistics helps us reach conclusions.
Smith/Davis (c) 2005 Prentice Hall Chapter Thirteen Inferential Tests of Significance II: Analyzing and Interpreting Experiments with More than Two Groups.
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Intro to Statistics for the Behavioral Sciences PSYC 1900
Lecture 9: One Way ANOVA Between Subjects
Comparing Several Means: One-way ANOVA Lesson 14.
The t Tests Independent Samples.
Introduction to Analysis of Variance (ANOVA)
Analysis of Variance (ANOVA) Quantitative Methods in HPELS 440:210.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
The basic idea So far, we have been comparing two samples
Repeated Measures ANOVA
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Comparing Several Means: One-way ANOVA Lesson 15.
ANOVA. Independent ANOVA Scores vary – why? Total variability can be divided up into 2 parts 1) Between treatments 2) Within treatments.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
ANOVA: Analysis of Variance.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Tests after a significant F
Simple ANOVA Comparing the Means of Three or More Groups Chapter 9.
Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.Introduction To Basic Ratios 3.Basic Ratios In Excel 4.Cumulative.
Chapter 13 Understanding research results: statistical inference.
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Inferential Statistics Psych 231: Research Methods in Psychology.
Chapter 12 Introduction to Analysis of Variance
Psych 200 Methods & Analysis
Dependent-Samples t-Test
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 20th February 2014  
Comparing Group Means.
INF397C Introduction to Research in Information Studies Spring, Day 12
Research Design and Analysis
Inferential Statistics
Hypothesis testing using contrasts
Single-Variable, Independent-Groups Designs
Post Hoc Tests on One-Way ANOVA
12 Inferential Analysis.
Internal Validity – Control through
Inferential Statistics
2 independent Groups Graziano & Raulin (1997).
Kin 304 Inferential Statistics
Analysis based on normal distributions
Last class Tutorial 1 Census Overview
Analysis of Variance (ANOVA)
Comparing Populations
I. Statistical Tests: Why do we use them? What do they involve?
Analysis of Variance (ANOVA)
Psych 231: Research Methods in Psychology
12 Inferential Analysis.
One way ANOVA One way Analysis of Variance (ANOVA) is used to test the significance difference of mean of one dependent variable across more than two.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Chapter 9 Introduction to the Analysis of Variance
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
ANOVA Between-Subject Design: A conceptual approach
Psych 231: Research Methods in Psychology
Chapter 10 Introduction to the Analysis of Variance
Psych 231: Research Methods in Psychology
Experiments with More Than Two Groups
InferentIal StatIstIcs
Presentation transcript:

Group Comparisons What is the probability that group mean differences occurred by chance? With Excel (or any statistics program) we computed p values to figure this out. Then we focused on things to pay attention to while making these comparisons.

Confounding Variations of Group Mean Comparisons Anticipating the direction of the change in mean scores (1 or 2-tailed tests) The similarity of the samples (random error) Random selection (sampling) Sample size (practical significance) Multiple simultaneous t-tests (ANOVA)

Research Design and Analysis

Research Design Groups by Treatment (Independent Variable) Data Gathering Events (Dependent Variable)

Did direct instruction improve students’ ability to recall math facts? Independent ⇓ Data Gathering Pre-Test Post-Test DI to 4th Grade Class group data t-test, paired if possible

Do students who receive DI achieve better than those who do not? Independent ⇓ Data Gathering Test DI to 4th Grade Class group data Non-DI to different 4th Grade Class t-test, independent samples

Do students who receive DI for math facts retain learning over the summer? Independent ⇓ Data Gathering Pre-Test Post-Test Post-Post-Test DI to 4th Grade Class group data Repeated Measures

Which instructional strategy works better for teaching math facts? Independent ⇓ Data Gathering Test DI group data Cooperative group data group data Inquiry Single Factor

Multiple Tests on the Same Data The last two examples had 3 groups to be compared. An unfortunate thing happens when multiple tests are done on the same data.

Multiple Group Comparisons Number of Groups to be Compared in the Study 2 3 4 5 6 Possible Group Comparisons A-B A-C B-C A-D B-D C-D A-E B-E C-E D-E A-B C-D A-C C-E A-D C-F A-E D-E A-F D-F B-C E-F B-F Total Possible Comparisons 1 10 15

Being Wrong Test Group Mean 5% (.05) We say that occurring randomly less than 5% of the time is really unlikely so it is not random. But, that statement would be wrong 5% of the time. Type 1 Error: Saying it is not random when it was.

Cumulative Error If we test a data set for statistically significant differences we know we have the potential to be wrong 5% of the time. (Type I error) If we test the same data set again the potential for a Type I error increases because we have given ourselves more opportunity to be wrong. Type I error accumulates with multiple assessments of the same data.

Multiple Group Comparisons Number of Groups to be Compared in the Study 2 3 4 5 6 Possible Group Comparisons A-B A-C B-C A-D B-D C-D A-E B-E C-E D-E A-B C-D A-C C-E A-D C-F A-E D-E A-F D-F B-C E-F B-F Total Possible Comparisons 1 10 15 Maximum Probability of a Type I Error .05 .15 .30 .50 .75

Cumulative Error Very quickly the probability of a Type I error gets unacceptably large. One way to solve this is to reduce the alpha level because that reduces the probability of a Type I error for any given comparison. Accumulation of error would be slower. But, you would also reduce the probability of discovering statistically significant differences.

The Way Out of This Problem Remember that SD is the average distance of all of the scores from the mean? We got to it by adding up the squared differences of each score. The average of the squared differences is called the variance of the sample. The SD is the square root of the variance so they are closely related.

Variance Variance: The summary of the group is the mean. The distance that everyone is from the mean happened for unknown reasons. Variance is error in the model. If everyone had the same score there would be no error in the measurement.

Comparison of Three Sets of Scores Within group variance (error within groups) Figure out what the total variance is inside of all of the distributions. The is called within group variance and it represents random error in the model.

Comparing Variance Total variance Within group variance (error within groups) Then take all of the scores together and make a new distribution with its own mean and standard error. This represents all of the random error plus the variance from the differences in the groups. The differences in the groups is there because of the independent variable—your intervention.

Comparing Variance Between group variance (impact of the intervention) Subtract the within group variance from the total variance and you are left with the between group variance—how much of the variance is because of just the differences among the groups.

Comparing Variance Between group variance (impact of the intervention) What is left is the variance caused by the intervention.

The F Test Divide the between group variance (impact of the intervention) by the within group variance (random error). The result is compared to a sampling distribution of F to figure out the probability that your result is likely to happen by chance.

Analysis of Variance Simplified, what the ANOVA does is look at a ratio Between Group Variance (impact of the independent variable) Within Group Variance (random error)

F Distributions What is the probability that the f-test ratio could have occurred by chance?

ANOVA Is the variance of these mean scores likely to have occurred by chance? The problem is that if we know they did not appear by chance then which means are significantly different? The analysis only tells us there is a difference—not where.

Multiple Tests Simultaneously When multiple (more than two) groups are to be compared on the same measure it is not appropriate to test each pair separately. The comparisons are not independent. (Type I error) Analysis of Variance ANOVA An ANOVA only tells if significant differences exist between at least two groups. It does not tell which group pairs are significant. A post hoc analysis is necessary to figure out which group differences are significant.

Download OWM Data from the site

ANOVA in EZAnalyze Single factor compares different groups on a single measure. Repeated measures compares a single group on multiple uses of a single measure.

Single Factor ANOVA A more conservative analysis because of the amount of error represented by the different groups. For EZAnalyze there must be one variable that shows which group each individual is in (the independent variable). Then there must be one variable the shows the measure for each individual (the dependent variable)

Repeated Measures ANOVA A less conservative analysis because it is the same group every time. For EZAnalyze there must be one variable for each iteration of the measure for each individual (the dependent variable) There is no need for a grouping variable because it is the same group in all of the iterations of the measure.

Significance of the whole ANOVA

ANOVA Post Hoc Tests Use a Tukey’s HSD (honest significant difference) to compute multiple mean differences. Accurate with groups of equal size. Conservative with unequal variance.

Post hoc test results

ANOVA Practical Significance ANOVAs have the same problem as comparison of two group means. Large sample sizes make significance appear more easily. Practical significance is measure by asking: How much of the total variance is represented by the between group variance? How much of the variance is explained by the intervention?

Eta Squared (η2) Eta Squared is a ratio: Total variance Between group variance (impact of the intervention) Random error Eta Squared is a ratio: The larger the effect, the closer eta squared is to one. Effect Variance (impact of the intervention) Total Variance (impact of the intervention plus random error)

Interpretation: .01 small .06 medium .14 large .50 very large

Cohen’s d is better with two group comparisons

APA ANOVA Table

Descriptive Paragraph Pretest, posttest, and 2 month delayed assessment scores for 38 students were compared using repeated measures ANOVA analysis (Table 1). Post and delayed scores were statistically significantly higher than pretest scores (p < .001). No statistically significant difference appeared between posttest and delayed scores.

Things to remember… You have to figure out which t-test to use by judging the similarity of the groups. Decide if your comparison should be one-tailed or two. If you are comparing more than two groups simultaneously you have to use an ANOVA not a t-test. Compute effect sizes, particularly if the groups are large. Be random when you can.

ANOVA Practice Download the Reading Attitude data from the Excel tab. Determine the correct analysis and do it. Build an APA table. Write a descriptive paragraph for the table. Send me a Word document with the table and the paragraph.