PSY 1950 Post-hoc and Planned Comparisons October 6, 2008.

Slides:



Advertisements
Similar presentations
Locating Variance: Post-Hoc Tests
Advertisements

Dr. AJIT SAHAI Director – Professor Biometrics JIPMER, Pondicherry
Lesson #24 Multiple Comparisons. When doing ANOVA, suppose we reject H 0 :  1 =  2 =  3 = … =  k Next, we want to know which means differ. This does.
Relationship Mining Correlation Mining Week 5 Video 1.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 18, 2012.
Session 3 ANOVA POST HOC Testing STAT 3130 Statistical Methods I.
C82MST Statistical Methods 2 - Lecture 4 1 Overview of Lecture Last Week Per comparison and familywise error Post hoc comparisons Testing the assumptions.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Analysis of Variance (ANOVA) Statistics for the Social Sciences Psychology 340 Spring 2010.
One-Way ANOVA Multiple Comparisons.
Locating Variance: Post-Hoc Tests Dr James Betts Developing Study Skills and Research Methods (HL20107)
Using Statistics in Research Psych 231: Research Methods in Psychology.
Lecture 14 – Thurs, Oct 23 Multiple Comparisons (Sections 6.3, 6.4). Next time: Simple linear regression (Sections )
Comparing Means.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Lecture 9: One Way ANOVA Between Subjects
Two Groups Too Many? Try Analysis of Variance (ANOVA)
Finals Schedule n Section 1: 9:00 AM Monday, May 15.
Analyses of K-Group Designs : Omnibus F & Pairwise Comparisons ANOVA for multiple condition designs Pairwise comparisons and RH Testing Alpha inflation.
Statistics for the Social Sciences Psychology 340 Spring 2005 Analysis of Variance (ANOVA)
Business 205. Review 2 Factor ANOVAs Excel 2 Independent T-tests ANOVA 2-Factor ANOVAs.
One-way Between Groups Analysis of Variance
K-group ANOVA & Pairwise Comparisons ANOVA for multiple condition designs Pairwise comparisons and RH Testing Alpha inflation & Correction LSD & HSD procedures.
Comparing Means.
Comparing Several Means: One-way ANOVA Lesson 14.
Today Concepts underlying inferential statistics
Using Statistics in Research Psych 231: Research Methods in Psychology.
If = 10 and = 0.05 per experiment = 0.5 Type I Error Rates I.Per Comparison II.Per Experiment (frequency) = error rate of any comparison = # of comparisons.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Analysis of Variance (ANOVA) Quantitative Methods in HPELS 440:210.
Comparisons among groups within ANOVA
Intermediate Applied Statistics STAT 460
1 Multiple Comparison Procedures Once we reject H 0 :   =   =...  c in favor of H 1 : NOT all  ’s are equal, we don’t yet know the way in which.
When we think only of sincerely helping all others, not ourselves,
ANOVA Greg C Elvers.
Comparing Means. Anova F-test can be used to determine whether the expected responses at the t levels of an experimental factor differ from each other.
One-Way Analysis of Variance Comparing means of more than 2 independent samples 1.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Comparing Several Means: One-way ANOVA Lesson 15.
Statistics 11 Confidence Interval Suppose you have a sample from a population You know the sample mean is an unbiased estimate of population mean Question:
Everyday is a new beginning in life. Every moment is a time for self vigilance.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Post Hoc Tests. What is a Post Hoc Test? Review: – Adjusting Alpha Level – Multiple A Priori Comparisons What makes a test Post Hoc? – Many tests could.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, October 15, 2013 Analysis of Variance (ANOVA)
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
STA MCP1 Multiple Comparisons: Example Study Objective: Test the effect of six varieties of wheat to a particular race of stem rust. Treatment:
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Comparisons among groups within ANOVA. Problem with one-way anova There are a couple issues regarding one-way Anova First, it doesn’t tell us what we.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: One-way ANOVA Marshall University Genomics Core.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Chapter 13 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 13: Multiple Comparisons Experimentwise Alpha (α EW ) –The probability.
Psy 230 Jeopardy Related Samples t-test ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Statistics for the Social Sciences Psychology 340 Spring 2009 Analysis of Variance (ANOVA)
Tests after a significant F
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Chapters Way Analysis of Variance - Completely Randomized Design.
ANalysis Of VAriance (ANOVA) Used for continuous outcomes with a nominal exposure with three or more categories (groups) Result of test is F statistic.
Statistics (cont.) Psych 231: Research Methods in Psychology.
1 השוואות מרובות מדדי טעות, עוצמה, רווחי סמך סימולטניים ד"ר מרינה בוגומולוב מבוסס על ההרצאות של פרופ' יואב בנימיני ופרופ' מלכה גורפיין.
10 December 2009 MRC CBU Graduate Statistics Lectures 10: Post-hoc tests, Multiple comparisons, Contrasts and Interactions1 MRC Cognition and Brain Sciences.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Дисперсионный анализ ANOVA
Inferential Statistics Psych 231: Research Methods in Psychology.
Posthoc Comparisons finding the differences. Statistical Significance What does a statistically significant F statistic, in a Oneway ANOVA, tell us? What.
Multiple Comparisons Q560: Experimental Methods in Cognitive Science Lecture 10.
Studentized Range Statistic
I. Statistical Tests: Why do we use them? What do they involve?
Presentation transcript:

PSY 1950 Post-hoc and Planned Comparisons October 6, 2008

Preamble Presentations Tutoring Problem 1e: If you decide to reject the null hypothesis, you know the probability that you are making the wrong decision Visual depiction of F-ratio

Subpopulations Cournot (1843): “...it is clear that nothing limits... the number of features according to which one can distribute [natural events or social facts] into several groups or distinct categories.” e.g., the chance of a male birth: –Legitimate vs. illegitimate –Birth order –Parent age –Parent profession –Parent health –Parent religion “… usually these attempts through which the experimenter passed don’t leave any traces; the public will only know the result that has been found worth pointing out; and as a consequence, someone unfamiliar with the attempts which have led to this result completely lacks a clear rule for deciding whether the result can or can not be attributed to chance.”

Large Surveys and Observational Studies Abundant data Limited a priori hypotheses e.g., Genome Superstruct Project (GSP) –Genetic testing –Cognitive testing –Structural brain imaging –Functional brain imaging

ANOVA One-way ANOVA –k(k-1)/2 possible pairwise comparisons –e.g., with 5 levels, 10 possible comparisons Factorial ANOVA –The issue above plus –Multiple possible main effects/interactions –e.g., with a 2 x 2 x 2, 7 possible effects

Families Set of hypotheses = Family Type I error rate for a set of hypotheses = Familywise error rate –e.g., across pairwise comparisons in one-way ANOVA If no mean differences exist, what is the chance of finding a significant one? –e.g., across main effects/interactions in factorial ANOVA If no main effects or interactions exist for a particular ANOVA, what is the chance of finding a significant one –e.g., whole experiment with multiple ANOVAs If no effects exist for the entire experiment, what is the chance of finding a significant one?

Family Size "If these inferences are unrelated in terms of their content or intended use (although they may be statistically dependent), then they should be treated separately and not jointly” –Hochberg and Tamhane (1987) e.g., suicide rates for 50 states, with 1225 possible pairwise comparisons –From a federal perspective, how big is the family? –How about from a state perspective?

Familywise  If family consists of two independent comparisons with  =.05, AND if both corresponding null hypotheses are true: –The probability of NOT making a Type I error on both tests is:.95 x.95 =.9025 –The probability of making one or more type I errors is: =.0975 If family consists of c independent comparisons with  =.05, AND if all corresponding null hypotheses are true: –The probability of NOT making a Type I error on all tests is: (1 -.05) c –The probability of making one or more Type I errors is: 1 - (1 -.05) c

A Priori vs. Post-hoc Comparisons A priori comparisons –Chosen before data collection –Limited, deliberate comparisons Post hoc (a posteriori) comparisons –Conducted after data collection –Exhaustive, exploratory comparisons

Significance of Overall F Prerequisite for some tests (e.g., Fisher’s LSD) Efficient test of overall null hypothesis Need MS within for many tests

A Priori Comparisons Single stage tests –Multiple t-tests –Linear contrasts –Bonferroni t (Dunn’s test) –Dunn-Sidak test Multistage tests –Bonferroni/Holm

Multiple t-tests Replace s 2 pooled with MS within Use df within

Linear Contrasts Compare more than one mean with another mean

Bonferroni t (Dunn’s Test) If c independent tests are performed  corrected =  / c p corrected = p x c Imprecise math –e.g., for p corrected =.05 with c = 21, p corrected  1.05 – p corrected = 1 - (1 -.05) c Bonferroni, C. E. (1936). Teoria statistica delle classi e calcolo delle probabilit. Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commerciali di Firenze, 8, Perneger, T.V. (1998). What is wrong with Bonferroni adjustments. BMJ,136,

Dunn-Sidak Test Identical to Bonferroni, except uses correct math Less conservative than Bonferroni –e.g., for p corrected =.05 with c = 10: p Bonferroni =.50 p Sidak =.40

Multistage Bonferroni (e.g., Holm) Calculate t for all c contrasts of interest Order results based on |t| |t 1 | > |t 2 | > |t 3 | Apply different Bonferroni corrections for  or p based on position in above sequence, stopping when t is insignificant –For t 1, c 1 = 3; if p 1 >.05/3, then… –For t 2, c 2 = 2; if p 2 >.05/2, then… –For t 1, c 1 = 3; use  =.05/1

Post-hoc Comparisons Fisher’s LSD Tukey’s test Newman-Keuls test The Ryan procedure (REGWQ) Scheffe’s test Dunnett’s test

Fisher’s LSD Test LSD = Least significant difference Two-stage process: –Conduct ANOVA If F is nonsignificant, stop If F is significant… –Make pairwise comparisons using Ensures familywise  =.05 for complete null Ensures familywise  =.05 for partial null when c = 3

Studenized Range Statistic (q) If M l and M s represent the largest and smallest means and r is the number of means in the set: Order means from smallest to largest Determine r, calculate q, lookup p

Tukey’s HSD Test Determines minimum difference between treatment means that is necessary for significance HSD = honestly significant difference

Scheffe Not for post-hoc pairwise comparisons Not for a priori comparisons Howell: “I can’t imagine when I would ever use it, but I have to include it here because it is such a standard test”

Newman-Keuls (S-N-K) Test Readjusts r based upon means tests Doesn’t control for familywise  =.05

Comparing Different Procedures

Which Test? One contrast –Simple: t-test –Complex: linear contrast Several contrasts –A priori: Multistage Bonferroni (e.g., Holm) –Post-hoc: Fisher’s LSD Many contrasts –Ryan REGRQ or Tukey Find critical values for different tests –with a control: Dunnett –planned: Bonferroni –not planned: Scheffé

Imaging Data 200,000 tests on 200,000 voxels 1000 false positives when  =.05 Bonferroni? –No, requires voxel independence