1 השוואות מרובות מדדי טעות, עוצמה, רווחי סמך סימולטניים ד"ר מרינה בוגומולוב מבוסס על ההרצאות של פרופ' יואב בנימיני ופרופ' מלכה גורפיין.

Slides:



Advertisements
Similar presentations
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Variance Chapter 16.
Advertisements

1 1 Slide Hypothesis Testing Chapter 9 BA Slide Hypothesis Testing The null hypothesis, denoted by H 0, is a tentative assumption about a population.
AP Statistics – Chapter 9 Test Review
Chapter 16: Inference in Practice STAT Connecting Chapter 16 to our Current Knowledge of Statistics ▸ Chapter 14 equipped you with the basic tools.
Chapter Seventeen HYPOTHESIS TESTING
Introduction to Hypothesis Testing
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Differentially expressed genes
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Lecture 14 – Thurs, Oct 23 Multiple Comparisons (Sections 6.3, 6.4). Next time: Simple linear regression (Sections )
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
PSY 1950 Post-hoc and Planned Comparisons October 6, 2008.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
7/2/2015Basics of Significance Testing1 Chapter 15 Tests of Significance: The Basics.
Chapter 9 Hypothesis Testing.
Lecture 14: Thur., Feb. 26 Multiple Comparisons (Sections ) Next class: Inferences about Linear Combinations of Group Means (Section 6.2).
5-3 Inference on the Means of Two Populations, Variances Unknown
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
False Discovery Rate (FDR) = proportion of false positive results out of all positive results (positive result = statistically significant result) Ladislav.
Introduction to Hypothesis Testing
Business Statistics - QBM117 Introduction to hypothesis testing.
Multiple testing in high- throughput biology Petter Mostad.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.2 Estimating Differences.
Intermediate Applied Statistics STAT 460
More About Significance Tests
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Overview Basics of Hypothesis Testing
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Multiple Testing in Microarray Data Analysis Mi-Ok Kim.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Back to basics – Probability, Conditional Probability and Independence Probability of an outcome in an experiment is the proportion of times that.
Large sample CI for μ Small sample CI for μ Large sample CI for p
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Multiple Testing Matthew Kowgier. Multiple Testing In statistics, the multiple comparisons/testing problem occurs when one considers a set of statistical.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Chapter 16: Inference in Practice STAT Connecting Chapter 16 to our Current Knowledge of Statistics ▸ Chapter 14 equipped you with the basic tools.
Chapter 8 1-Way Analysis of Variance - Completely Randomized Design.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
The False Discovery Rate A New Approach to the Multiple Comparisons Problem Thomas Nichols Department of Biostatistics University of Michigan.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Chapters Way Analysis of Variance - Completely Randomized Design.
Guidelines for Multiple Testing in Impact Evaluations Peter Z. Schochet June 2008.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Review Statistical inference and test of significance.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 -Hypothesis Testing
CONCEPTS OF HYPOTHESIS TESTING
Discrete Event Simulation - 4
1-Way Analysis of Variance - Completely Randomized Design
Sequence comparison: Multiple testing correction
Confidence Intervals.
Presentation transcript:

1 השוואות מרובות מדדי טעות, עוצמה, רווחי סמך סימולטניים ד"ר מרינה בוגומולוב מבוסס על ההרצאות של פרופ' יואב בנימיני ופרופ' מלכה גורפיין

2 Family: Any collection of inferences for which it is meaningful to take into account some combined measure of errors (Hochberg and Tamhane, 1987). Family: The smallest set of items of inference from which selection of results for presentation or highlighting was made (Prof. Yoav Benjamini). It follows from the definition that a family must consist of hypotheses related in terms of their intended use by the researcher, otherwise it would make no sense to select the hypotheses within that family. It is upon the researcher to decide what is (are) the set (sets) of related hypotheses from which the interesting finding/s may/will be selected. Once the decision is made, the interpretation of the results must be made accordingly. The definition of families should depend on the desired interpretation of the results. For a single data set, the families may be defined in various ways leading to different interpretation of the results, each of which is important form some point of view. Family of hypotheses

תזכורת ממצגת 1 … … … … … … … … Y. Benjamini and M.Bogomolov “Selective inference on multiple families hypotheses” JRSS-B

For a given family of hypotheses, let m be the number of hypotheses within the family. 4 Multiple comparison procedure (MCP): statistical test procedure designed to account for and properly control the multiplicity effect through some combined measure of erroneous inferences. HypothesesNot rejectedRejectedTotal TrueUV FalseTS TotalWR S, T, U, V-unobservable random variables; R,W-observable r.v. unknown. - - fixed numbers, Notations

Control of Familywise Error Rate The Familywise Error-Rate (in the weak sense): The Familywise Error-Rate (in the strong sense): For any configuration of true and false null hypotheses 5 Thus by assuring FWER ≤ , the probability of making even one type I error in the family, is controlled at level  Example: Fisher’s LSD procedure controls FWER in the weak sense, but does not control the FWER in the strong sense.

Error rates The Familywise Error-Rate For any configuration of true and false null hypotheses The Per Family Error-Rate For any configuration of hypotheses Expected number of type I errors The Per Comparison Error-Rate For any configuration of hypotheses Note: testing at (nominal) level  assures per comparison level is  ; amounts to “don’t worry – be happy” approach. *Other error-rates will be introduced later. 6

Comparison of error-rates The Familywise Error-Rate The Per Family Error-Rate The Per Comparison Error-Rate A multiple comparison procedure which controls the PFER also controls FWER and PCER, but not vice versa. 7

Power 8 Analogous to extending the type I error rate, power can be generalized in various ways when moving from single to multiple hypotheses test problems. The individual power: the rejection probability for a false hypothesis Average power: the expected number of correct rejections among all false null hypotheses The disjunctive power: the probability of rejecting at least one false null hypothesis. The conjunctive power: the probability of rejecting all false null hypotheses.

Bonferroni procedure If we test each hypothesis separately at level So to assure we may use Bonferroni multiple testing procedure: test each hypothesis separately at level Bonferroni procedure assures  control of PFER and FWER). –Is any condition needed? Example: GWAS, common threshold for significance: 9 Bonferroni inequality

Sidak’s procedure If the test statistics are independent, and we test each hypothesis separately at level To assure we may use Sidak’s multiple testing procedure: test each hypothesis separately at level Sidak’s procedure controls FWER at level when the test statistics are independent. Note: if Sidak’s procedure assures 10

Comparison between Bonferroni and Sidak’s procedures Idea: use dependency structure to get a better test. How much better? Very little gain even for small m: 11 m

Adjusted p-value 12 One hypothesis: p-value is the smallest significance level for which the hypothesis can be rejected. Family of hypotheses: an adjusted p-value is defined as the smallest error rate level for which one rejects the hypothesis given a particular multiple comparison procedure. For FWER, if such an exists, and otherwise. If the null hypothesis can be rejected while controlling the FWER at level The marginal p-value for : denoted as the unadjusted p-value.

Bonferroni procedure and Sidak’s procedure: Adjusted p-value 13 Bonferroni procedure: test each hypothesis separately at level i.e. reject if Adjusted p-value for : Sidak’s procedure: test each hypothesis separately at level at level i.e. reject if Adjusted p-values: Homework. Computation of Bonferroni adjusted p-values using R: >p=c(0.01,0.015,0.005) >p.adjust(p, method="bonferroni") [1]

Example: Behavioral genetics 14 Study the genetics of behavioral traits: Hearing, sight, smell, alcoholism, locomotion, fear, exploratory behavior Compare inbred strains, crosses, knockouts…

15 Significance of strain differences Behavioral Endpoint P-value Prop. Lingering Time # Progression segments Median Turn Radius (scaled) Time away from wall Distance traveled Acceleration # Excursions Time to half max speed Max speed wall segments Median Turn rate Spatial spread Lingering mean speed Homebase occupancy # stops per excursion Stop diversity Length of progression segments Activity decrease Bonferroni.05/17=.0029 Unadjusted

Multiplicity problem for confidence intervals 16

Multiplicity problem for confidence intervals 17 Parameters: The observations are X (possibly X=(X 1,…,X m )). is a 1-  confidence interval (nominally/marginally) for parameter if Recall: Testing vs by rejecting H 0i iff the 1-  CI does not include, is an  -level test Therefore, same multiplicity problem for CIs as for testing

Multiplicity problem for confidence intervals 18 A set of confidence intervals are simultaneous confidence intervals at level for if Or Testing vs by rejecting iff where are simultaneous confidence intervals for at level is an FWER- controlling procedure at level

Simultaneous confidence intervals: examples 19 Bonferroni confidence intervals: marginal confidence intervals at nominal level. Show that Bonferroni confidence intervals are simultaneous confidence intervals at confidence level of at least 1-  Construct simultaneous confidence intervals that are equivalent to Sidak’s procedure (in the sense given in slide18). For each type of adjustments: Are there assumptions needed? If yes, what are they?