An example of using BFDP for identifying noteworthy associations Joan Dong.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Is it statistically significant?
Decision Errors and Power
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
AP Statistics – Chapter 9 Test Review
Hypothesis testing Week 10 Lecture 2.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Stat 301 – Day 21 Adjusted Wald Intervals Power. Last Time – Confidence Interval for  When goal is to estimate the value of the population proportion.
Hypothesis : Statement about a parameter Hypothesis testing : decision making procedure about the hypothesis Null hypothesis : the main hypothesis H 0.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Hypothesis Testing for the Mean and Variance of a Population Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
On Comparing Classifiers: Pitfalls to Avoid and Recommended Approach Published by Steven L. Salzberg Presented by Prakash Tilwani MACS 598 April 25 th.
Determining Statistical Significance
Descriptive Statistics
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical hypothesis testing – Inferential statistics I.
False Discovery Rate (FDR) = proportion of false positive results out of all positive results (positive result = statistically significant result) Ladislav.
Experimental Statistics - week 2
HYPOTHESIS TESTING Dr. Aidah Abu Elsoud Alkaissi
Hypothesis Testing.
8 - 1 © 2003 Pearson Prentice Hall Chi-Square (  2 ) Test of Variance.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Overview Basics of Hypothesis Testing
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Significance Toolbox 1) Identify the population of interest (What is the topic of discussion?) and parameter (mean, standard deviation, probability) you.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
1 Statistics 300: Elementary Statistics Section 8-2.
AP STATISTICS LESSON 10 – 4 ( DAY 1 ) INFERENCE AS DECISION.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
IMST 2008 / FIM XVI Decision theoretic Bayesian hypothesis testing with focus on skewed alternatives By Naveen K. Bansal Ru Sheng Marquette University.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
May 2004 Prof. Himayatullah 1 Basic Econometrics Chapter 5: TWO-VARIABLE REGRESSION: Interval Estimation and Hypothesis Testing.
Multiple Testing Matthew Kowgier. Multiple Testing In statistics, the multiple comparisons/testing problem occurs when one considers a set of statistical.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
Power and Sample Size Anquan Zhang presents For Measurement and Statistics Club.
Math 4030 – 9a Introduction to Hypothesis Testing
Statistical Significance Victor I. Piercey February 9, 2010.
Inference as Design Target Goal: I can calculate and interpret a type I and type II error. 9.1c h.w: pg 547: 15, 19, 21.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Chapter Ten McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. One-Sample Tests of Hypothesis.
Chapter 9: Hypothesis Tests for One Population Mean 9.2 Terms, Errors, and Hypotheses.
Hypothesis Testing.  Hypothesis is a claim or statement about a property of a population.  Hypothesis Testing is to test the claim or statement  Example.
ANalysis Of VAriance (ANOVA) Used for continuous outcomes with a nominal exposure with three or more categories (groups) Result of test is F statistic.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Hypothesis Testing Steps : 1. Review Data : –Sample size. –Type of data. –Measurement of data. –The parameter ( ,  2,P) you want to test. 2. Assumption.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
More about tests and intervals CHAPTER 21. Do not state your claim as the null hypothesis, instead make what you’re trying to prove the alternative. The.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.
Hypothesis Testing Chapter Hypothesis Testing  Developing Null and Alternative Hypotheses  Type I and Type II Errors  One-Tailed Tests About.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Estimating the False Discovery Rate in Genome-wide Studies BMI/CS 576 Colin Dewey Fall 2008.
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
Chapter 21 More About Tests.
P-value Approach for Test Conclusion
Chapter 9: Hypothesis Testing
Chapter 9: Hypothesis Tests Based on a Single Sample
Statistics 300: Elementary Statistics
Statistics Chapter 10 Section 4.
Jon Wakefield  The American Journal of Human Genetics 
Inference as Decision Section 10.4.
Calculating t X -  t = s x X1 – X2 t = s x1 – x2 s d One sample test
Statistical Power.
Presentation transcript:

An example of using BFDP for identifying noteworthy associations Joan Dong

Recent technological advances allow the huge numbers of genetic information for analysis. Produces new challenges in terms of statistical analysis and interpretation. Some reports of associations between genetic variants and common cancer sites and other complex diseases are false positives. A major reason for this unfortunate situation is the strategy of declaring statistical significance based on a P value alone.

The P-value is by far the most commonly used measure, but requires careful calibration when the a priori probability of an association is small.

Standard statistical decision making α=0.05 is universal Analysis – Prior probability not considered formally – Loss from bad decisions not considered – probability that positive report is a false positive is not considered

A false discovery is the result of the Type I error of rejecting the null hypothesis stating that the marker has no effects, when in fact it has. To prevent false discoveries, one can propose stringent criteria (critical P-values, independent replications, etc).

Controlling the false discovery rate for multiple testing: - Adjusting for multiple testing is typically based on the number of the tests that are performed (the Bonferroni correction). A disadvantage of these kind of family-wise correction is that when many tests are performed, the critical P value becomes very small, resulting in a low statistical power to detect the true effects.

This presentation will introduce the BFDP method proposed by Jon Wakefield and show how to assess the BFDP and how to use it to decide whether a finding is deserving of attention or noteworthy. BFDP – Bayesian False Discovery Probability

Bayes’s theorem gives the probability of the hypothesis H0 given data y as Pr (H0 |y,H0 ∪ H1)=p(y|H0)Pr (H0|H0 ∪ H1)/p(y|H0 ∪ H1 ) Where p(y|H0 ∪ H1 )=p(y|H0 )Pr (H0 |H0 ∪ H1 )+p(y|H1 )Pr (H1 |H0 ∪ H1 ) Then Pr (H0|y)=p(y|H0)pi0/p(y|H0 )pi0 + p(y|H1 )(1-pi0 ) Pi0=Pr(H0) is the prior on the null Pr(H0|y)=BF*PO/BF*PO+1 Where BF=p(y|H0)/p(y|H1) - Bayes factor PO=pi0/1-pi0 Use ABF (approximate Bayes factor) for BF ABF=p(theta|H0 )/p(theta|H1) BFDP=ABF*PO/ABF*PO+1

Example of how to us the BFDP program

SAS macro was developed for calculating BFDP. %macro cal_BFDP(dataout,datain,pi1,pi2,pi3,pi4,rcost,upodd); data &dataout; set &datain; rcost=&rcost;*ratio of costs; BFDPt=rcost/(1+rcost);*BFDP threshold; pi1=&pi1;pi2=&pi2;pi3=&pi3;pi4=&pi4;*prior; upOdd=&upodd; w=round((log(upOdd)/1.96)**2,0.001); r=w/(w+se**2); z=log(or)/se; ABF=exp(-Z*Z*r*0.5)/sqrt(1-r); priorOdds1=(1-&pi1)/&pi1; priorOdds2=(1-&pi2)/&pi2; priorOdds3=(1-&pi3)/&pi3;priorOdds4=(1-&pi4)/&pi4; BFDP1=ABF*priorOdds1/(ABF*priorOdds1+1); BFDP2=ABF*priorOdds2/(ABF*priorOdds2+1); BFDP3=ABF*priorOdds3/(ABF*priorOdds3+1); BFDP4=ABF*priorOdds4/(ABF*priorOdds4+1); Run; %mend;

An example – Four levels of prior probability for SNP analysis have been suggested: (i) prior probability 0.01 (ii) prior probability 0.03 (iii) prior probability 0.05 (iv) prior probability The upper OR was set at 3, and the selected levels of noteworthiness for BFDP at 0.8, as recommended in the original papers.

Results

Conclusion BFDP can be used for reducing the number of false positive discoveries.

REFERENCES Wakefield, J. (2007). A Bayesian measure of the probability of false discovery in genetic epidemiology studies. American Journal of Human Genetics, 81,

Thank You !