Adapted from: Wulff HR, Andersen B, Brandenhoff P, Guttler F (1987): What do doctors know about statistics? Statistics in Medicine 6:3-10 Suppose we conduct.

Slides:



Advertisements
Similar presentations
Inferential Statistics
Advertisements

Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 2. Hypothesis Testing.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Response – variable of interest; variable you collect - #Fish, %Coral cover, temperature, salinity, etc Factor – variable by which response is divided;
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
T-Tests Lecture: Nov. 6, 2002.
Chapter 9 Hypothesis Testing.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
On Comparing Classifiers: Pitfalls to Avoid and Recommended Approach Published by Steven L. Salzberg Presented by Prakash Tilwani MACS 598 April 25 th.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Chapter 8 Introduction to Hypothesis Testing
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Basic Statistics. Basics Of Measurement Sampling Distribution of the Mean: The set of all possible means of samples of a given size taken from a population.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Essential Statistics in Biology: Getting the Numbers Right
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Chapter 8 McGrew Elements of Inferential Statistics Dave Muenkel Geog 3000.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
From the Data at Hand to the World at Large
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Hypothesis Testing State the hypotheses. Formulate an analysis plan. Analyze sample data. Interpret the results.
PY 603 – Advanced Statistics II TR 12:30-1:45pm 232 Gordon Palmer Hall Jamie DeCoster.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Analyzing Statistical Inferences July 30, Inferential Statistics? When? When you infer from a sample to a population Generalize sample results to.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
_ z = X -  XX - Wow! We can use the z-distribution to test a hypothesis.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Course Overview Collecting Data Exploring Data Probability Intro. Inference Comparing Variables Relationships between Variables Means/Variances Proportions.
T tests comparing two means t tests comparing two means.
Chapter 9: Hypothesis Tests for One Population Mean 9.2 Terms, Errors, and Hypotheses.
Lec. 19 – Hypothesis Testing: The Null and Types of Error.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Chapter 9 Introduction to the t Statistic
Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Tests: One Sample
Hypothesis Testing: Hypotheses
CONCEPTS OF HYPOTHESIS TESTING
Chapter 9 Hypothesis Testing.
Chapter Review Problems
Hypotheses Hypothesis Testing
Section 12.2: Tests about a Population Proportion
Hypothesis Tests for Proportions
Chapter 10 Analyzing the Association Between Categorical Variables
More About Tests Notes from
Presentation transcript:

Adapted from: Wulff HR, Andersen B, Brandenhoff P, Guttler F (1987): What do doctors know about statistics? Statistics in Medicine 6:3-10 Suppose we conduct a t-test of the difference between two means and obtain a p-value <.05. Does this mean: a)There is less than a 5% chance that the results are due to chance. b)If there really is no difference between the population means, there is less than a 5% chance of obtaining a difference this large or larger. c)There is a 95% chance that if the study is repeated, the result will be replicated. d)There is a 95% chance that there is a real difference between the two population means.

What is a p-value? The probability of obtaining a test statistic (data) that departs as much as or more than the observed test statistic (data) if the null hypothesis were true.

Which Null Hypotheses are Meaningful and Testable? Those that precisely specify a probability model for the data.

A Perspective Samples Populations  We study:  We wish to obtain knowledge about: Data Nature

Gene Family-Based Hypothesis Testing Sketch of Typical (outmoded and inappropriate) Approach: 1.For Genes 1 to K, define a vector, R, of length K that contains the values of a categorical variable denoting group membership. 2.For Genes 1 to K, define a vector, C, of length K that contains the values of a binary variable denoting whether or not the gene was ‘significant’ or ‘interesting’ by some standard. 3.Conduct some frequentist significance test for an association between R and C.

The Independence Issue: A Real Example

Gene Family-Based Hypothesis Testing Which Null Hypothesis is Being Tested? 1.None of the genes in family c are differentially expressed (associated, methylated, etc.). 2.The proportion of genes in family c that are differentially expressed is equal to the proportion of genes in the remainder of the genome that are differentially expressed (beware of ‘anti-Bayesian’ element). 3.The proportion of genes in family c that are differentially expressed to an extent greater than  is equal to the proportion of genes in the remainder of the genome that are differentially expressed. Note: These can all be subsumed under the general: H 0 :

 Union-Intersection The compound hypothesis is rejected if any one of the individual hypotheses are rejected Multiplicity adjustment procedure is required to control type I error rate The rejection region for this test is the union of rejection regions corresponding to the individual tests  Intersection-Union The compound hypothesis is rejected only if all of the individual hypotheses are rejected Overall type I error rate of α is maintained without multiplicity adjustment The rejection region for this test is the intersection of the rejection regions corresponding to the individual tests Union-Intersection vs Intersection-Union Tests Methods not yet well established. Bayesian methods involving posterior probabilities in place of p-values may be especially useful. When P << N, methods are well established (e.g., multiple regression. When P >> N optimal methods are not yet clear.

 Normality?  Exchangeability?  Independence?  Other? What assumptions are being made? Non-Parametric: Non-Panacea (Cohen, J.) Asymptotic  Exact

Major Issues to Ask About in Selecting a Method for Gene Family or Pathway Testing ► What is the null? ► Does the method assume that all components (e.g., SNPs or gene expression levels) are independent? ► Is the method ‘anti-Bayesian’? ► Does the method use the continuity of information (not simply significant or not)?