Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Hypothesis Testing And the Central Limit Theorem.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Inference Sampling distributions Hypothesis testing.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Confidence Intervals © Scott Evans, Ph.D..
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Statistical Inference: Hypothesis Testing
Chapter Sampling Distributions and Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview Definition Hypothesis
Introduction to Biostatistics and Bioinformatics
Estimation and Hypothesis Testing Now the real fun begins.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Hypothesis Testing.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
Statistical Inference Decision Making (Hypothesis Testing) Decision Making (Hypothesis Testing) A formal method for decision making in the presence of.
Chapter 8 Introduction to Hypothesis Testing
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Significance testing and confidence intervals Col Naila Azam.
Chapter 20 Testing hypotheses about proportions
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Statistical Inference: Hypothesis Testing.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
No criminal on the run The concept of test of significance FETP India.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Hypothesis Testing – A Primer. Null and Alternative Hypotheses in Inferential Statistics Null hypothesis: The default position that there is no relationship.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
CHAPTER 9 Testing a Claim
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
9.3/9.4 Hypothesis tests concerning a population mean when  is known- Goals Be able to state the test statistic. Be able to define, interpret and calculate.
Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.
© Copyright McGraw-Hill 2004
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
One Sample Inf-1 In statistical testing, we use deductive reasoning to specify what should happen if the conjecture or null hypothesis is true. A study.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
15 Inferential Statistics.
Type I and Type II Errors
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Hypothesis Testing And the Central Limit Theorem

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.2 Central Limit Theorem A very important result in statistics that permits use of the normal distribution for making inferences (hypothesis testing and estimation) concerning the population mean

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.3 Central Limit Theorem  If a variable X (with any distribution) has a population mean  and standard deviation , then: the distribution of sample means (from samples of size n taken from the population), has the following distribution as n tends to infinity:

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.4 Central Limit Theorem  Insert CLT.pdf

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.5 Central Limit Theorem  Variable X, population mean=100, SD=15  Samples of size 25 (for example)  Sample 1, mean=90  Sample 2, mean=115  Sample 3, mean=101  Sample 4, mean=94 .

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.6 Central Limit Theorem  Plot sample means (histogram), then:  The sample means have mean 100  The sample means have a SD of 15/5=3  The distribution of sample means would tend to be normal as n gets large.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.7 Central Limit Theorem  Thus we can combine this normality results from the CLT with standardization to make probabilistic statements about the population mean.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.8 Scientific Method  Hypothesis testing is rooted in the scientific method.  To prove something: assume that the opposite (complement) is true, and look for contradictory evidence.  We then look for evidence to disprove the complement.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.9 Hypothesis Testing  Hypothesis  A prediction  about a population parameter or  about the relationship between parameters from 2 or more populations  Formulate hypotheses BEFORE beginning an experiment  Hypothesis Testing  Used to evaluate hypotheses

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.10 Hypotheses  Null hypotheses  H 0 : No effect or no difference  Assume true but look for evidence to disprove  Alternative hypotheses  H A : Presence of an effect or difference  Try to prove  Well-formulated hypotheses are quantifiable and testable  Need to think about directionality (e.g., what are you trying to prove?)

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.11 Hypothesis Testing  Make a decision to reject (or fail to reject) the H 0 by comparing what is observed to what is expected if the H 0 is true (I.e., p- values)  Note that the hypotheses concern (unknown) population parameters. We make decisions about these parameters based on the (observable) sample statistics.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.12 Hypothesis Testing vs. Court Trials  Analogous to a court trial:  People are assumed innocent until proven guilty  H 0 : Innocent  H A : Guilty  If VERDICT = Guilty (i.e., reject H 0 ), then:  We can say that enough evidence was found to prove guilt (to some standard of proof)

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.13 Hypothesis Testing vs. Court Trials  If VERDICT = Not Guilty (i.e., do not reject H 0 ), then:  We cannot say that we have proven innocence  We can say that we failed to find enough evidence to prove guilt. (There is a subtle but important difference between the two).  “Absence of evidence is not evidence of absence.”

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.14 Hypothesis Testing  Evidence is used to disprove hypotheses  We can prove the alternative hypothesis to some standard of proof.  We cannot prove the null hypothesis (we can only fail to reject it) – we cannot prove what we have already assumed to be true.  Thus we do not “accept” H 0. We simply fail to reject it.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.15 Hypothesis Testing  The gap between the population and the sample always requires a leap of faith.  The only statements that can be made with certainty are of the form:  “our conclusions fit the data”  “our data are consistent with the theory that …”

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.16 P-values  Used to make decisions concerning hypotheses  Probability (Conditional)  Ranges from 0-1  A function of sample sizes  Careful when N is very large (everything is significant)  Careful when N is very small (everything is non-significant)  Based upon the notion of repeated sampling

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.17 P-values  Probability of observing a result (data) as extreme or more extreme than the one observed (due to chance alone) given that H 0 is true.  P(data|H 0 )  It is NOT P(H 0 |data)

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.18 P-values  Thus if a p-value is small, then the probability of seeing data like that observed is very small if H 0 is true.  In other words, we are either:  Seeing a very rare event (due to chance), or  H 0 is not true

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.19 P-values  P-value is small  reject H 0  P-value is not small  fail to reject H 0  What is “small”  Arbitrary (but strategic) choice by investigator  Often 0.05 (or 0.10 or 0.01)

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.20 P-values  Although commonly reported, p- values are controversial in some circles  Epidemiology  Bayesian statistics  vs. estimation  Draws an arbitrary line in the sand  P=0.051 vs. p=0.049

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.21 P-values  Random variable that varies from sample to sample.  Not appropriate to compare p-values from difference experiments  Do not reflect the strength of a relationship  Do not provide information concerning the magnitude of the effect.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.22 Truth H o TrueH o False Test Result Reject H o Type I error (  ) correct Do not reject H o correct Type II error (  )

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.23  Error   =P(making a Type I error|null hypothesis is true)  Probability of rejecting when we should not reject  = significance level  = Type I error  This is under our control  Conventionally, we often choose  =0.05

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.24  Error  Power = 1-   =P(not making a Type II error|H A is true)  Probability of rejecting when we should reject  Control with sample size  Larger sample size  higher power

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.25 Determining  and   is chosen by the investigator  is chosen by the investigator in prospectively designed studies (e.g., clinical trials) where sample size is determined by the investigator.  If the investigator does not control the sample size, then  is not “controlled- for” and is simply a result of the sample size

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.26 Determining  and   Weigh the costs of making  versus  error  and  are inversely related  Examples  Car alarms  Biosurveillance  Sensitivity/Specificity trade-off

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.27 Statistical vs. Practical Significance  Statistical significance does not imply practical relevance.  Results should be both: (1) statistically and (2) practically significant in order to influence policy.  Example: A drug may induce a statistically significant reduction in blood pressure. However, if this reduction is 1 mmHg in your systolic BP, then it is not a useful (practical and clinically relevant) drug.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.28 Hypothesis Testing  Test only relevant hypotheses.  Multiple testing  multiple places for error  Example: Food supplement for sexual dysfunction  Specify hypotheses a priori  Post-hoc: some studies use data to identify hypotheses (hypothesis generating)  This is done as an exploratory analyses (but not as a confirmatory analyses)

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.29 Example - Interpretation  NCEP recommends LDL < 100 mg/dL (for that for patients with two or more CHD risk factors  A new drug is being developed to (hopefully) lower LDL levels in such patients  A group of 25 patients with high LDL levels and CHD risk factors are recruited for a study of the new drug

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.30 Example - Interpretation  The LDL level of each patient is measured before the drug is given and again after 12 weeks of treatment with the drug  The difference (change=post-pre,  ) is calculated for each patient

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.31 Example - Interpretation  The study investigators made the following statement: “data from this study are consistent with the hypothesis that this drug lowers LDL in patients with elevated LDL and >2 CHD risk factors (mean(  )=-30, p=0.008)”.  What does this mean?

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.32 Example - Interpretation  Wherever there is a p-value, there is a hypotheses also. Determine H 0 and H A.  In this case:  H 0 : mean(  )=0; I.e., the drug has no effect  H A : mean(  )  0  Although these hypotheses are not stated explicitly, they are implied by the language (this can be tricky when reading journal articles)

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.33 Example - Interpretation  Assuming that the drug has no effect (e.g., H 0 is true), then there are two possible reasons for observing Mean(  )=- 30  Chance  The drug truly decreases LDL  P-values separate the two; indicating the probability that chance is producing the observed data

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.34 Interpreting the p-value  Assume that H 0 is true (i.e., the drug has no effect)  Hypothetically repeat the study a very large number of times and we observe the changes

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.35 Interpreting the p-value  Trial 1, Mean(  )=-25  Trial 2, Mean(  )=10  Trial 3, Mean(  )=0  Trial 4, Mean(  )=-5  Trial 5, Mean(  )=18 .

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.36 Interpreting the p-value  Note some differences are small, some are large, some are near 0. This is called sampling variability.  Differences between the trials are simply due the the random samples chosen for each study (e.g., observed differences between studies are simply a function of who enrolled into the study. Remember, we assume H 0 is true … the drug has no effect.)

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.37 Remember the Big Picture Populations and Samples Sample / Statistics x, s, s 2 Population Parameters , ,  2

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.38 Interpreting the p-value  P=0.008 says that if the drug has no effect (i.e., H 0 is true), the the probability of observing a result as extreme or more extreme that the one we observed in this study is  0.8% of the hypothetical trials on our list would have differences that large

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.39 Interpreting the p-value  We often reject at 0.05 (our standard of proof)  If p<0.05 then the probability of observing a result this extreme if the drug had no effect, is too small (a very rare event)  Thus we reject the null hypothesis that the drug has no effect and conclude that evidence suggests a drug effect  Note that when the drug truly has no effect, we will incorrectly conclude that it does 5% of the time.

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.40 Example  The Neuropsychological and Neurologic Impact of HCV Co- Infection in HIV-Infected Subjects  Insert HIV_HCV.pdf

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.41 Example  ACTG A5087: A prospective, multicenter, randomized trial comparing the efficacy and safety of fenofibrate versus pravastatin in HIV-infected subjects with lipid abnormalities  Insert A5087.pdf

Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.42 Hypothesis Testing – Step by Step 1.State H 0 (i.e., what you are trying to disprove) 2.State H A 3.Determine  (at your discretion) 4.Determine the test statistic and associated p-value. 5.Determine whether to reject H 0 or fail to reject H 0