Introduction to Biostatistics and Bioinformatics

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Statistics for Business and Economics
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 9-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 8 Introduction to Hypothesis Testing
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Chapter 9 Hypothesis Testing.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Ch. 9 Fundamental of Hypothesis Testing
Chapter 8 Introduction to Hypothesis Testing
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 TUTORIAL 6 Chapter 10 Hypothesis Testing.
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Hypothesis Testing Methodology Z Test for the Mean (  Known) p-Value Approach to Hypothesis Testing.
Statistics for Managers Using Microsoft® Excel 5th Edition
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Chapter 10 Hypothesis Testing
Confidence Intervals and Hypothesis Testing - II
Statistics for Managers Using Microsoft Excel
Descriptive statistics Inferential statistics
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
© 2002 Prentice-Hall, Inc.Chap 7-1 Statistics for Managers using Excel 3 rd Edition Chapter 7 Fundamentals of Hypothesis Testing: One-Sample Tests.
© 2003 Prentice-Hall, Inc.Chap 9-1 Fundamentals of Hypothesis Testing: One-Sample Tests IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION.
Chapter 8 Hypothesis Testing (假设检验)
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Fundamentals of Hypothesis Testing: One-Sample Tests
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.
6 - 1 © 2000 Prentice-Hall, Inc. A First Course in Business Statistics Inferences Based on a Single Sample: Tests of Hypothesis Chapter 6.
8 - 1 © 1998 Prentice-Hall, Inc. Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 10 Hypothesis Testing
© 2003 Prentice-Hall, Inc.Chap 7-1 Business Statistics: A First Course (3 rd Edition) Chapter 7 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Statistical Methods 5.
8 - 1 © 2000 Prentice-Hall, Inc. Statistics for Business and Economics Inferences Based on a Single Sample: Tests of Hypothesis Chapter 8.
Statistical Inference
Testing of Hypothesis Fundamentals of Hypothesis.
HYPOTHESIS TESTING. Statistical Methods Estimation Hypothesis Testing Inferential Statistics Descriptive Statistics Statistical Methods.
Topic 8 Hypothesis Testing Mathematics & Statistics Statistics.
© 2002 Prentice-Hall, Inc.Chap 7-1 Business Statistics: A First course 4th Edition Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Statistics for Managers 5th Edition Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
© 2004 Prentice-Hall, Inc.Chap 9-1 Basic Business Statistics (9 th Edition) Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
© 2001 Prentice-Hall, Inc.Chap 9-1 BA 201 Lecture 14 Fundamentals of Hypothesis Testing.
© Copyright McGraw-Hill 2004
Chapter 8 Hypothesis Testing (假设检验)
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
What is a Hypothesis? A hypothesis is a claim (assumption) about the population parameter Examples of parameters are population mean or proportion The.
1 Lecture 5: Section B Class Web page URL: Data used in some examples can be found in:
1 of 53Visit UMT online at Prentice Hall 2003 Chapter 9, STAT125Basic Business Statistics STATISTICS FOR MANAGERS University of Management.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Learning Objectives Describe the hypothesis testing process Distinguish the types of hypotheses Explain hypothesis testing errors Solve hypothesis testing.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Hypothesis Testing: One-Sample Inference
Statistics for Managers using Excel 3rd Edition
Statistics for Business and Economics
Presentation transcript:

Introduction to Biostatistics and Bioinformatics This Lecture Introduction to Biostatistics and Bioinformatics Hypothesis Testing I By Judy Zhong Assistant Professor Division of Biostatistics Department of Population Health Judy.zhong@nyumc.org

Statistical Methods Statistical Methods Descriptive Inferential Statistics Inferential Statistics Hypothesis Others Estimation Testing 5

Hypothesis testing Research hypotheses are conjectures or suppositions that motivate the research Statistical hypotheses restate the research hypotheses to be addressed by statistical techniques. Formally, a statistical hypothesis testing problem includes two hypothesis Null hypothesis (H0) Alternative hypothesis (Ha, H1) In statistical hypothesis testing, we start off believing the null hypothesis, and see if the data provide enough evidence to abandon our belief in H0 in favor of Ha

What’s a Hypothesis? A Belief about a population parameter Parameter is population mean, proportion, variance Hypothesis must be stated before analysis I believe the mean birth weight in the general population is 120 oz © 1984-1994 T/Maker Co.

Birth Weight Example Average birth weight in the general population is 120 oz. You take a sample of 100 babies born in the hospital you work at (that is located in a low-SES area), and find that the sample mean birth weight is 115 oz. You wonder: is this observed difference merely due to chance OR is the mean birth weight of SES babies indeed lower than that in the general population? Suppose we want to test the hypothesis that mothers with low socioeconomic status (SES) deliver babies whose birthweights are lower than “normal”.

Null Hypothesis Parameter interest: the mean birth weight of SES babies, denoted by  Begin with the assumption that the null hypothesis is true E.g. H0 : the mean birth weight of SES babies is equal to that in the general population Similar to the notion of innocent until proven guilty H0:   120 Could even has inequality sign: ≤ or ≥ (more complex tests)

Alternative Hypothesis Is set up to represent research goal Opposite of null hypothesis E.g. Ha : the mean birth weight of SES babies is lower than that in the general population Ha:  < 120 Always has inequality sign: ,, or   will lead to two-sided tests < , > will lead to one-sided tests

One-Sided vs Two-Sided Hypothesis Tests H0:   0 or H0:   0 Ha:  < 0 Ha:   0 Two-sided: H0:   3 Ha:   3 It is very important to remember that hypothesis statements are about populations and NOT samples. We will never have a hypothesis statement with either xbar or p-hat in it.

Making Decisions—four possible scenarios Fail to reject H0 when in fact H0 is true (good decision) Fail to reject H0 when in fact H0 is false (an error) Reject H0 when in fact H0 is true (an error) Reject H0 when in fact H0 is false (good decision)

Errors in Making Decision Type I Error Reject null hypothesis H0 when H0 is true Has serious consequences Probability of type I error is (alpha) Called level of significance Type II Error Do not reject H0 when H0 is false (H0 is true) Probability of type II error is (beta)

Possible Outcomes in Hypothesis Testing

Type I & II Error Relationship Type I and Type II errors cannot happen at the same time Type I error can only occur if H0 is true Type II error can only occur if H0 is false If Type I error probability () , then Type II error probability ()

 &  Have an Inverse Relationship Can’t reduce both errors simultaneously: trade-off!  

Reject hypothesis! Not close. Hypothesis Testing I believe the population mean age is 50 (hypothesis). Reject hypothesis! Not close. Population       Random sample  Mean X = 20  

Sampling Distribution of Sample Mean (Xbar) Basic Idea: CLT Sampling Distribution of Sample Mean (Xbar) m = 50 Sample Mean H0

Sampling Distribution Basic Idea Sampling Distribution It is unlikely that we would get a sample mean of this value ... 20 m = 50 Sample Mean H0

Sampling Distribution Basic Idea Sampling Distribution It is unlikely that we would get a sample mean of this value ... ... if in fact this were the population mean 20 m = 50 Sample Mean H0

Sampling Distribution Basic Idea Sampling Distribution It is unlikely that we would get a sample mean of this value ... But, how unlikely is unlikely, is there a rule? ... if in fact this were the population mean 20 m = 50 Sample Mean H0

Rejection Region Def: the range of values of the test statistics xbar for which H0 is rejected We need a critical (cut-off) value to decide if our sample mean is “too extreme” when null hypothesis is true. Designated (alpha) Typical values are .01, .05, .10 selected by researcher at start = P(Rejecting H0 when H0 is true) = P(xbar<c, when H0 is true)

Rejection Region (One-Sided Test) Sampling Distribution Ho Value Critical a Sample Statistic Rejection Region Nonrejection 1 -  Level of Confidence Rejection region does NOT include critical value.

Rejection Region (One-Sided Test) Sampling Distribution Ho Value Critical a Sample Statistic Rejection Region Nonrejection 1 -  Level of Confidence Rejection region does NOT include critical value. Sample Statistic Observed sample statistic

Rejection Region (One-Sided Test) Sampling Distribution Ho Value Critical a Sample Statistic Rejection Region Nonrejection 1 -  Level of Confidence Level of Confidence 1 -  Rejection region does NOT include critical value.

Rejection Regions (Two-Sided Test) Sampling Distribution Critical Value 1/2 a Rejection Region Ho Sample Statistic Nonrejection 1 -  Level of Confidence Rejection region does NOT include critical value.

Rejection Regions (Two-Sided Test) Sampling Distribution Critical Value 1/2 a Rejection Region Ho Nonrejection 1 -  Level of Confidence Rejection region does NOT include critical value. Observed sample statistic

Rejection Regions (Two-Tailed Test) Sampling Distribution Critical Value 1/2 a Rejection Region Ho Nonrejection 1 -  Level of Confidence Rejection region does NOT include critical value.

Rejection Regions (Two-Tailed Test) Sampling Distribution Critical Value 1/2 a Rejection Region Ho Nonrejection 1 -  Level of Confidence Rejection region does NOT include critical value.

Hypotheses Testing Steps State H0 State Ha Choose  Choose n Choose test Set up critical values Collect data Compute test statistic Make statistical decision Express decision

Test for Mean ( Unknown) Assumptions Population Is normally distributed If Not Normal, only slightly skewed & large sample (n  30) taken T test statistic Use T table

Two-Sided t Test Example You work for the FTC. A manufacturer of detergent claims that the mean weight of detergent is 3.25 lb. You take a random sample of 64 containers. You calculate the sample average to be 3.238 lb. with a standard deviation of .117 lb. At the .01 level, is the manufacturer correct? Allow students about 10 minutes to finish this. 3.25 lb.

Two-Tailed t Test Solution* H0:  = 3.25 Ha:   3.25   .01 df  64 - 1 = 63 Critical Value(s): Test Statistic: Decision: Conclusion: Do not reject at  = .01 t 2.6561 -2.6561 .005 Reject H0 There is no evidence average is not 3.25

p-Value Probability of obtaining a test statistic as extreme or more extreme than actual sample value given H0 is true Called observed level of significance Smallest value of  H0 can be rejected Used to make rejection decision If p-value  , do not reject H0 If p-value < , reject H0

Two-sided test: 1. T value of sample statistic (observed) 0.82 -0.82

Two-sided test: 2. From T Table 3 p-value is P(T  -.82 or T  .82) = .2*2 T .82 -.82 1/2 p-Value=.2

Test statistic is in ‘Do not reject’ region (p-Value = .4)  ( = .01); Do not reject. .82 -.82 T Reject 1/2 p-Value = .2 1/2  = .005

Probability of rejecting false H0 (Correct Decision) Power of Test Probability of rejecting false H0 (Correct Decision)

Power of Test Used in determining test adequacy Affected by True value of population parameter 1- increases when difference with hypothesized parameter increases Significance level  1- increases when  increases Standard deviation 1- increases when  decreases Sample size n 1- increases when n increases

What we learned today.. Hypotheses testing concepts Decision making risks: Type I error, Type II error and Power P-value method Two-tailed t-test of mean (sigma unknown) One-tailed t-test of mean (sigma unknown) Power of a test As a result of this class, you will be able to ...