BS704 Class 7 Hypothesis Testing Procedures

Slides:



Advertisements
Similar presentations
Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.
Advertisements

Chapter 7 Hypothesis Testing
Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
CPH Exam Review Biostatistics
Inferential Statistics
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Chapter 8 Introduction to Hypothesis Testing
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
A Decision-Making Approach
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
IENG 486 Statistical Quality & Process Control
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Chapter 8 Introduction to Hypothesis Testing
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Statistics for Managers Using Microsoft® Excel 5th Edition
Sample Size Determination
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Experimental Statistics - week 2
Chapter 10 Hypothesis Testing
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Chapter 7 Using sample statistics to Test Hypotheses about population parameters Pages
Confidence Intervals and Hypothesis Testing - II
Hypothesis Testing – Examples and Case Studies
Hypothesis Testing Procedures. Objectives Define null and research hypothesis, test statistic, level of significance and decision rule Understand Type.
8 - 1 © 2003 Pearson Prentice Hall Chi-Square (  2 ) Test of Variance.
Fundamentals of Hypothesis Testing: One-Sample Tests
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Power and Sample Size Determination Anwar Ahmad. Learning Objectives Provide examples demonstrating how the margin of error, effect size and variability.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Chapter 10 Hypothesis Testing
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
CPH Exam Review Biostatistics Lisa Sullivan, PhD Associate Dean for Education Professor and Chair, Department of Biostatistics Boston University School.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
MTH3003 PJJ SEM II 2014/2015 F2F II 12/4/2015.  ASSIGNMENT :25% Assignment 1 (10%) Assignment 2 (15%)  Mid exam :30% Part A (Objective) Part B (Subjective)
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
What is a Test of Significance?. Statistical hypotheses – statements about population parameters Examples Mean weight of adult males is greater than 160.
© Copyright McGraw-Hill 2004
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Hypothesis Testing Concepts of Hypothesis Testing Statistical hypotheses – statements about population parameters Examples mean weight of adult.
Chapter 9 -Hypothesis Testing
Chapter 9: Inferences Involving One Population
Estimation & Hypothesis Testing for Two Population Parameters
Chapter 9 Hypothesis Testing.
Comparing Populations
Hypothesis Testing: The Difference Between Two Population Means
Presentation transcript:

BS704 Class 7 Hypothesis Testing Procedures

Please complete Quiz 8 Before Oct 26 HW Set #6 Chapter 7 Problems 4, 7, 8, 20 and 25 R Problem Set 6 (on Blackboard) Due October 26 Please complete Quiz 8 Before Oct 26

Objectives Define null and research hypothesis, test statistic, level of significance and decision rule Understand Type I and Type II errors Differentiate hypothesis testing procedures based on type of outcome variable and number of samples

Hypothesis Testing Research hypothesis is generated about unknown population parameter Sample data are analyzed and determined to support or refute the research hypothesis

Hypothesis Tests About m Example: A large, national study was conducted in 2007 and found that the mean systolic blood pressure for males aged 50 was 130. In 2008, an investigator hypothesizes systolic blood pressures have increased. To test this hypothesis we set up two competing hypotheses Null H0: m = 130 Research H1: m > 130

Hypothesis Tests About m To test the hypotheses a random sample is selected from the population of interest. Suppose a sample of n=108 males age 50 in 2008 is selected and their systolic blood pressures are analyzed. Of primary interest is the mean systolic blood pressure in the sample ( ).

If the sample mean is 130, which is more likely true?

If the sample mean is 150, which is more likely true?

If the sample mean is 135, which is more likely true?

Hypothesis Tests About m We must determine a critical value such that if our sample mean is less than the critical value we will conclude that H0 is true (i.e., m=130), and if our sample mean is greater than the critical value we will conclude that H1 is true (i.e., m > 130).

Hypothesis Tests About m Instead of determining critical values for the sample mean - which would be specific to each application (since depends on the unit of measurement), - appeal to the Central Limit Theorem.

Hypothesis Tests About m For large n - is approximately normally distributed. Assuming that H0 is true (or "under the null hypothesis") we can standardize , producing a Z score (test statistic). If  130 then Z  0  H0 probably true. If >130 then Z > 0  H1 probably true. What value of Z is considered “large” ? In Example suppose that s=15 and n=108.

Hypothesis Tests About m We must select a level of significance, denoted a, which is defined as the probability of rejecting H0 when H0 is true. The level of significance is generally in the range of 0.01 to 0.10. Once a level of significance is selected, a decision rule is formulated.

Hypothesis Tests About m Decision rule:  Reject H0 if Z > 1.645 Do Not Reject H0 if Z < 1.645 Once the decision rule is in place, we compute the value of the test statistic. Suppose in example, X=135.

Hypothesis Tests About m The final step - draw a conclusion. The test statistic falls in the rejection region - we reject H0 (3.46 > 1.645). We have significant evidence, a = 0.05, to show that the mean systolic blood pressure for males aged 50 in 2008 has increased from 130.

Hypothesis Testing Procedures 1. Set up null and research hypotheses, select a 2. Select test statistic 3. Set up decision rule 4. Compute test statistic 5. Draw conclusion & summarize significance (p-value)

P-values P-values represent the exact significance of the data Estimate p-values when rejecting H0 to summarize significance of the data (can approximate with statistical tables, can get exact value with statistical computing package) P-value is the smallest a where we still reject H0

Hypothesis Testing for m Continuous outcome 1 Sample H0: m=m0 H1: m>m0, m<m0, m≠m0 Test Statistic n>30 (Find critical value in Table 1C, n<30 Table 2)

Hypothesis Testing for m The National Center for Health Statistics (NCHS) reports the mean total cholesterol for adults is 203. Is the mean total cholesterol in Framingham Heart Study participants significantly different? In 3310 participants the mean is 200.3 with a standard deviation of 36.8.

Hypothesis Testing for m 1. H0: m=203 H1: m≠203 a=0.05 2. Test statistic 3. Decision rule Reject H0 if z > 1.96 or if z < -1.96

Hypothesis Testing for m 4. Compute test statistic 5. Conclusion. Reject H0 because -4.22 < -1.96. We have statistically significant evidence at a=0.05 to show that the mean total cholesterol is different in the Framingham Heart Study participants.

Hypothesis Testing for m Significance of the findings. Z = -4.22. Table 1C. Critical Values for Two-Sided Tests a Z 0.20 1.282 0.10 1.645 0.05 1.960 0.010 2.576 0.001 3.291 0.0001 3.819 p<0.0001.

Interpreting P-Values If p < a then reject H0

Errors in Hypothesis Tests Conclusion of Statistical Test Do Not Reject H0 Reject H0 H0 true Correct Type I error H0 false Type II error Correct

Practice Example – Is social networking a health risk? Hypertexting (>120 text messages per day) has been associated with health risks (Frank et al, Nov 2010). In 2010, the mean number of texts per day was 55. Is texting increasing in 2012? A sample of 75 teens report sending a mean of 61 texts per day (SD = 15). Is there evidence of an increase in texting in 2012?

Practice Example – Is social networking a health risk? 1. H0: m = 55 H1: m > 55 a=0.05 2. Test statistic 4. Test Statistic 3. Decision rule 5. Reject H0. Reject H0 if z > 1.645

In an upper tailed test with a=0. 05. If Z=-2 In an upper tailed test with a=0.05. If Z=-2.5 would you reject H0: m=50? Yes No

We run a test and do not reject H0. Which is most likely… We made the correct decision We committed a Type I error We committed a Type II error 1 or 2 1 or 3

New Scenario Outcome is dichotomous (p=population proportion) Result of surgery (success, failure) Cancer remission (yes/no) One study sample Data On each participant, measure outcome (yes/no) n, x=# positive responses,

Hypothesis Testing for p Dichotomous outcome 1 Sample H0: p=p0 H1: p>p0, p<p0, p≠p0 Test Statistic (Find critical value in Table 1C)

Hypothesis Testing for p The NCHS reports that the prevalence of cigarette smoking among adults in 2002 is 21.1%. Is the prevalence of smoking lower among participants in the Framingham Heart Study? In 3536 participants, 482 reported smoking (482/3536=0.136).

Hypothesis Testing for p 1. H0: p=0.211 H1: p<0.211 a=0.05 2. Test statistic 3. Decision rule Reject H0 if z < -1.645

Hypothesis Testing for p 4. Compute test statistic 5. Conclusion. Reject H0 because -10.93 < -1.645. We have statistically significant evidence at a=0.05 to show that the prevalence of smoking is lower among the Framingham Heart Study participants. (p<0.0001)

Practice Example – Is social networking a health risk? Hypertexting (>120 text messages per day) has been associated with health risks (Frank et al, Nov 2010). In 2010, 19% of teens were hypertexting. Is hypertexting increasing in 2012? In a sample of 75 teens, 16 report sending more than 120 texts per day. Is hyper-texting increasing in 2012?

Practice Example – Is social networking a health risk? 1. H0: p=0.19 H1: p > 0.19 a=0.05 2. Test statistic 4. Test Statistic 3. Decision rule 5. Do not reject H0. Reject H0 if z > 1.645 Sample Data: n=75, x=16 𝑝 = 16 75 =0.21

New Scenario Outcome is continuous SBP, Weight, cholesterol Two independent study samples Data On each participant, identify group and measure outcome

Two Independent Samples RCT: Set of Subjects Who Meet Study Eligibility Criteria Randomize Treatment 1 Treatment 2 Mean Trt 1 Mean Trt 2

Two Independent Samples Cohort Study - Set of Subjects Who Meet Study Inclusion Criteria Group 1 Group 2 Mean Group 1 Mean Group 2

Hypothesis Testing for (m1-m2) Continuous outcome 2 Independent Sample H0: m1=m2 (m1-m2 = 0) H1: m1>m2, m1<m2, m1≠m2

An RCT is planned to show the efficacy of a new drug vs An RCT is planned to show the efficacy of a new drug vs. placebo to lower total cholesterol. What are the hypotheses? H0: mP=mN H1: mP>mN H0: mP=mN H1: mP<mN H0: mP=mN H1: mP≠mN

Hypothesis Testing for (m1-m2) Test Statistic n1>30 and (Find critical value n2> 30 in Table 1C, n1<30 or Table 2) n2<30

Pooled Estimate of Common Standard Deviation, Sp Previous formulas assume equal variances (s12=s22) If 0.5 < s12/s22 < 2, assumption is reasonable

Hypothesis Testing for (m1-m2) A clinical trial is run to assess the effectiveness of a new drug in lowering cholesterol. Patients are randomized to receive the new drug or placebo and total cholesterol is measured after 6 weeks on the assigned treatment. Is there evidence of a statistically significant reduction in cholesterol for patients on the new drug?

Hypothesis Testing for (m1-m2) Sample Size Mean Std Dev New Drug 15 195.9 28.7 Placebo 15 217.4 30.3

Hypothesis Testing for (m1-m2) 1. H0: m1=m2 H1: m1<m2 a=0.05 2. Test statistic 3. Decision rule, df=n1+n2-2 = 28 Reject H0 if t < -1.701

Assess Equality of Variances Ratio of sample variances: 28.72/30.32 = 0.90

Hypothesis Testing for (m1-m2) 4. Compute test statistic 5. Conclusion. Reject H0 because -2.92 < -1.701. We have statistically significant evidence at a=0.05 to show that the mean cholesterol level is lower in patients on treatment as compared to placebo. (p<0.005)

A two sided test for the equality of means produces p=0.20. Reject H0? Yes No Maybe

New Scenario Outcome is continuous SBP, Weight, cholesterol Two matched study samples Data On each participant, measure outcome under each experimental condition Compute differences (D=X1-X2)

Two Dependent/Matched Samples Subject ID Measure 1 Measure 2 1 55 70 2 42 60 . Measures taken serially in time or under different experimental conditions

Crossover Trial Treatment Treatment Eligible R Participants Placebo Placebo Each participant measured on Treatment and placebo

Hypothesis Testing for md Continuous outcome 2 Matched/Paired Sample H0: md=0 H1: md>0, md<0, md≠0 Test Statistic n>30 (Find critical value in Table 1C, n<30 Table 2)

Hypothesis Testing for md Is there a statistically significant difference in mean systolic blood pressures (SBPs) measured at exams 6 and 7 (approximately 4 years apart) in the Framingham Offspring Study? Among n=15 randomly selected participants, the mean difference was -5.3 units and the standard deviation was 12.8 units. Differences were computed by subtracting the exam 6 value from the exam 7 value.

Hypothesis Testing for md 1. H0: md=0 H1: md≠0 a=0.05 2. Test statistic 3. Decision rule, df=n-1=14 Reject H0 if t > 2.145 or if t < -2.145

Hypothesis Testing for md 4. Compute test statistic 5. Conclusion. Do not reject H0 because -2.145 < -1.60 < 2.145. We do not have statistically significant evidence at a=0.05 to show that there is a difference in systolic blood pressures over time.

New Scenario Outcome is dichotomous Result of surgery (success, failure) Cancer remission (yes/no) Two independent study samples Data On each participant, identify group and measure outcome (yes/no)

Hypothesis Testing for (p1-p2) Dichotomous outcome 2 Independent Sample H0: p1=p2 H1: p1>p2, p1<p2, p1≠p2 Test Statistic (Find critical value in Table 1C)

Hypothesis Testing for (p1-p2) Is the prevalence of CVD different in smokers as compared to nonsmokers in the Framingham Offspring Study? Free of CVD History of CVD Total Nonsmoker 2757 298 3055 Current smoker 663 81 744 3420 379 3799

Hypothesis Testing for (p1-p2) 1. H0: p1=p2 H1: p1≠p2 a=0.05 2. Test statistic 3. Decision rule Reject H0 if Z < -1.96 or if Z > 1.96

Hypothesis Testing for (p1-p2) 4. Compute test statistic

Hypothesis Testing for (p1-p2) 5. Conclusion. Do not reject H0 because -1.96 < 0.927 < 1.96. We do not have statistically significant evidence at a=0.05 to show that there is a difference in prevalent CVD between smokers and nonsmokers.

Study of Single verses Weekly Antenatal Corticosteroids What do p-values mean in Table 1? What do p-values mean in Table 3?

Study of Single verses Weekly Antenatal Corticosteroids Did randomization work?

Primary outcome Is trial a success?

Practice Problem Run Test for Primary Outcome Composite Morbidity Group Sample Size # Events Weekly 256 56 Single 246 66

Solution 3. Decision rule Reject H0 if Z < -1.96 or if Z > 1.96 1. H0: p1=p2 H1: p1≠p2 a=0.05 2. Test statistic 3. Decision rule Reject H0 if Z < -1.96 or if Z > 1.96

Solution 4. Compute test statistic 5. Do not Reject H0 since -1.96<-1.294<1.96.