Biostat 200 Lecture 5
Today
– Where are we
– Review of finding percentiles and cutoff values, the CLT, and 95% confidence intervals
– Hypothesis testing in general
– Hypothesis testing of one mean
– Hypothesis testing of one proportion
– Types of error
Where are we?
– Week 1: Variables, tables, graphs
– Week 2: Probability: independence and the multiplicative rule; mutual exclusivity and the additive rule; conditional probability
– Week 3: Some probability distributions, and how to get probabilities if you assume these distributions
– Week 4: The CLT for the distribution of sample means, and 95% CIs for means and proportions
– Week 5: Hypothesis testing in general, and for comparing one mean or one proportion to a hypothesized value
– Week 6: More hypothesis tests
Review: Normal distribution
If X ~ N(µ, σ) then Z = (X − µ)/σ
– To find the proportion of values in the population that exceed some threshold, i.e. P(X > x): transform x to z = (x − µ)/σ and look up P(Z > z) using display 1-normal(z)
– To find the percentiles of the distribution: e.g. to find the 97.5th percentile, the value below which 97.5% of the values fall, find z using display invnormal(.975) and solve z = (x − µ)/σ for x
For the normal distribution, Stata calculates P(Z < z):
– If you have z and you want p, use display normal(z)
– If you have p and you want z, use display invnormal(p)
For the t distribution, Stata calculates P(T > t):
– If you have t and you want p, use display ttail(df,t)
– If you have p and you want t, use display invttail(df,p)
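The same lookups can be cross-checked outside Stata. As a minimal sketch, Python's standard-library NormalDist mirrors normal() and invnormal(); for the t distribution, SciPy's stats.t.sf and stats.t.ppf play the roles of ttail and invttail.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal: mean 0, sd 1

# Stata: display normal(z)       -> P(Z < z)
p = Z.cdf(1.96)
# Stata: display invnormal(.975) -> the z with P(Z < z) = .975
z = Z.inv_cdf(0.975)

print(round(p, 4), round(z, 3))  # 0.975 1.96
```

The two calls are inverses of each other, just like normal() and invnormal().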
But usually you want to summarize the values and make inference about the mean. The Central Limit Theorem says that the distribution of a sample mean X̄ is normal with mean µ and standard deviation σ/√n.
Imagine that we did the following:
1. Draw a sample of size n from a population
2. Calculate the sample mean
3. Save the sample mean as an observation in the data set
4. Repeat steps 1-3 many times, enough to draw a reasonable histogram
The histogram will show you the distribution of the sample means. If the samples themselves were large enough (large n), the distribution of the sample means will appear normal.
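The repeated-sampling recipe above is easy to simulate. A minimal sketch in Python (the population, sample size, and seed are arbitrary choices for illustration):

```python
import random
from statistics import mean, stdev

random.seed(2024)                 # hypothetical seed, for reproducibility
n, reps = 50, 2000                # sample size and number of repeated samples

# Population: exponential with mean 2 (clearly non-normal)
sample_means = [mean(random.expovariate(0.5) for _ in range(n))
                for _ in range(reps)]

# CLT: the means cluster around mu = 2 with spread sigma/sqrt(n) = 2/sqrt(50) ~ 0.28
print(round(mean(sample_means), 2), round(stdev(sample_means), 2))
```

Even though the population is skewed, a histogram of sample_means would look approximately normal, centered at µ with spread σ/√n.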
Confidence intervals
Using the central limit theorem, we can make probability statements about values of X̄. These probability statements can be used to construct confidence intervals. Confidence intervals for means are statements about the probability that a given interval contains the true population mean µ. Confidence intervals can be constructed for any point estimate, e.g., odds ratios, hazard ratios, correlations, regression coefficients, standard deviations, … The methods for constructing these vary.
The width of the confidence interval depends on the confidence level (1 − α) and on n. If σ is unknown, we use the t distribution and the sample standard deviation s in our computation.
Confidence intervals for means
We know from the normal distribution that P(−1.96 ≤ Z ≤ 1.96) = 0.95. Substituting Z = (X̄ − µ)/(σ/√n) into the above we get
P(X̄ − 1.96·σ/√n ≤ µ ≤ X̄ + 1.96·σ/√n) = 0.95
After rearranging, the left and right terms are the confidence limits:
X̄ ± 1.96·σ/√n
And if σ is unknown, we use s and the t distribution:
X̄ ± t·s/√n, where t comes from invttail(n−1, .025)
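As a worked sketch of the t-based interval, using the blood-loss numbers that appear later in this lecture (assumes SciPy is available for the t quantile):

```python
from math import sqrt
from scipy import stats  # assumption: SciPy is installed

# Illustrative numbers: the blood-loss sample used later in this lecture
n, xbar, s = 83, 1413.0, 491.3

tcrit = stats.t.ppf(0.975, df=n - 1)   # t analog of 1.96; ~1.99 for 82 df
half_width = tcrit * s / sqrt(n)
print(round(xbar - half_width, 1), round(xbar + half_width, 1))
```

With 82 degrees of freedom the t multiplier is only slightly larger than 1.96, so the t interval is only slightly wider than the z interval.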
Confidence intervals for proportions
As n increases, the binomial distribution approaches the normal distribution. The normal approximation: as n increases,
X ~ N(np, √(np(1−p)))
– X is the total count of successes; np is the mean of the binomial; √(np(1−p)) is the standard deviation of the binomial
X/n = p̂ ~ N(p, √(p(1−p)/n))
– X/n is the proportion of successes; p is the population proportion; √(p(1−p)/n) is the standard deviation of X/n if X is binomial
So we can use the normal approximation to calculate confidence intervals for p.
Confidence intervals for proportions
But if n is small, or p is small, or both, the normal approximation is poor and intervals based on it are not reliable. We then need to use the binomial distribution itself. Confidence intervals computed from the binomial distribution are called exact confidence intervals. Exact confidence intervals are not symmetric (because neither is the binomial for small n or small p).
For p=0.05, even for large n, the distribution of X doesn't look exactly normal. For p=0.35, the distribution looks fairly normal even at small sample sizes.
E.g. Proportion of smokers in our class data set who smoke 10 or more cigarettes/day

. gen cigs10=1 if cigs>=10 & cigs !=.
(539 missing values generated)
. replace cigs10=0 if cigs<10 & smoke==1
(23 real changes made)
. tab cigs10

     cigs10 |      Freq.     Percent        Cum.
------------+-----------------------------------
          0 |         23       82.14       82.14
          1 |          5       17.86      100.00
------------+-----------------------------------
      Total |         28      100.00

Normal approximation CI formula: p̂ ± 1.96·√(p̂(1−p̂)/n), with p̂ = 5/28 = .1786
. di sqrt(.1786*(1-.1786)/28)
. di .1786 - 1.96*sqrt(.1786*(1-.1786)/28)
. di .1786 + 1.96*sqrt(.1786*(1-.1786)/28)
So the 95% CI using the normal approximation is approximately (.037, .320).
So the 95% CI using the normal approximation is approximately (.037, .320).
Using the Stata ci or proportion command:
. ci cigs10
. proportion cigs10
Both report the observed proportion with its standard error and a 95% confidence interval.
Using the Stata ci, binomial command:
. ci cigs10, binomial
This reports the binomial exact 95% confidence interval.
You can also calculate the exact CI using the invbinomialtail function in Stata:
. di invbinomialtail(28,5,.025)
. di invbinomialtail(28,6,.975)
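For comparison, the exact (Clopper-Pearson) limits can be sketched in Python via the beta distribution, taking 5 successes out of 28 as implied by the invbinomialtail calls above (assumes SciPy is available):

```python
from scipy import stats  # assumption: SciPy is installed

n, k = 28, 5   # 5 "successes" out of 28, as in the invbinomialtail calls

# Clopper-Pearson exact 95% limits (what Stata's "ci, binomial" reports)
lower = stats.beta.ppf(0.025, k, n - k + 1)
upper = stats.beta.ppf(0.975, k + 1, n - k)
print(round(lower, 3), round(upper, 3))
```

Note the interval is not symmetric around 5/28 = 0.179, unlike the normal-approximation interval.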
Hypothesis testing
Confidence intervals tell us about the uncertainty in a point estimate of the population mean or proportion (or some other quantity). Hypothesis testing allows us to draw a conclusion about a population parameter that we have estimated in a sample. Remember, we are taking samples from the population and we never know the truth.
Hypothesis testing
To conduct a hypothesis test about the mean of a population, we postulate a value for it, and call that value µ0.
– We write this H0: µ = µ0
– We call this the null hypothesis
We also set an alternative hypothesis covering all other values of µ.
– We write this HA: µ ≠ µ0
– We call this the alternative hypothesis
Hypothesis testing for a mean
After formulating the hypotheses, draw a random sample. Compare the mean of the random sample, X̄, to the hypothesized mean µ0. Is the difference between the sample mean X̄ and the hypothesized population mean µ0 likely due to chance (remember you have taken a sample), or too large to be attributed to chance? If it is too large to be due to chance, we say we have statistical significance.
Just like a 95% confidence interval has a 5% probability of not including the population parameter (e.g. the mean or the proportion), there is some probability that we will incorrectly reject the null hypothesis.
– We set that probability in advance, before collecting the data and doing the analysis
– That probability is called the significance level
– The significance level is denoted by α, and is frequently set to 0.05
Criminal trials in the US – innocent until proven guilty
In hypothesis testing, we assume the null hypothesis is true, and only reject the null if there is enough evidence that the sample did not come from the hypothesized population (e.g. that the mean is not µ0). In jury trials there is a chance that someone who is innocent is found guilty; the standard of innocent until proven guilty is in place to minimize this. In hypothesis testing, a true null hypothesis might be mistakenly rejected.
– This mistake is called a Type I error
– The probability of this mistake is chosen in advance; it is the significance level, which is referred to as α
The probability of obtaining a mean as extreme as or more extreme than the observed sample mean X̄, given that the null hypothesis is true, is called the p-value of the test, or p.
Terminology
Before the test, set the significance level, α.
– This is the probability of rejecting the null hypothesis when it is true (also called a Type I error)
When you run the test, you get the probability of observing a mean as extreme as you did, given (conditional on, or assuming) that the null hypothesis is true.
– This probability is the p-value
If the p-value is less than α (e.g. α = 0.05 and p = 0.013), then the test is called statistically significant and the null hypothesis is rejected.
Hypothesis testing: 3 steps
– Specify the null and alternative hypotheses
E.g.: Is the mean CD4 count in a sample of new HIV positives in Uganda below 350 cells/mm³?
– Determine the compatibility of the data with the null hypothesis
We do this by using the data to calculate a test statistic that is compared to a theoretical statistical distribution, e.g., the standard normal distribution or the t distribution. If the test statistic is very large, then our data are very unlikely under the null hypothesis.
– Either reject the null OR fail to reject the null
1. State the hypotheses
In epidemiology the null hypothesis is often that there is no association between the exposure and the outcome. E.g.,
– The difference in disease risk is 0: R(exposed) − R(not exposed) = 0
– The ratio of risks/prevalences/etc. is 1: R(exposed) / R(not exposed) = 1
– Clinical trials: the mean value of something does not differ between two groups
The alternative hypothesis is usually an association or a risk difference. E.g.,
– R(exposed) − R(not exposed) > 0
– R(exposed) / R(not exposed) > 1
– The mean response in the treated group is greater than in the untreated group
H0 (the null) and HA (the alternative) must be mutually exclusive (no overlap) and together include all the possibilities.
2. Determine the compatibility with the null
We determine whether there is enough evidence to reject the null hypothesis (i.e. calculate the test statistic and look up the p-value).
– If the null is true, what is the probability of obtaining data as extreme as or more extreme than the sample data?
– This probability is called the p-value
3. Reject or fail to reject
If the p-value (the probability of obtaining the sample data if the null is true) is sufficiently small (we often use 5%), then we reject the null and say the test was statistically significant. If we fail to reject the null, it does not mean that we accept it.
– We might fail to reject the null if the sample size is too small or the variability in the sample is too large
Hypothesis testing
– Significance level α: set a priori
– p-value: the result of your statistical test
– If p < α, reject the null – the test is statistically significant
– If p ≥ α, do not reject the null – the test is not statistically significant
Tests of one mean
We want to test whether a mean is equal to some hypothesized value. If we believe there might be deviations in either direction, we use a two-sided test.
Two-sided test:
– Null hypothesis H0: μ = μ0
– Alternative hypothesis HA: μ ≠ μ0
If we only care about values above or below a certain value, we use a one-sided test.
One-sided test:
– Null hypothesis H0: μ ≥ μ0; alternative hypothesis HA: μ < μ0
or
– Null hypothesis H0: μ ≤ μ0; alternative hypothesis HA: μ > μ0
Lexicon
For one-sided tests, people often say they are testing the hypothesis that the mean is less than or greater than some value μ0. When they say this they are usually stating the alternative hypothesis. For two-sided tests, people often say they are testing the hypothesis that the mean is μ0. When they say this they are stating the null hypothesis. Since the null is the complement of the alternative, in practice people don't usually state both (but we will for this class).
Tests of one mean
The distribution of a sample mean, if n is large enough, is normal. For a normally distributed random variable with known standard deviation σ, calculate the Z statistic:
z = (X̄ − μ0)/(σ/√n)
If σ is not known, calculate the t statistic instead:
t = (X̄ − μ0)/(s/√n)
Compare the test statistic to the appropriate distribution to get the p-value.
– Find P(Z > z_stat) or P(T > t_stat) for a one-sided hypothesis test
– Find 2·P(Z > |z_stat|) or 2·P(T > |t_stat|) for a two-sided hypothesis test
Test of one mean, one sided
Non-pneumatic anti-shock garment for the treatment of obstetric hemorrhage in Nigeria. Mean initial blood loss of 83 women: 1413 ml, SD = 491.3. Our question: are these women hemorrhaging (>750 ml blood loss)?
– H0: μ ≤ 750; HA: μ > 750; α = 0.05
T statistic: t = (1413 − 750)/(491.3/√83) = 12.3
p-value: P(T > 12.3)
Data adapted from Miller, S et al. Int J Gynecol Obstet (2009)
Test of one mean
A t statistic of 12.3 is far off the graph of the t distribution. So the probability of observing a sample mean of 1413 with n=83 and SD=491.3, if the true mean is ≤750, is very, very low.
Test of one mean, one sided hypothesis, Stata
P(T > t_stat): use ttail(df, t_stat)
. display ttail(82, 12.3)
1.308e-20
We reject the null.
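The same one-sided test can be cross-checked in Python (a sketch assuming SciPy is available; stats.t.sf is the counterpart of Stata's ttail):

```python
from math import sqrt
from scipy import stats  # assumption: SciPy is installed

n, xbar, s, mu0 = 83, 1413.0, 491.3, 750.0

t_stat = (xbar - mu0) / (s / sqrt(n))       # ~12.3, as on the slide
p_one_sided = stats.t.sf(t_stat, df=n - 1)  # P(T > t); Stata: ttail(82, 12.3)
print(round(t_stat, 1), p_one_sided)
```

The p-value is astronomically small, so the null of "no hemorrhage" (μ ≤ 750) is rejected.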
Test of one mean, one sided hypothesis, Stata
Another way: Stata immediate code for t tests, ttesti samplesize samplemean samplesd hypothesizedmean
. ttesti 83 1413 491.3 750
One-sample t test: t = 12.3 with 82 degrees of freedom; for Ha: mean > 750, Pr(T > t) ≈ 0
We reject the null.
Test of one mean, two sided hypothesis
Example: Do children with congenital heart disease walk at a different age than healthy children? Healthy children start walking at a mean age of 11.4 months.
– Null hypothesis H0: μ = 11.4 months (μ0)
– Alternative hypothesis HA: μ ≠ 11.4 months
– Significance level = 0.05
– Data: sample of children with congenital heart defects, n = 9, sample mean = 12.8, sample SD = 2.4
Calculate the test statistic and its associated p-value:
t = (12.8 − 11.4)/(2.4/√9) = 1.75
The p-value is P(T < −1.75) + P(T > 1.75) = 2·P(T > 1.75)
. display 2*ttail(8,1.75)
This gives p ≈ 0.118. Or run ttesti in Stata:
. ttesti 9 12.8 2.4 11.4
One-sample t test: t = 1.75 with 8 degrees of freedom; for Ha: mean != 11.4, Pr(|T| > |t|) ≈ 0.118
Fail to reject the null.
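A quick Python cross-check of this two-sided test (a sketch, assuming SciPy is available):

```python
from math import sqrt
from scipy import stats  # assumption: SciPy is installed

n, xbar, s, mu0 = 9, 12.8, 2.4, 11.4

t_stat = (xbar - mu0) / (s / sqrt(n))             # (12.8-11.4)/(2.4/3) = 1.75
p_two_sided = 2 * stats.t.sf(abs(t_stat), df=n - 1)
print(round(t_stat, 2), round(p_two_sided, 3))
```

Since the p-value exceeds 0.05, we fail to reject the null that these children walk at 11.4 months on average.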
Test of one mean
For data already in Stata, the ttest command also works. E.g. we want to test whether the mean CD4 cell count is below 350 among persons newly diagnosed with HIV in Uganda.
– Null hypothesis H0: μ ≥ 350 cells/mm³
– Alternative hypothesis HA: μ < 350 cells/mm³
– Significance level = 0.05
Test of one mean
Use ttest varname == hypothesized value
. ttest cd4count==350
One-sample t test: Ho: mean = 350, degrees of freedom = 269; for Ha: mean < 350, Pr(T < t) < 0.05
We reject the null.
Hypothesis testing
One-sided hypotheses: HA: μ < μ0 or HA: μ > μ0
For a one-sided hypothesis, if you know σ, you calculate either P(Z < z) or P(Z > z) for the above alternative hypotheses respectively. You are only looking at one tail of the distribution.
– For P(Z < z) to be < 0.05, z must be < −1.645
– For P(Z > z) to be < 0.05, z must be > 1.645
Hypothesis testing
Two-sided hypotheses: HA: μ ≠ μ0
– For a two-sided hypothesis, you are considering the probability that μ > μ0 or μ < μ0, so if you know σ you calculate P(Z > |z|) and P(Z < −|z|), which together cover both tails of the distribution
– This is 2·P(Z > |z|)
– For 2·P(Z > |z|) to be < 0.05, z must be > 1.96 or < −1.96
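The 1.645 and 1.96 cutoffs come straight from the inverse normal CDF; for instance, with Python's standard-library NormalDist:

```python
from statistics import NormalDist

Z = NormalDist()

z_one = Z.inv_cdf(0.95)    # one-sided cutoff: P(Z > 1.645) = 0.05
z_two = Z.inv_cdf(0.975)   # two-sided cutoff: 2*P(Z > 1.96) = 0.05
print(round(z_one, 3), round(z_two, 2))  # 1.645 1.96
```

The two-sided cutoff is larger because the 5% of error probability is split between the two tails.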
Hypothesis testing
So at the same significance level, here α = 0.05, less evidence is needed to reject the null for a one-sided test than for a two-sided test. In that sense a two-sided test is more conservative. What if you ran a test and got z = 1.83? If your hypothesis was one-sided, you'd reject (1.83 > 1.645). If your hypothesis was two-sided, you'd fail to reject (1.83 < 1.96). Therefore it is very important to specify your test a priori! For clinical trials, most people use two-sided hypotheses.
– That way no one will suspect you of changing your hypothesis a posteriori
– On the other hand, does it really make sense to have an alternative hypothesis that a new treatment is either better or worse than the old one?
Inference on proportions
Variables that are measured as successes or failures, disease or no disease, etc., can be considered binomial random variables.
– x = number of successes
– n = number of "trials"
– x/n = proportion diseased
Hypothesis test of a proportion
Under the CLT, the sampling distribution of an estimated proportion p̂, if n is large enough, is approximately normal with mean p and standard deviation √(p(1−p)/n).
Hypothesis test of a proportion
Therefore we can test whether a sample proportion is equal to, greater than, or less than some hypothesized p0 using the Z statistic:
z = (p̂ − p0)/√(p0(1−p0)/n)
Hypothesis test of a proportion
Example: micronutrient intake of black women in South Africa. Study: pre-menopausal women randomly selected based on geographic location. Micronutrient intake was determined using a Quantitative Food Frequency Questionnaire developed for South Africans. Question: do more than 10% of the women have a micronutrient intake of less than the RDA?
Data adapted from Hattingh Z. et al, West Indian Med J 2008: 57 (5):431
Hypothesis test of a proportion
Null hypothesis:
– Fewer than 10% of the women have dietary folate levels of less than 268 µg (a cutoff based on the RDA)
– H0: p < 0.10
Alternative hypothesis:
– 10% or more of the women have dietary folate levels of less than 268 µg
– HA: p ≥ 0.10
Significance level set at α = 0.05
Hypothesis test of a proportion
The data: 158/279 = 56.6% had folate levels of less than the cutoff, 268 µg.
Using the normal approximation to the binomial distribution (formula on the previous slide):
z = (0.566 − 0.10)/√(0.10×0.90/279) = 25.9
The chance of observing a z statistic this large (25.9) if the null is true is very, very small (<0.05). So we reject the null hypothesis.
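This hand calculation is easy to reproduce; a sketch with Python's standard library:

```python
from math import sqrt
from statistics import NormalDist

n, x, p0 = 279, 158, 0.10
phat = x / n                                  # 0.566

z = (phat - p0) / sqrt(p0 * (1 - p0) / n)     # ~25.9 (26.0 without rounding phat)
p_one_sided = 1 - NormalDist().cdf(z)         # effectively 0 at this z
print(round(z, 1), p_one_sided)
```

Note the standard error uses the hypothesized p0, not p̂, because the test is computed assuming the null is true.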
Hypothesis testing of a proportion
Stata immediate command: prtesti samplesize observedp hypothp
. prtesti 279 .566 .10
One-sample test of proportion: z = 25.9; for Ha: p > 0.1, Pr(Z > z) ≈ 0
. display 1-normal(25.9)
0
Using the binomial distribution
Command is: bitesti samplesize observed_k hypothesized_proportion
. bitesti 279 158 .1
N = 279, observed k = 158, expected k = 27.9, assumed p = 0.10, observed p = 0.566
Pr(k >= 158) and Pr(k <= 158) are reported as one-sided tests, along with the two-sided test; note: lower tail of two-sided p-value is empty
Or we could have used:
. di binomialtail(279,158,.1)
1.268e-82
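Since the exact p-value is just a binomial tail sum, it can be computed with nothing but the Python standard library; a sketch mirroring Stata's binomialtail(n,k,p):

```python
from math import comb

def binom_tail(n: int, k: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p) -- analog of Stata's binomialtail(n,k,p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

p_exact = binom_tail(279, 158, 0.1)   # the slide reports 1.268e-82
print(p_exact)
```

For large n this brute-force sum is fine numerically because every term is nonnegative; specialized routines (like Stata's) use the incomplete beta function instead.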
Hypothesis testing of a proportion
Another example. Null hypothesis:
– Fewer than 10% of the women have dietary zinc levels of less than 5 mg
– H0: p < 0.10
Alternative hypothesis:
– 10% or more of the women have dietary zinc levels of less than 5 mg
– HA: p ≥ 0.10
Significance level α = 0.05
Hypothesis testing of a proportion
The data: 31/279 = 11.1% had zinc levels of less than 5 mg.
. prtesti 279 .111 .10
One-sample test of proportion: z ≈ 0.61; for Ha: p > 0.1, Pr(Z > z) ≈ 0.27
The data are NOT inconsistent with the null hypothesis, therefore we fail to reject the null.
Using the binomial distribution
. bitesti 279 31 .1
This reports Pr(k >= 31) and Pr(k <= 31) as one-sided tests, along with the two-sided test.
Or we could have used:
. di binomialtail(279,31,.1)
Hypothesis testing of a proportion
Remember that if n or p is small, you may need to use an exact test. The Stata commands for this are:
display binomialtail(n,k,p)
or
bitesti samplesize observed_k hypothesized_p
Hypothesis tests versus confidence intervals
You can reach the same conclusion with confidence intervals as with two-sided hypothesis tests.
– Reject the null if the 95% confidence interval does not include the hypothesized value μ0
Confidence intervals give us more information: the range of reasonable values for μ and the uncertainty in our estimate X̄. Many statisticians and others prefer confidence intervals to hypothesis tests.
Types of error
Type I error
– α = significance level of the test = P(reject H0 | H0 is true)
– Incorrectly rejecting the null
We take a sample from the population, calculate the statistics, and make inference about the true population. If we did this repeatedly, we would incorrectly reject the null 5% of the time that it is true if α is set to 0.05.
Types of error
Type II error
– β = P(do not reject H0 | H0 is false)
– Incorrectly failing to reject the null
This happens when the test statistic is not large enough, even though the underlying distribution differs from the null.
Types of error
Remember, H0 is a statement about the population and is either true or false. We take a sample and use the information in the sample to try to determine the answer. Whether we make a Type I error or a Type II error depends on whether H0 is true or false. We set our chance of a Type I error, and can design our study to minimize the chance of a Type II error.

Decision           | H0 is true       | H0 is false
Do not reject H0   | Correct          | Type II error = β
Reject H0          | Type I error = α | Correct
Chance of a Type II error: β, the chance of failing to reject the null if the alternative is true
If the alternative is very different from the null, the chance of a Type II error is low
If the alternative is not very different from the null, the chance of a Type II error is high
Finding β, P(Type II error)
Example: Mean age of walking
– H0: μ ≤ 12.8 months (μ0)
– Alternative hypothesis HA: μ > 12.8 months
– Known SD = 2
– Significance level = 0.05
– Sample size = 9
We will reject the null if the z stat (assuming σ known) is > 1.645. So we will reject the null if
X̄ > μ0 + 1.645·σ/√n
For our example, the null will be rejected if X̄ > 12.8 + 1.645×2/√9 = 13.9
But if the true mean is really 16, what is the probability that the null will not be rejected – i.e., the probability of a Type II error? The null will be rejected if the sample mean is > 13.9, and not rejected if it is ≤ 13.9. What is the probability of getting a sample mean of ≤ 13.9 if the true mean is 16?
P(Z < (13.9 − 16)/(2/3))
. di normal((13.9-16)/(2/3))
So if the true mean is 16 and the sample size is 9, the probability of failing to reject the null is about 0.0008.
Note that this depended on the alternative population mean (e.g. 16).
– The chance of failing to reject the null will increase if the true population mean is closer to the null value
What is the probability of failing to reject the null if the true population mean is 15?
– P(Z < (13.9 − 15)/(2/3)) ≈ 0.0495
. di normal((13.9-15)/(2/3))
What is the probability of failing to reject the null if the true population mean is 14?
– P(Z < (13.9 − 14)/(2/3)) ≈ 0.4404
. di normal((13.9-14)/(2/3))
1 − β is called the power of a statistical test.
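These Type II error calculations can be scripted; a sketch with Python's standard library, using the rejection cutoff of 13.9 from the slide:

```python
from math import sqrt
from statistics import NormalDist

Z = NormalDist()
cutoff = 13.9            # reject H0 when the sample mean exceeds this (from the slide)
sigma, n = 2, 9
se = sigma / sqrt(n)     # standard error of the mean, 2/3

for true_mu in (16, 15, 14):
    # beta = P(sample mean <= cutoff | true mean), i.e. P(fail to reject)
    beta = Z.cdf((cutoff - true_mu) / se)
    print(true_mu, round(beta, 4))
```

As the true mean moves closer to the null value, β grows sharply.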
Power
The power of a test is the probability that you will reject the null hypothesis when it is not true; power = 1 − β. You can construct a power curve by plotting power versus different alternative hypotheses. You can also construct a power curve by plotting power versus different sample sizes (n's). This curve allows you to weigh the gains in power against the cost of adding participants. Power curves are not linear.
The power of a statistical test is lower for alternative values that are closer to the null value. The power of a statistical test can be increased by increasing n. For a fixed n, the power can also be increased by increasing α, the probability of a Type I error. The balance between Type I error and Type II error (or power) should depend on the context of the study. When setting up a study, most investigators make sure that there will be at least 80% power. Sample size formulas allow you to specify the desired α and β levels.
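To see how power grows with n (nonlinearly), here is a sketch for a one-sided z test with purely hypothetical numbers (true mean 0.5 SD above the null value; all values illustrative):

```python
from math import sqrt
from statistics import NormalDist

Z = NormalDist()

def power_one_sided(mu0, mu_a, sigma, n, alpha=0.05):
    """Power of a one-sided z test of H0: mu <= mu0 when the true mean is mu_a."""
    z_crit = Z.inv_cdf(1 - alpha)
    shift = (mu_a - mu0) / (sigma / sqrt(n))   # how far the truth sits from the null
    return 1 - Z.cdf(z_crit - shift)

# Hypothetical effect: true mean 0.5 SD above the null value
for n in (9, 16, 25, 50):
    print(n, round(power_one_sided(0, 0.5, 1, n), 3))
```

Each additional participant buys less power than the last, which is why power curves flatten out and why sample size planning trades power against cost.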
For next time
– Read Pagano and Gauvreau Chapters 10 and 14 (pages …) – today's material
– Pagano and Gauvreau Chapters 11-12 and 14 (pages …)