Fall 2002Biostat 511186 Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.

Slides:



Advertisements
Similar presentations
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
STATISTICAL INFERENCE PART V
Chapter 10: Hypothesis Testing
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
8 - 10: Intro to Statistical Inference
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
TESTING HYPOTHESES FOR A SINGLE SAMPLE
7/2/2015Basics of Significance Testing1 Chapter 15 Tests of Significance: The Basics.
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Chapter 9 Hypothesis Testing.
Ch. 9 Fundamental of Hypothesis Testing
BCOR 1020 Business Statistics
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical hypothesis testing – Inferential statistics I.
1. Statistics: Learning from Samples about Populations Inference 1: Confidence Intervals What does the 95% CI really mean? Inference 2: Hypothesis Tests.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Confidence Intervals and Hypothesis Testing - II
Statistical Techniques I
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Fundamentals of Hypothesis Testing: One-Sample Tests
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 20 Testing Hypotheses About Proportions.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
STATISTICAL INFERENCE PART VII
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Chapter 20 Testing hypotheses about proportions
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
1 When we free ourselves of desire, we will know serenity and freedom.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Welcome to MM570 Psychological Statistics
BPS - 5th Ed. Chapter 151 Thinking about Inference.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Confidence Intervals and Hypothesis Testing Using and.
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
+ Homework 9.1:1-8, 21 & 22 Reading Guide 9.2 Section 9.1 Significance Tests: The Basics.
Hypothesis Testing Summer 2017 Summer Institutes.
Presentation transcript:

Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed (but unknown) parameter (1 -  ) of the time CI for ,  assumed known:  confidence  wider interval  sample size  smaller SE  narrower interval

Fall 2002Biostat

Fall 2002Biostat Confidence Intervals Q: When we do not know the population parameter, how can we use the sample to estimate the population mean, and use our knowledge of probability to give a range of values consistent with the data? Parameter:  Estimate: Given a normal population, or large sample size, we can state: Note: this is not

Fall 2002Biostat Confidence Intervals We can do some rearranging: The interval is called a 95% confidence interval for . Interpretation: If we repeat this procedure 100 times, the interval constructed in this manner will include the true mean (  ) 95 times.

Fall 2002Biostat Confidence Intervals  known When  is known we can construct a confidence interval for the population mean, , for any given confidence level, (1 -  ). Instead of using 1.96 (as with 95% CI’s) we simply use a different constant that yields the right probability. So if we desire a (1 -  ) confidence interval we can derive it based on the statement That is, we find constantsand that have exactly (1 -  ) probability between them. A (1 -  ) Confidence Interval for the Population Mean or, equivalently,

Fall 2002Biostat Confidence Intervals  known - EXAMPLE Suppose gestational times are normally distributed with a standard deviation of 6 days. A sample of 30 second time mothers yield a mean pregnancy length of days. Construct a 90% confidence interval for the mean length of second pregnancies based on this sample.

Fall 2002Biostat Summary General (1 -  ) Confidence Intervals: (statistic + Q (  /2)  SE statistic, statistic + Q (1-  /2)  SE statistic ) CI for ,  assumed known  Z.  confidence  wider interval  sample size  smaller SE  narrower interval

Fall 2002Biostat Statistical Inference - Hypothesis Testing Null Hypothesis Alternative Hypothesis Significance level Statistically significant Critical value Acceptance / rejection region p-value power Types of errors: Type I (  ), Type II (  ) One-sided (one-tailed) test Two-sided (two-tailed) test

Fall 2002Biostat The ideas in hypothesis testing are based on deductive reasoning - we assume that some probability model is true and then ask “What are the chances that these observations came from that probability model?”.

Fall 2002Biostat Hypothesis Testing Motivation A coin is flipped 50 times. Of the 50 flips, 45 are heads. Do you think it is a fair coin? (what do we mean by a fair coin?) (what is your reasoning behind a yes or no answer?)

Fall 2002Biostat

Fall 2002Biostat Hypothesis Testing Motivation Among year-old males the mean serum cholesterol is 211 mg/ml with standard deviation of 46 mg/ml. In a sample of 25 hypertensive men we find a mean serum-cholesterol level ( ) of 220 mg/ml. Is this strong enough evidence to convince us that the mean for the population of hypertensive men differs from 211 mg/ml? Is the data consistent with that model? What if= 230 mg/ml? What if = 250 mg/ml? What if the sample was of 100 instead of 25?

Fall 2002Biostat Hypothesis Testing Define:  = population mean serum cholesterol for male hypertensives Hypotheses: 1.Null Hypothesis: Generally, the hypothesis that the unknown parameter equals a fixed value. H 0 :  = 211 mg/ml 2.Alternative Hypothesis: contradicts the null hypothesis. H A :   211 mg/ml Typically, the alternative hypothesis is the thing you are trying to prove. The null hypothesis is a “straw man”. There is a reason for this, as we will see in a moment.

Fall 2002Biostat Hypothesis Testing Decision / Action: Either H 0 or H A must be true. Based on the data we will choose one of these hypotheses. But what if we choose wrong?! What is the probability of that happening?  = significance level = P(reject H 0 | H 0 true) 1-  = power = P(reject H 0 | H A true) Let’s construct a procedure that will help us decide between the two hypotheses …

Fall 2002Biostat Hypothesis Testing ***Suppose H o is true *** Then the sample mean ( ) for cholesterol should be “near”  0 = 211 mg/ml. Since is normally distributed with std. error, this suggests that, if H 0 is true, then should be near 0. Distribution of Z if H 0 is true Notice that, if H 0 is true, then only 5% of the time.

Fall 2002Biostat Hypothesis Testing Suppose we decide to reject H 0 if |Z| > 1.96; then P[reject H 0 | H 0 true] = In other words, we would only make a type I error 5% of the time. More generally, suppose we want P[reject H 0 | H 0 true] =  ; then our rule should be reject H 0 if |Z| >. The type I error rate is . Given  and H 0 we can construct a test of H 0 with a specified significance level (we can’t eliminate the possibility of making an error, but we can control it). But remember, we start by assuming that H 0 is true - we haven’t proved it is true. Therefore, we usually say if |Z| > 1.96 then we reject H 0. if |Z| < 1.96 then we fail to reject H 0.

Fall 2002Biostat Hypothesis Testing Let’s review what we’ve just done …. 1)I made an assumption - the null hypothesis is true (therefore Z ~ N(0,1)) 2)I determined if the data was consistent with that assumption (is it plausible that my observed Z came from a N(0,1) distribution?) 3)If the data are not consistent with the assumption, I conclude that either H 0 is true and something unusual happened (with probability  )  or, H 0 is not true. This 3-step procedure is the basis of all hypothesis tests, but … different hypotheses  different details

Fall 2002Biostat Hypothesis Testing Cholesterol Example: Let  be the (true but unknown population) mean serum cholesterol level for male hypertensives. We observe =220 mg/ml Also, we are told that for the general population...  0 = mean serum cholesterol level for males = 211 mg/ml  =std. dev. of serum cholesterol for males = 46 mg/ml NULL HYPOTHESIS: mean for male hypertensives is the same as the general male population. ALTERNATIVE HYPOTHESIS: mean for male hypertensives is different than the mean for the general male population. H 0 :  =  0 = 211 mg/ml H A :    0 (   211 mg/ml)

Fall 2002Biostat Hypothesis Testing Cholesterol Example: Test H 0 with significance level . Under H 0 we know: Therefore, Reject H 0 if > 1.96 gives an  = 0.05 test. Reject H 0 if

Fall 2002Biostat Hypothesis Testing Cholesterol Example: TEST: Reject H 0 if In terms of Z... Reject H 0 if Z 1.96

Fall 2002Biostat Rejection region

Fall 2002Biostat Hypothesis Testing p-value: smallest possible  for which the observed sample would still reject H 0. probability of obtaining a result as extreme or more extreme than the actual sample (given H 0 true). NOTE: probability calculations are always based on a model.

Fall 2002Biostat Hypothesis Testing p-value: Cholesterol Example = 220 mg/mln = 25  = 46 mg/ml H 0 :  = 211 mg/ml H A :   211 mg/ml What value of  would cause us to reject H o ? Need to find the value of that corresponds to the observed (Z). The p-value is given by: 2 * P[ > 220] = 2 * P[Z > 0.978] =.33

Fall 2002Biostat Determination of Statistical Significance for Results from Hypothesis Tests Either of the following methods can be used to establish whether results from hypothesis tests are statistically significant: (1)The test statistic Z can be computed and compared with the critical value at an  level of.05. Specifically, if H 0 :  =  0 versus H 1 :    0 are being tested and |Z| > 1.96, then H 0 is rejected and the results are declared statistically significant (i.e., p <.05). Otherwise, H 0 is accepted and the results are declared not statistically significant (i.e., p .05). We refer to this approach as the critical-value method. (2)The exact p-value can be computed, and if p <.05, then H 0 is rejected and the results are declared statistically significant. Otherwise, if p .05 then H 0 is accepted and the results are declared not statistically significant. We will refer to this approach as the p-value method.

Fall 2002Biostat Guidelines for Judging the Significance of p-value (Rosner pg 200) If.01 < p <.05, then the results are significant. If.001 < p <.01, then the results are highly significant. If p <.001, then the results are very highly significant. If p >.05, then the results are considered not statistically significant (sometimes denoted by NS). However, if.05 < p <.10, then a trend toward statistically significance is sometimes noted.

Fall 2002Biostat Hypothesis Testing and Confidence Intervals Confidence Interval: “Plausible” values for  are given by If the null hypothesis value of  (namely,  0 ) is not in this interval, then it is not “plausible” and should be rejected. In fact, this relationship is exact – 1)any value for  that is not in an  -level confidence interval will be rejected by a  -level hypothesis test. 2)an  -level confidence interval for  consists of all possible values of  0 that would not be rejected in a  -level hypothesis test.

Fall 2002Biostat Hypothesis Testing “how many sides?” Depending on the alternative hypothesis a test may have a one-sided alternative or a two- sided alternative. Consider H 0 :  =  0 We can envision (at least) three possible alternatives H A :    0 (1) H A :  <  0 (2) H A :  >  0 (3) (1)is an example of a “two-sided alternative” (2) and (3) are examples of “one-sided alternatives” The distinction impacts Rejection regions p-value calculation

Fall 2002Biostat Hypothesis Testing “how many sides?” Cholesterol Example: Instead of the two-sided alternative considered earlier we may have only been interested in the alternative that hypertensives had a higher serum cholesterol. H 0 :  = 211 H A :  > 211 Given this, an  = 0.05 test would reject when We put all the probability on “one-side”. The p-value would be half of the previous, p-value = P[ > 220] =.163

Fall 2002Biostat

Fall 2002Biostat Hypothesis Testing Through this worked example we have seen the basic components to the statistical test of a scientific hypothesis. Summary 1.Identify H 0 and H A 2.Identify a test statistic 3.Determine a significance level,  = 0.05,  = Critical value determines rejection / acceptance region 5.p-value 6.Interpret the result

Fall 2002Biostat Hypothesis Testing Null Hypothesis Alternative Hypothesis Significance level Statistically significant Critical value Acceptance / rejection region p-value power Types of errors: Type I (  ), Type II (  ) One-sided (one-tailed) test Two-sided (two-tailed) test