Review: Large Sample Confidence Intervals 1-  confidence interval for a mean: x +/- z  /2 s/sqrt(n) 1-  confidence interval for a proportion: p +/-

Slides:



Advertisements
Similar presentations
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Advertisements

Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Inference Sampling distributions Hypothesis testing.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Stat Day 16 Observations (Topic 16 and Topic 14)
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
IENG 486 Statistical Quality & Process Control
Stat 217 – Day 15 Statistical Inference (Topics 17 and 18)
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Chapter 8 Introduction to Hypothesis Testing
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 TUTORIAL 6 Chapter 10 Hypothesis Testing.
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 10 Hypothesis Testing
On Thursday, John will provide information about the project Next homework: John will announce in class on Thursday, and it’ll be due the following Thursday.
Confidence Intervals and Hypothesis Testing - II
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 9 Introduction to Hypothesis Testing.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Fundamentals of Hypothesis Testing: One-Sample Tests
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Statistics Pooled Examples.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Chapter 10 Hypothesis Testing
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
6.5 One and Two sample Inference for Proportions np>5; n(1-p)>5 n independent trials; X=# of successes p=probability of a success Estimate:
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Don’t forget HW due on Tuesday. Assignment is on web.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
On Thursday, I’ll provide information about the project Due on Friday after last class. Proposal will be due two weeks from today (April 15 th ) You’re.
Large sample CI for μ Small sample CI for μ Large sample CI for p
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
1 When we free ourselves of desire, we will know serenity and freedom.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 20 Testing Hypothesis about proportions
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
1 When we free ourselves of desire, we will know serenity and freedom.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Sample Size Needed to Achieve High Confidence (Means)
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Chapter Nine Hypothesis Testing.
6. Statistical Inference: Significance Tests
More on Inference.
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Chapter 9: testing a claim
Hypothesis Testing: Hypotheses
More on Inference.
Chapter Review Problems
YOU HAVE REACHED THE FINAL OBJECTIVE OF THE COURSE
Welcome Back Homework Due next Tuesday in class (on the web…)
Presentation transcript:

Review: Large Sample Confidence Intervals 1-  confidence interval for a mean: x +/- z  /2 s/sqrt(n) 1-  confidence interval for a proportion: p +/- z  /2 p(1-p)/sqrt(n) 1-  confidence interval for the difference between two means: x 1 – x 2 +/- z  /2 sqrt(s 2 1 /n 1 +s 2 2 /n 2 ) n >30 or so for means, np and n(1-p) both > 5 for proportions

In General: Estimate (that is normally distributed via the Central Limit Theorem) +/- standard deviation Z  /2 of estimate ( ) This gives an interval: (Lower Bound, Upper Bound) Interpretation: This is a plausible range for the true value of the number that we’re estimating.  is a tuning parameter for level of plausibility: smaller  = more conservative estimate.

Large Sample Confidence Intervals 1-  confidence interval for difference between two proportions: p 1 -p 2 +/- z  /2 sqrt[(p 1 (1-p 1 )/n 1 )+(p 2 (1-p 2 )/n 2 )] np and n(1-p) > 5 for all p’s…

Designing an Experiment and Choosing a Sample Size Example: Compare the shrinkage in a tumor due to a “new” cancer treatment relative to standard treatment 100 patients randomly assigned to “new” treatment or standard treatment x inew = reduction in tumor size for person i under new treatment x jstd = reduction in tumor size for person j under std treatment x new and s 2 new x std and s 2 std Mean and sample variance of the changes in size for the new and standard treatments

Suppose the data are: x new = 25.3 s new = 2.0 x std = 24.8 s std = % Confidence Interval for difference: x 1 – x 2 +/- z  /2 sqrt(s 2 1 /n 1 +s 2 2 /n 2 ) = 0.5 +/ What can we conclude?

There’s no difference? Can’t see a difference? There’s a difference, but it’s too small to care about?

There is a difference between: Can’t see a difference There’s no difference Situation for Cancer example (In cancer experiment, we can assume we care about small differences.)

Can’t see a difference (that is big enough to care about) = wasted experiment AVOID / PREVENT THE WASTE AND ASSOCIATED TEARS USE SAMPLE SIZE PLANNING

Sample Size Planning Length of a 1-  level confidence interval is: “2 z  /2 std deviation of estimate” 2z  /2 s/sqrt(n) 2z  /2 p(1-p)/sqrt(n) 2z  /2 sqrt((s 2 1 /n 1 )+(s 2 2 /n 2 )) 2z  /2 sqrt[(p 1 (1-p 1 )/n 1 )+(p 2 (1-p 2 )/n 2 )]

1.Suppose we want a 95% confidence interval no wider than W units. 2.  is fixed. Assume a value for the standard deviation (or variance) of the estimator. 3.Solve for an n (or n 1 and n 2 ) so that the width is less than W units. 4.When there are two sample sizes (n 1 and n 2 ), we often assume that n 1 = n 2.

Cancer example Let W = 0.1. Want 95% CI for difference between means with width less than W. Suppose s 2 new = s 2 std = 6 (conservative guess) W > 2z  /2 sqrt((s 2 new /n 1 )+(s 2 std /n 2 )) 0.1 > 2(1.96)sqrt(6/n + 6/n) 0.1 > 3.92sqrt(12/n) 0.01 > ( )12/n n > (each group…) Book’s B = our W/2

Hypothesis testing and p-values (Chapter 9) We used confidence intervals in two ways: 1.To determine an interval of plausible values for the quantity that we estimate. Level of plausibility is determined by 1- . 90% (  =0.1) is less conservative than 95% (  =0.05) is less conservative than 99% (  =0.01)... 2.To see if a certain value is plausible in light of the data: If that value was not in the interval, it is not plausible (at certain level of confidence). Zero is a common certain value to test, but not the only one. Hypothesis tests address the second use directly

Example: Dietary Folate Data from the Framingham Heart Study n = 333 Elderly Men Mean = x = Std Dev = s = Can we conclude that the mean is greater than 300 at 5% significance? (same as 95% confidence)

Five Components of the Hypothesis test: 1. Null Hypothesis = “What we want to disprove” = “H 0 ” = “H not” = Mean dietary folate in the population represented by these data is <= 300. =  <= Alternative Hypothesis = “What we want to prove” = “H A ” = Mean dietary folate in the population represented by these data is > 300. =  > 300

3. Test Statistic To test about a mean with a large sample test, the statistic is z = (x –  )/(s/sqrt(n)) (i.e. How many standard deviations (of X) away from the hypothesized mean is the observed x?) 4. Significance Level of Test, Rejection Region, and P-value 5. Conclusion Reject H 0 and conclude H A if test stat is in rejection region. Otherwise, “fail to reject” (not same as concluding H 0 – can only cite a “lack of evidence” (think “innocent until proven guilty”) (Equivalently, reject H 0 if p-value is less than .) Next page

Significance Level:  =1% or 5% or 10%... (smaller is more conservative) (Significance = 1-Confidence) Rejection Region: –Reject if test statistic in rejection region. –Rejection region is set by: Assume H 0 is true “at the boundary”. Rejection region is set so that the probability of seeing the observed test statistic or something further from the null hypothesis is less than or equal to  P-value –Assume H 0 is true “at the boundary”. –P-value is the probability of seeing the observed test statistic or something further from the null hypothesis. –= “observed level of significance” Note that you reject if the p-value is less than . (Small p-values mean “more observed significance”)

Example: H 0 :  300 z (x-  )/(s/sqrt(n)) = (336.4 – 300)/(193.4/sqrt(333)) = 3.43 Significance level = 0.05 When H 0 is true, Z~N(0,1). As a result, the cutoff is z 0.05 = (Pr(Z>1.645) = 0.05.) P-value = Pr(Z>3.43 when true mean is 300) = Reject. Mean is greater than 300. Would you reject at significance level ?

Picture Test Statisistic Density Rejection region Distribution of Z = (X – 300)/(193.4/sqrt(333)) when true mean is 300. Area to right of =0.05 = sig level 3.43 Area to right of 3.43 = = p-value Test statistic Observed Test Statistic

One Sided versus Two Sided Tests Previous test was “one sided” since we’d only reject if the test statistic is far enough to “one side” (ie. If z > z 0.05 ) Two sided tests are more common (my opinion): H 0 :  =0, H A :  does not equal 0

Two Sided Tests (cntd) Test Statistic (large sample test of mean) z = (x –  )/(s/sqrt(n)) Rejection Region: reject H 0 at signficance level  if |z|>z  /2 i.e. if z>z  /2 or z<-z  /2 Note that this “doubles” p-values. See next example.

Example: H 0 :  =300, H A :  doesn’t equal 300 z=(x-  )/(s/sqrt(n)) = (336.4 – 300)/(193.4/sqrt(333)) = 3.43 Significance level = 0.05 When H 0 is true, Z~N(0,1). As a result, the cutoff is z =1.96. (Pr(|Z|>1.96)=2*Pr(Z>1.96)=0.05 P-value = Pr(|Z|>3.43 when true mean is 300) = Pr(Z>3.43) + Pr(Z<-3.43) = 2(0.0003)= Reject. Mean is not equal to 300. Would you reject at significance level ?

Picture 1.96 Test Statisistic Density Rejection region Distribution of Z = (X – 300)/(193.4/sqrt(333)) when true mean is Area to right of 3.43 = Test statistic Area to left of = Rejection region Sig level = area to right of area to the left of = 0.05=  Pvalue=0.0006=Pr(|Z|>3.43)

Power and Type 1 and Type 2 Errors Truth H 0 True H A True Action Fail to Reject H 0 Reject H 0 correct Type 1 error Type 2 error Significance level =  =Pr( Making type 1 error ) Power = 1–Pr( Making type 2 error )

Assuming H 0 is true, what’s the probability of making a type I error? H 0 is true means true mean is  0. This means that the test statistic has a N(0,1) distribution. Type I error means reject which means |test statistic| is greater than z  /2. This has probability .