Chapter 10 AP Statistics St. Francis High School Fr. Chris.

Slides:



Advertisements
Similar presentations
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Advertisements

Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Objectives (BPS chapter 15)
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
T-tests Computing a t-test  the t statistic  the t distribution Measures of Effect Size  Confidence Intervals  Cohen’s d.
Introduction to Hypothesis Testing
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Section 9.2 Testing the Mean .
Chapter 9 Hypothesis Testing.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Probability Population:
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Overview of Statistical Hypothesis Testing: The z-Test
Descriptive statistics Inferential statistics
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Hypothesis Testing. Section 8-1: Steps in Hypothesis Testing – Traditional Method Learning targets – IWBAT understand the definitions used in.
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
14. Introduction to inference
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Lesson Carrying Out Significance Tests. Vocabulary Hypothesis – a statement or claim regarding a characteristic of one or more populations Hypothesis.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.2.
Section 9.2 Testing the Mean  9.2 / 1. Testing the Mean  When  is Known Let x be the appropriate random variable. Obtain a simple random sample (of.
AP Statistics Section 11.2 A Inference Toolbox for Significance Tests
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Significance Tests: THE BASICS Could it happen by chance alone?
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
Essential Statistics Chapter 131 Introduction to Inference.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
Chapter 20 Testing hypotheses about proportions
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
The z test statistic & two-sided tests Section
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chapter 8 Parameter Estimates and Hypothesis Testing.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
SUMMARY. Central limit theorem Statistical inference If we can’t conduct a census, we collect data from the sample of a population. Goal: make conclusions.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
© Copyright McGraw-Hill 2004
AP Statistics Section 11.2 A Inference Toolbox for Significance Tests.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Uncertainty and confidence If you picked different samples from a population, you would probably get different sample means ( x ̅ ) and virtually none.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
DSCI 346 Yamasaki Lecture 1 Hypothesis Tests for Single Population DSCI 346 Lecture 1 (22 pages)1.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
More on Inference.
Tests of significance: The basics
More on Inference.
The Practice of Statistics in the Life Sciences Fourth Edition
Lecture 10/24/ Tests of Significance
Carrying Out Significance Tests
Section 11.1: Significance Tests: Basics
Presentation transcript:

Chapter 10 AP Statistics St. Francis High School Fr. Chris

Introduction to Inference 1. Estimating with Confidence 2. Tests of Significance 3. Using Significance Tests 4. Inference as a Decision.

Confidence Interval Two parts: the estimate and the confidence level Estimate±Margin of Error If you have an SRS, you can assume that the mean is normally distributed If you can estimate the spread, you can find the boundaries where 90%, 95% or 99% of the area under this normal curve lie.

A Pharmaceutical Manufacturer To establish the concentration of an active ingredient of a drug, the same specimen of a particular batch is measured three times: , , and The standard deviation is known to be grams per liter. What is the 95% confidence interval? So.8404 is a good estimate of the actual amount, but how big a margin of error do we need to be 95% confident the actual measurement is in our interval?

The 95% Confidence Interval 95% On a standard normal curve (N[0,1]), 95% is between and standard deviations Since we know sigma, we convert the drug’s distribution to the standard normal so the interval is Where z = 1.96 (95% Confidence)

Substituting the numbers... So we can be 95% sure that the actual measurement of the sample is between.8303 and.8505 We would have a more narrow interval if we are willing to drop our confidence interval to 90%. On a standard normal curve, 90% of the area is between and Or (.8339,.8469)

Caution!!!!! Data must be SRS from the population Formulas for more complex designs are available Fancy formulas cannot rescue badly produced data Outliers can influence x-bar If n is small (<15), population needs to be normal Need to know the sigma of the population (If sample is huge, you can use s, otherwise, wait for more methods next chapter!)

10.2 Tests of Significance H 0 and H a One Sided vs. Two Sided Alternative P-Value  Critical values and the z Statistic

Blood Pressure Males age have a systolic blood pressure of 128 with a  =15. A researcher looks at the records of 72 male ATC’s in this age group and found their average systolic blood pressure to be Obviously this is higher than the average. It is likely that this is due to chance variation, or that ATC’s have a significantly higher blood pressure? It is SIGNIFICANT if the result is unlikely to be caused by chance.

The Null Hypothesis Males age have a systolic blood pressure of 128 with a  = of our ATC’s average at over 131. The null Hypothesis is to say There is no difference between ATC’s and males in general. H 0 :  =  0 H a :  ≠  0 One way to state the alternative is to say that the average of ATC’s is different, so the alternative hypothesis is This is a “Two-tail” test, since the alternative includes two possibilities: Either the ATC’s have a higher blood pressure or they have a lower blood pressure.

Let’s get graphic A two tailed test means that the sample mean falling on either extreme will be considered grounds to reject the H0H0 Knowing what we know about the Normal Distribution, we know the critical z score that partitions the inner 95% is where the cumulative area is at 2.5% on both ends (thus our  =.05)

Are ATC’s average on the fringe? If they belong to the same distribution as anyone else, we would expect the  of the sample mean of 72 randomly selected males to be So to convert to the Standard Normal Curve...

Fail to reject the Null Hypothesis Since is less than 1.96, we are in the 95% of sample means, and we cannot reject the H0 H0 … But what if we only consider the alternative that ATC’s, due to the stressful nature of their work, would have a HIGHER blood pressure than other males...

The One Tailed Test The one tailed test has the same H0 H0 :  =0 =0 but a different alternative hypothesis, viz., Ha Ha :  >0>0 The critical value is different, since 95% is on the left, and 5% is on the upper end.

Rejecting the Null Hypothesis Since our critical value is now 1.645, and the sample mean of has a z value of , it is in the extreme upper 5%, so we can reject the null hypothesis in favor of the alternative hypothesis: The mean of ATC’s systolic blood pressure is higher than other males of this age group.

Review of terms Notice our  is still.05, but only when we consider the one-tailed test (which is appropriate in this case), are we able to reject the null hypothesis. Computers and statistical calculators can compute a p-value (probability) which allows to see how unlikely the average is if the null hypothesis is true.

Using your calculator Select STAT, then TESTS Select “1:Z-Test” Select “Stats” since we don’t have the raw data (72 sbp’s) Notice that it shows the z-score, and the p-value (.08 is bigger than.05). So we cannot reject the null hypothesis. Go back and instead of selecting “Calculate”, select “Draw” First let’s select the two tailed alternative hypothesis then press Calculate

Using your calculator for One Tail Test Select STAT, then TESTS Select “1:Z-Test” Select “Stats” since we don’t have the raw data (72 sbp’s) First let’s select the two tailed alternative hypothesis  >  0 then press Calculate Notice that it shows the z-score, and the p-value (.04 is less than.05). So we can reject the null hypothesis. Go back and instead of selecting “Calculate”, select “Draw”

10.3 Using Significance Tests Choosing a level of significance When is Statistical Inference Valid Examples

Choosing a level P-values are more informative than Traditional  levels Small effects can be highly significant Lack of signifcance doesn’t imply H 0 is true

Validity Faulty data collection Outliers Testing a hypothesis on the same data that suggests a hypothesis can invalidate the test Many tests at once can produce significance due to chance alone, even if the H 0 ’s are true

Examples Hawthorne Effect: Will background music increase productivity? Any short term change can effect, regard less of what it is (See Example 10.19, p 590) A researcher looking for evidence of ESP tests 500 subjects. Four do significantly better (P<.01) than random guessing. Can we conclude these 4 have ESP? What should the researcher do to test whether these 4 have ESP? (10.61 p. 592)

Kidney Failure Patients with kidney failure are often treated by dialysis, and can also cause the retention of phosphorous that must be corrected by changes in the diet. One patient in a study had these 6 measurements over time: 5.6, 5.3, 4.6, 4.8, 5.7, 6.4 (These are separated over time and can be considered an SRS of the patient’s blood phosphorous level). If this level varies normally with  = 0.9 mg/dl, give a 90% confidence interval for the mean blood phosphorus level. (4.7956, )

Conclusions The normal range of phosphorous in the blood is considered to be 2.6 to 4.8 mg/dl. Is there strong evidence that the patient has a mean phosphorous level that exceeds 4.8? H 0 :  =4.8 H a :  >4.8 Z=1.633 P-value is There is evidence that the patient’s mean phosphorous level is higher than 4.8 mg/dl, but it is not strong. Whether this is sufficient evidence is a judgement call..05 is not sacred What if the inference is use to determine a change in diet? Surgury? What are the hypotheses? What is the z-score/P-value? Conclusion?