Please turn off cell phones, pagers, etc. The lecture will begin shortly.

Slides:



Advertisements
Similar presentations
Chapter 17/18 Hypothesis Testing
Advertisements

Chapter 10, part D. IV. Inferences about differences between two population proportions You will have two population proportions, p1 and p2. The true.
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Chapter 10: Hypothesis Testing
© 2010 Pearson Prentice Hall. All rights reserved Hypothesis Testing Using a Single Sample.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Chapter 8 Inference for Proportions
QM Spring 2002 Business Statistics Introduction to Inference: Hypothesis Testing.
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Today Concepts underlying inferential statistics
Today Today: Finish Chapter 9, start Chapter 10 Sections from Chapter 9: 9.1, 9.4, 9.5, 9.10 (know just class notes for these sections) Recommended Questions:
Probability Population:
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Inference about Population Parameters: Hypothesis Testing
CHAPTER 23 Inference for Means.
AM Recitation 2/10/11.
Math 227 Elementary Statistics
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
More About Significance Tests
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
The Probability of a Type II Error and the Power of the Test
Please turn off cell phones, pagers, etc. The lecture will begin shortly. There will be a quiz at the end of today’s lecture.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Chapter 8 Introduction to Hypothesis Testing
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Essential Statistics Chapter 131 Introduction to Inference.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Topic 7 - Hypothesis tests based on a single sample Sampling distribution of the sample mean - pages Basics of hypothesis testing -
Please turn off cell phones, pagers, etc. The lecture will begin shortly. There will be a quiz at the end of today’s lecture. Friday’s lecture has been.
Slide Copyright © 2008 Pearson Education, Inc. Chapter 11 Inferences for Population Proportions.
Normal Distr Practice Major League baseball attendance in 2011 averaged 30,000 with a standard deviation of 6,000. i. What percentage of teams had between.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
Chapter 221 What Is a Test of Significance?. Chapter 222 Thought Question 1 The defendant in a court case is either guilty or innocent. Which of these.
Section A Confidence Interval for the Difference of Two Proportions Objectives: 1.To find the mean and standard error of the sampling distribution.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Testing the Differences between Means Statistics for Political Science Levin and Fox Chapter Seven 1.
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Review Statistical inference and test of significance.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Quantitative Methods in the Behavioral Sciences PSY 302
Chapter Nine Hypothesis Testing.
Sampling and Sampling Distributions
© LOUIS COHEN, LAWRENCE MANION AND KEITH MORRISON
Statistical inference: distribution, hypothesis testing
Comparing Two Proportions
Chapter 8: Inference for Proportions
Chapter 9: testing a claim
Inferences About Means from Two Groups
Week 11 Chapter 17. Testing Hypotheses about Proportions
Chapter 9 Hypothesis Testing.
Essential Statistics Introduction to Inference
Comparing two Rates Farrokh Alemi Ph.D.
What are their purposes? What kinds?
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
Chapter 10: Comparing Two Populations or Groups
Presentation transcript:

Please turn off cell phones, pagers, etc. The lecture will begin shortly.

Lecture 23 This lecture will cover some background material that will help us to understand Chapter Inferences about proportions 3. Test for independence in a 2×2 table 2. Hypothesis testing

1. Inferences about proportions Suppose that we examine data from a sample and discover something interesting. For example: These observations may be true for our sample. But are they true for the whole population? One baseball player has a higher batting average than another in the first three weeks of the season Those who consume alcohol in small to moderate amounts have lower rate of heart disease than those who do not Amount of television watched by school children is negatively correlated with grades

Statistical inference We know what the value of a statistic (mean, correlation coefficient, proportion, relative risk, odds ratio) is in our sample. But we do not know what it is in the full population. Statistical inference is to make meaningful statements about the population based on what we observe in a sample. In other words, statistical inference is to generalize from our sample to the whole population. Inference requires us to know something about sampling error and sampling variation.

Sampling error A quantity computed from a sample (mean, correlation coefficient, relative risk, odds ratio, etc.) is not identical to the quantity that we would get from the entire population. For example, if our sample shows a relative risk of 1.2, that does not mean that the relative risk in the population is 1.2. If the sample is truly representative (i.e., randomly drawn from) the population, then the relative risk in the population should be close to the sample value of RR=1.2. But how close will it be? The difference between a quantity estimated from a sample and the quantity that we would get if we could use the entire sample is called sampling error.

Example Scores on the Stanford-Binet IQ test are normally distributed in the population with mean 100 and standard deviation 16. If we give this test to a sample of n = 100 subjects, the mean score in our sample could be exactly 100. But more likely, it will be something like 98.7, 100.4, or If the sample mean is… 98.7 then the sampling error is… 98.7 – 100 = – – 100 = – 100 = – 100 = – 0.7

How can we draw inferences? Most of the time, we do not know what the sampling error is, because the population mean (or the population correlation coefficient, relative risk, odds ratio, etc.) is unknown. We know what the value is in our sample. But we do not know what it is in the full population. So how can we draw inferences? To draw inferences about a population from a single sample, we must know something about sampling variation. That is, we need to know how large the sampling errors tend to be in samples as large as ours.

Inferences about proportions Early in the semester (Lectures 4-5), we learned the margin of error for a percentage from a simple random sample: If the number of units in the sample is n, then the margin of error for a percentage is approximately 100 / sqrt( n ). Based on this, we could compute the range of likely values for the percentage in the population. The range of likely values goes from sample percentage – margin of error sample percentage + margin of error to

Let’s apply this method to data from 2×2 tables to make some inferences about various populations Total Men Women YesNo Legal abortion for any reason Total “Should it be possible for a pregnant woman to obtain a legal abortion if the woman wants it for any reason?” Example The subjects in this study are essentially a simple random sample of American adults. We can use it to draw inferences about the full population of adults. We can also use it to draw inferences about adult men and adult women.

Total Men Women YesNo Legal abortion for any reason Total All adults: Estimated rate of support is 387 / 900 = 0.43 = 43% The margin of error is 100 / sqrt(900) = 100 / 30 = 3.3 % So we are quite sure that the rate of support in the population lies between 43 – 3.3 = 39.7% and = 46.3%.

Total Men Women YesNo Legal abortion for any reason Total Adult men: Estimated rate of support is 215 / 484 = 0.43 = 44.4% The margin of error is 100 / sqrt(484) = 100 / 22 = 4.5 % So we are quite sure that the rate of support among men lies between 44.4 – 4.5 = 39.9% and = 48.9%.

Total Men Women YesNo Legal abortion for any reason Total Adult women: Estimated rate of support is 172 / 416 = = 41.3% The margin of error is 100 / sqrt(416) = 100 / 20.4 = 4.9 % So we are quite sure that the rate of support among women lies between 41.3 – 4.9 = 36.4% and = 46.2%.

Range Men Women Est. % Adults Create a table of estimates and ranges: Notice that none of the three ranges covers 50%. From this sample, we may safely conclude that adults who support legalized abortion “for any reason” are a minority of adults. We may also conclude that supporting legalized abortion “for any reason” is a minority view among men, and it is also a minority view among women.

Is there a difference between men and women? Range Men Women Est. % Adults Notice that the rate of support is higher among men than among women. The difference in percentages is 44.4 – 41.3 = 2.1%. The relative risk is 44.4 / 41.3 = The odds ratio (which we can find from the 2×2 table) is 1.13.

Is it safe for us to conclude that support of legalized abortion “for any reason” is higher among men than among women in the population? Range Men Women Est. % Adults One way to decide is to compute a range of likely values for the difference in percentages, for the relative risk, or for the odds ratio. Formulas for these ranges do exist, but they are a bit complicated. Instead, we will answer this question by a different approach called hypothesis testing.

2. Hypothesis testing Hypothesis testing helps us to decide whether an interesting effect that we see in a sample also holds in the population. To perform a hypothesis test, we need to formulate a null hypothesis and an alternative hypothesis. The null hypothesis is that the interesting effect that we see in the sample is not real (i.e., that it does not hold in the population). The alternative hypothesis is that the interesting effect is real (i.e., that it does hold in the population). If the evidence against the null hypothesis is strong, then we reject the null and accept the alternative.

Example Without realizing it, we have already been performing simple hypothesis tests about population proportions. From the opinion poll, we concluded that adults who favor legalized abortion “for any reason” are in the minority. In effect, we set up the following null hypothesis: The rate of support of legalized abortion in the adult population is exactly 50%. We didn’t set up this null hypothesis because we thought that it was true. Rather, we set it up as a “straw man,” to see if we could disprove it with evidence from the data.

The alternative hypothesis was: The rate of support of legalized abortion in the adult population is not 50%, but something higher or something lower. From the sample, we computed the estimated rate of support and the range of likely values. The estimated rate of support was 43%, which provided some evidence that the null hypothesis is false. But the most powerful evidence came from the range of likely values: 39.7% to 46.3%. This range did not cover the null value of 50%. So we could safely reject the null hypothesis and accept the alternative.

“The population rate is 50%” is perhaps the most common null hypothesis concerning a rate or proportion. But on occasion, null values other than 50% may be of interest. Other null hypotheses “An overwhelming majority of the American public— more than 70%—support a a woman’s right to choose abortion for any reason.” Suppose an abortion-rights group has claimed: Or, suppose that an anti-abortion group has claimed: “By a 2:1 margin, the American public overwhelmingly rejects the idea of abortion on demand.” To see if the first claim is plausible, we could test the null hypotheses that the true percentage is 70%. To see if the second claim is plausible, the null hypothesis would be 33%.

The range of likely values (39.7% to 46.3%) does not cover the null value of 70%, nor does it cover the null value of 33%. In this case, the data would not support the claim of either group.