 Last time we discussed t-tests: how to use sample means of quantitative variables to make inferences about parameters.  Today we’ll use the very same.

Slides:



Advertisements
Similar presentations
Small differences. Two Proportion z-Interval and z-Tests.
Advertisements

Significance Tests Hypothesis - Statement Regarding a Characteristic of a Variable or set of variables. Corresponds to population(s) –Majority of registered.
INFERENCE: SIGNIFICANCE TESTS ABOUT HYPOTHESES Chapter 9.
Sociology 601 Class 8: September 24, : Small-sample inference for a proportion 7.1: Large sample comparisons for two independent sample means.
Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Lab 4: What is a t-test? Something British mothers use to see if the new girlfriend is significantly better than the old one?
Two Sample Hypothesis Testing for Proportions
1 Test a hypothesis about a mean Formulate hypothesis about mean, e.g., mean starting income for graduates from WSU is $25,000. Get random sample, say.
Stat 301 – Day 28 Review. Last Time - Handout (a) Make sure you discuss shape, center, and spread, and cite graphical and numerical evidence, in context.
The Normal Distribution. n = 20,290  =  = Population.
Sociology 601 Class 7: September 22, 2009
Sociology 601 Class12: October 8, 2009 The Chi-Squared Test (8.2) – expected frequencies – calculating Chi-square – finding p When (not) to use Chi-squared.
Stat 217 – Day 15 Statistical Inference (Topics 17 and 18)
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Stat 301 – Day 27 Sign Test. Last Time – Prediction Interval When goal is to predict an individual value in the population (for a quantitative variable)
BCOR 1020 Business Statistics
5-3 Inference on the Means of Two Populations, Variances Unknown
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Inference about Population Parameters: Hypothesis Testing
Chapter 9 Comparing Means
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
Chapter 13: Inference in Regression
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Statistics Pooled Examples.
CHAPTER 21: Comparing Two Proportions
More About Significance Tests
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
T tests comparing two means t tests comparing two means.
Comparing Two Proportions
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
STA291 Statistical Methods Lecture 24. Comparing Two Proportions Sample of 25, year-olds: Men: 84.9% diploma rate Women: 88.1% diploma rate Are.
Testing means, part II The paired t-test. Outline of lecture Options in statistics –sometimes there is more than one option One-sample t-test: review.
7. Comparing Two Groups Goal: Use CI and/or significance test to compare means (quantitative variable) proportions (categorical variable) Group 1 Group.
Chapter 23 Inference for One- Sample Means. Steps for doing a confidence interval: 1)State the parameter 2)Conditions 1) The sample should be chosen randomly.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.4 Analyzing Dependent Samples.
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
12.2 Inference for a Population Proportion We are interested in the unknown proportion p of a population that has some outcome – call the outcome we are.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
2 sample interval proportions sample Shown with two examples.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Inference for Proportions Section Starter Do dogs who are house pets have higher cholesterol than dogs who live in a research clinic? A.
1 Math 4030 – 10b Inferences Concerning Proportions.
Chapter 22 Comparing Two Proportions.  Comparisons between two percentages are much more common than questions about isolated percentages.  We often.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Difference Between Two Means.
Week 111 Review - Sum of Normal Random Variables The weighted sum of two independent normally distributed random variables has a normal distribution. Example.
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Inference about proportions Example: One Proportion Population of students Sample of 175 students CI: What proportion (percentage) of students abstain.
Of the following situations, decide which should be analyzed using one-sample matched pair procedure and which should be analyzed using two-sample procedures?
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 8: Tests of significance and confidence.
Hypothesis Tests for 1-Proportion Presentation 9.
Essential Statistics Chapter 191 Comparing Two Proportions.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Chapters 22, 24, 25 Inference for Two-Samples. Confidence Intervals for 2 Proportions.
One-Sample Inference for Proportions
business analytics II ▌assignment one - solutions autoparts 
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Module 22: Proportions: One Sample
Week 11 Chapter 17. Testing Hypotheses about Proportions
Statistical Inference for the Mean Confidence Interval
Section 12.2: Tests about a Population Proportion
CHAPTER 6 Statistical Inference & Hypothesis Testing
Sampling Distributions
Inference Confidence Interval for p.
Comparing Two Proportions
Presentation transcript:

 Last time we discussed t-tests: how to use sample means of quantitative variables to make inferences about parameters.  Today we’ll use the very same principles, but we’ll do tests to use sample proportions of categorical variables to make inferences about parameters.

 E.g., a random sample of households in a poor neighborhood in Lima, Peru, finds that 38% of the households are headed by single women.  Is this neighborhood proportion representative of the population proportion of Lima poor neighborhoods), or is it different enough to test statistically significant?

 E.g., since a non-government organization began providing enterprise training & subsidies to women in a poor neighborhood in Buenos Aires, a random sample finds that the neighborhood’s proportion of income-earning adult women has increased from 34% to 39%.  Is this after vs. before proportion due to sampling variability? Or is it different enough to be statistically significant?

 The same random sample finds that 41% of the sampled women who participated in the program & 36% of the sampled women who didn’t participate are now engaged in income-earning activities.  Is this difference in proportions due to sampling variability? Or is it different enough to be statistically significant?

 Sample proportion: a binomial count within a sample divided by the sample size-n.  It is a categorical variable (e.g., yes vs. no; lived vs. died) Sample Proportion

Premises  Random sample of independent observations.  Binomial (i.e. ‘success’/’failure’) count.  The population must be at least 10 times larger than the sample.  There must be at least 10 observations for p & at least 10 observations for 1 – p.

 If these sample assumptions are met, then the difference between the two proportions being compared (e.g., observed proportion vs. benchmark; two- sample proportions; after vs. before proportion) is approximately standard normal in distribution.

 Moore/McCabe use a more precise than ‘traditional’ estimate of confidence interval for proportions called the ‘Wilson estimate.’  The Stata command is:. ci binaryvar, binomial wilson  As we’ll later discuss, there are other options besides ‘wilson’.

 Wilson estimate of the population proportion based on sample data: = X + 2/n + 4  Standard error of the proportion (i.e. based on sample data):

 Approximate level C confidence interval for the proportion:  Large-sample (n >5) significance test for a population proportion

 The Stata command to find a ‘Wilson estimate’ of a population proportion based on a confidence interval of sample data:. ci hmath, binomial wilson. ci hmath, b w level(90). ci hmath, b w l(99)  Other binomial options: exact, agresti, jeffreys.

To Repeat: The Steps  Step 1: Ask if the binomial assumptions are fulfilled (including that both the expected #failures & the expected #successes >10).  Step 2: Do a frequency table or bar graph of the binary variable & display the variable’s sample proportion.  Step 3: If all checks out okay, state the null hypothesis & the alternative hypothesis.  Step 4: Conduct the hypothesis test.

. use hsb2, clear. gen hmath=math>=60 & math<.. la var hmath “Honors math (>=60)” Example

. tab hmath Honors math (>=60)Freq.PercentCum Total

. ci hmath, binomial wilson Variable Obs Mean Std. Err. [95% Conf Interval] hmath |  Ho: hmath=.265; Ha: hmath~=.265.  Is hmath significantly different from.265 (two-sided test, i.e. does the mean of hmath fall outside the confidence interval)?  Does using other command options (or no option except ‘binomial’) make a difference? A Two-Sided CI Hypothesis Test

How to Conduct a Large-Sample Hypothesis Test for a Population Proportion: prtest. ‘prtest’ allows testing one- or two- sided hypotheses.  Check the premises & data.  Test the hypothesis.

. prtest hmath =.265 One-sample test of proportion hmath:Number of obs = 200 Variable Mean Std. Err. z P>z [95% Conf. Interval] hmath Ho: proportion(hmath) =.265 Ha: hmath.265 z = z = z = P z = P > z = Conclusion: Fail to reject Ho.

 Recall that conclusions are always uncertain.

How to Conduct a Test Comparing Two Proportions Ho: female hmath = male hmath Ha: female hmath ~= male hmath  Check the premises & the data:.

tab female hmath female=1 | math>=60 male=0 | 0 1 | Total male | | 91 female | | Total | | 200

 State the hypotheses. Ho: female hmath = male hmath Ha: female hmath ~= male hmath

 Step 3: test the hypothesis.

. prtest hmath, by(female) Two-sample test of proportion male: Number of obs = 91 female: Number of obs = 109 Variable Mean Std. Err. z P>z [95% Conf. Interval] male female diff under Ho: Ho: proportion(male) - proportion(female) = diff = 0 Ha: diff 0 z = z = z = P z = P > z =  Conclusion: Fail to reject Ho.

 Substantive conclusions? Next research steps?

 Here’s an after vs. before example.. Does a summer math course significantly increase the proportion of students who qualify for honors math?  Check the sample premises.  Display the data proportion.  Test the hypothesis: Ho: post-test honors proportion = pre-test honors proportion (i.e. difference = 0) Ha: post-test honors proportion > pre-test honors proportion (i.e. difference > 0)

. prtesti Two-sample test of proportion x: Number of obs = 191 y: Number of obs = Variable | Mean Std. Err. z P>|z| [95% Conf. Interval] x | y | diff | | under Ho: Ho: proportion(x) - proportion(y) = diff = 0 Ha: diff 0 z = z = z = P |z| = P > z =

 Test conclusion?  Results are always uncertain.