Stat 512 – Lecture 12 Two sample comparisons (Ch. 7) Experiments revisited.

Slides:



Advertisements
Similar presentations
Comparing Two Proportions
Advertisements

Hypothesis Testing: Intervals and Tests
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Comparing Two Proportions
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Stat 301 – Day 17 Tests of Significance. Last Time – Sampling cont. Different types of sampling and nonsampling errors  Can only judge sampling bias.
Stat 512 – Lecture 14 Analysis of Variance (Ch. 12)
Stat 301 – Day 15 Comparing Groups. Statistical Inference Making statements about the “world” based on observing a sample of data, with an indication.
Stat 217 – Day 21 Cautions/Limitations with Inference Procedures.
Stat 217 – Day 22 Review. Last Time – Subtle Issues Make sure you have a random sample or interval doesn’t tell you much! Make sure you have a sample!
Stat 301- Day 32 More on two-sample t- procedures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
Stat 217 – Day 27 Chi-square tests (Topic 25). The Plan Exam 2 returned at end of class today  Mean.80 (36/45)  Solutions with commentary online  Discuss.
Today’s Agenda Review of ANOVA Module 9 Review for Exam 2 Please log in with your UMID and your participation will be graded by the number of questions.
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
Stat 301 – Day 21 Large sample methods. Announcements HW 4  Updated solutions Especially Simpson’s Paradox  Should always show your work and explain.
Stat 301 – Day 22 Relative Risk. Announcements HW 5  Learn by Doing Lab 2-3  Evening Office hours  Friday: 10-11, 12-1.
Stat 512 – Day 8 Tests of Significance (Ch. 6). Last Time Use random sampling to eliminate sampling errors Use caution to reduce nonsampling errors Use.
BS704 Class 7 Hypothesis Testing Procedures
Stat 217 – Day 20 Comparing Two Proportions The judge asked the statistician if she promised to tell the truth, the whole truth, and nothing but the truth?
Stat 512 – Lecture 11 Type I/Type II Errors Open Applets page Review.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Stat 301 – Day 26 Prediction Intervals Paired t-test.
Stat 512 – Lecture 15 Two-way ANOVA (Ch. 12). Last Time – Analysis of Variance (ANOVA) When: Want to compare two or more population/true treatment means.
Lecture Slides Elementary Statistics Twelfth Edition
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
QNT 531 Advanced Problems in Statistics and Research Methods
More About Significance Tests
Comparing Two Population Means
Comparing Two Proportions
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Chapter 8 Hypothesis Testing with Two Samples 1. Chapter Outline 8.1 Testing the Difference Between Means (Large Independent Samples) 8.2 Testing the.
Chapter 22: Comparing Two Proportions
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
+ Chi Square Test Homogeneity or Independence( Association)
Copyright © 2010 Pearson Education, Inc. Slide Beware: Lots of hidden slides!
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
AP Statistics Section 11.1 B More on Significance Tests.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Name Mean of Sampling Distribution Standard Deviation/Error of Sampling Distribution 1 sample z-Interval for Proportions 1 sample z-interval for Means.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Two-Sample Hypothesis Testing
Simulation-Based Approach for Comparing Two Means
Lecture Slides Elementary Statistics Twelfth Edition
Comparing Two Proportions
Stat 217 – Day 28 Review Stat 217.
Hypothesis Testing Two Proportions
Comparing Two Proportions
Hypothesis tests for the difference between two proportions
Presentation transcript:

Stat 512 – Lecture 12 Two sample comparisons (Ch. 7) Experiments revisited

Announcements No HW this week!  PP 10 due Thursday, preview next example  Readings (no starred sections) Final Exam schedule Thank you for feedback Midterms returned at end of class  Grades updated on blackboard  Exam solutions online Probably a sub on Thursday

What’s left to do? Practice the concepts  p-value, confidence  scope of conclusions New techniques for new situations  Minitab number crunching  Type and number of variables Keep thinking about whether a technique will apply to your project

Example 1: Newspaper Credibility Nationwide sample of 1,002 adults interviewed via telephone (under direction of Princeton Survey Research Assoc), 2002 Comparison to 1998 nationwide sample Variable of interest: Would you rate your local daily newspaper as largely believable?  Can consider 1998 vs as second variable

Example 1: Newspaper Credibility Two-way table Segmented bar graph Some evidence that believability of local paper was higher in 1998, but not a very large difference (.670 vs..634)

Example 1: Newspaper Credibility Want to compare a categorical variable (believability rating) between two populations Let     = difference in population proportion who rate daily paper as largely believable  H 0 :     = 0 (no change in proportion)  H a :     > 0 (more believable in 1998 Assume H 0 is true, could these sample proportions be coming from the same population proportion?

Example 1: Newspaper Credibility Estimate: difference in sample proportions Observed: =.036 Could a difference this large happen by chance?  If we repeatedly took two samples from the same population, how often would we observe a difference in the sample proportions at least this large?

Sampling Distribution of Mean:     Standard deviation  Variances add…  When null hypothesis is true:

Shape?  Normal if sample size is large…  At least 5 successes and 5 failures in each group Sampling Distribution of n 1  1 > 5, n 1 (1-  1 ) >5 and n 2  2 > 5, n 2 (1-  2 ) > 5

Example 1: Newspaper Credibility Large sample sizes √ Independent random samples√

Example 1: Newspaper Credibility  Difference in sample proportions   z values 

Summary The difference in the proportions is statistically significant (p-value =.051 ≈.05)  Difference in sample proportions observed is larger than we would expect from the random sampling process alone We believe a smaller proportion of the population would rate their daily newspaper as largely believable in 2002 than in 1998 We are 90% confident that the population proportions differ by.00 to.072 so this may not be considered a practically significant decrease. Cause and Effect? No Generalizable? Yes

Example 2: Letrozole and Cancer New England Journal of Medicine, 2003 Study of over 5,000 women, randomly assigned to letrozole or placebo  Do we have a random sample of letrozole users and a random sample of placebo users? Willing to generalize to all women with breast cancer?  Do we have random assignment? Hypotheses will be about “treatment effect” instead of about difference in population proportions

Example 2: Letrozole and Cancer Want to compare a categorical response variable (whether survived) between two explanatory variable groups Consider underlying treatment effect (increase in survival rate) from using letrozole  H 0 : true treatment effect = 0 (letrozole has no real effect on survival)  H a : true treatment effect > 0 (letrozole increases survival rate) Assume H 0 is true, could the difference between these two group proportions occur from the randomization process alone?

Example 2: Letrozole and Cancer LetrozolePlaceboTotal Disease-free survival Not disease-free survival Total Data are in direction conjectured, have a higher rate of survival in the letrozole group How often would we find this many of the successes in the letrozole group by chance (from the randomization process) alone?

Example 2: Letrozole and Cancer Using normal distribution Large sample sizes √ Randomization √

Example 2: Letrozole and Cancer Conclusions: Not only was p-value small, the group proportions were more than 7 standard deviations apart. We are 95% confident that the percentage of women surviving will be 4.3% to 7.7% higher if letrozole is used. Cause and effect? Yes Generalizable? Risky

Example 3: Body Temperatures Minitab commands depend on which format typed data in Statistical Analysis  Observational study, treating as independent random samples  Samples show slight tendency for higher body temperatures among women

Example 3: Body Temperatures Perhaps the population means are equal, and these sample means differ just based on random sampling variability H 0 :      H a :     ≠  (“differs”) Technical conditions  Normal populations (works ok if n 1, n 2 >20)  Large populations  Independent random samples

Example 3: Body Temperatures

Test statistic Example 3: Body Temperatures Result is statistically significant at 5% level but not 1% level. Moderate evidence that these sample means are further apart than we would expect from random sampling variability alone if the population means were equal. Conclude that the mean body temperature differs.

Example 4: Sleep Deprivation Case 2: Randomized Experiment When samples sizes are large or each group distribution is normal, the randomization distribution is well approximated by the t distribution  Pooled t test?

Example 4: Sleep Deprivation Case 2: Randomized Experiment Validity?

Example 4 Conclusions 1. Statistically significant 2. Cause and effect conclusion valid 3. Generalizing to larger population? Is it possible that we are making the wrong decision?  Yes, type I error…

Summary Type of study  Do you have (independent) random samples from two populations? OR Do you have a randomized experiment?  Same calculations, different conclusions Are the sample sizes large for you to use normal/t procedures?  With small sample sizes, use Fisher’s Exact Test (two-way table simulation) or randomization tests from before  With larger samples, get test statistic and confidence interval conveniently

Comments on Exam Average score around 81  Still a bit to do  Focus on big issues: Types of study and consequences on conclusions (e.g., cause and effect), validity of methods, interpretation of results (What is a p-value? What is a confidence interval?)  Solutions on line with commentary Estimated current course grade  Average around.85

For Thursday Preview Example 1, Lecture 13 PP 10 online Read Ch. 7, chi-square procedures