Stat 31, Section 1, Last Time Hypothesis Testing –Careful about 1-sided vs. 2-sided Connection: CIs - Hypo Tests 3 Traps of Hypo Testing –Statistically.

Slides:



Advertisements
Similar presentations
Stat 31, Section 1, Last Time Paired Diff’s vs. Unmatched Samples
Advertisements

1 Difference Between the Means of Two Populations.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
Lecture 4 Chapter 11 wrap-up
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 23 = Finish Chapter “Confidence Interval Estimation” (CIE)
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Chapter 9 Hypothesis Testing.
Ch. 9 Fundamental of Hypothesis Testing
5-3 Inference on the Means of Two Populations, Variances Unknown
1 Confidence Interval for Population Mean The case when the population standard deviation is unknown (the more common case).
Stor 155, Section 2, Last Time 2-way Tables –Sliced populations in 2 different ways –Look for independence of factors –Chi Square Hypothesis test Simpson’s.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
Stat 31, Section 1, Last Time T distribution –For unknown, replace with –Compute with TDIST & TINV (different!) Paired Samples –Similar to above, work.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
More About Significance Tests
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
Stor 155, Section 2, Last Time Inference for Regression –Least Square Fits –Sampling distrib’ns for slope and intercept Regression Tool –Gave many useful.
Significance Tests in practice Chapter Tests about a population mean  When we don’t know the population standard deviation σ, we perform a one.
Dan Piett STAT West Virginia University
Stat 13, Thu 5/10/ CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When.
Chapter 8 Estimation Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Stor 155, Section 2, Last Time Hypothesis Testing –Assess strength of evidence with P-value P-value interpretation: –Yes – No –Gray – level –1 - sided.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Last Time Central Limit Theorem –Illustrations –How large n? –Normal Approximation to Binomial Statistical Inference –Estimate unknown parameters –Unbiasedness.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4 Estimation of a Population Mean  is unknown  This section presents.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Essential Statistics Chapter 131 Introduction to Inference.
Stor 155, Section 2, Last Time Review…. Stat 31 Final Exam: Date & Time: Tuesday, May 8, 8:00-11:00 Last Office Hours: Thursday, May 3, 12:00 - 5:00 Monday,
Stor 155, Section 2, Last Time Prediction in Regression –Given new point X 0, predict Y 0 –Confidence interval for mean –Prediction Interval for value.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Stat 155, Section 2, Last Time Binomial Distribution –Normal Approximation –Continuity Correction –Proportions (different scale from “counts”) Distribution.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
STA Lecture 191 STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm – 9:15pm Location CB 234.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Last Time Binomial Distribution Political Polls Hypothesis Testing
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall
Last Time Interpretation of Confidence Intervals Handling unknown μ and σ T Distribution Compute with TDIST & TINV (Recall different organization) (relative.
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Administrative Matters Midterm II Results Take max of two midterm scores:
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Announcements Exam 1 key posted on web HW 7 (posted on web) Due Oct. 24 Bonus E Due Oct. 24 Office Hours –this week: –Wed. 8-11, 3-4 –Fri For Bonus.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4 Estimation of a Population Mean  is unknown  This section presents.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Stat 31, Section 1, Last Time Big Rules of Probability –The not rule –The or rule –The and rule P{A & B} = P{A|B}P{B} = P{B|A}P{A} Bayes Rule (turn around.
AP Statistics.  If our data comes from a simple random sample (SRS) and the sample size is sufficiently large, then we know that the sampling distribution.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Inference About Means Chapter 23. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it’d be nice.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Chapter 7 Confidence Interval Estimation
Final Examination Thursday, April 30, 4:00 – 7:00
Stat 31, Section 1, Last Time Choice of sample size
Stat 31, Section 1, Last Time Sampling Distributions
Putting It All Together: Which Method Do I Use?
Confidence Interval Estimation
Presentation transcript:

Stat 31, Section 1, Last Time Hypothesis Testing –Careful about 1-sided vs. 2-sided Connection: CIs - Hypo Tests 3 Traps of Hypo Testing –Statistically Sign’t ≠ Really Sign’t –Non-sign’t ≠ Nothing there –In many tests, will find some sign’t T Distribution (handles unknown σ)

Reading In Textbook Approximate Reading for Today’s Material: Pages , Approximate Reading for Next Class: Pages

Midterm II Coming on Tuesday, April 10 Think about: Sheet of Formulas –Again single 8 ½ x 11 sheet –New, since now more formulas Redoing HW… Asking about those not understood Will schedule Extra Office Hours

Sec. 7.1: Deeper look at Inference Recall: “inference” = CIs and Hypo Tests Main Issue: In sampling distribution Usually is unknown, so replace with an estimate,. For n large, should be “OK”, but what about: n small? How large is n “large”?

Unknown SD Approach: Account for “extra variability in the approximation” Mathematics: Assume individual I.e. Data have mound shaped histogram Recall averages generally normal But now must focus on individuals

Unknown SD Then Replace by, then has a distribution named: “t-distribution with n-1 degrees of freedom”

t - Distribution Notes: 1.n is a parameter (like ) that controls “added variability from approximation

t - Distribution Notes: 2.Careful: set “degrees of freedom” = = n – 1 (not n) Easy to forget later Good to add to sheet of notes for exam

t - Distribution Notes: 3.Must work with standardized version of i.e. No longer can plug mean and SD into EXCEL formulas In text this was already done, Since need this for Normal table calc’ns

t - Distribution Notes: 4.Calculate t probs, i.e. areas, using TDIST & TINV Caution: these are set up differently from NORMDIST & NORMINV See Class Example 26

EXCEL Functions Summary: Normal: plug in: get out: NORMDIST: cutoff area NORMINV: area cutoff (but TDIST is set up really differently)

EXCEL Functions t distribution: 1 tail: plug in: get out: TDIST: cutoff area EXCEL notes: - no explicit inverse - backwards from Normal…

EXCEL Functions t distribution: Area 2 tail: plug in: get out: TDIST: cutoff area TINV: area cutoff (EXCEL note: this one has the inverse)

EXCEL Functions Note: when need to invert the 1-tail TDIST, Use twice the area. Area = A Area = 2 A

t - Distribution HW: C21 For T ~ t, with degrees of freedom: (a) 3 (b) 12 (c) 150 (d) N(0,1) Find: i.P{T> 1.7} (0.094, 0.057, 0.046, 0.045) ii.P{T < 2.14} (0.939, 0.973, 0.983, 0.984) iii.P{T < -0.74} (0.256, 0.237, 0.230, 0.230) iv.P{T > -1.83} (0.918, 0.954, 0.965, 0.966)

t - Distribution HW: C21 v.P{|T| > 1.18} (0.323, 0.261, 0.240, 0.238) vi.P{|T| < 2.39} (0.903, 0.966, 0.982, 0.983) vii.P{|T| < -2.74} (0, 0, 0, 0) viii.C so that 0.05 = P{|T| > C} (3.18, 2.17, 1.98, 1.96) ix.C so that 0.99 = P{|T| < C} (5.84, 3.05, 2.61, 2.58)

t - Distribution Application 1: Confidence Intervals Recall: margin of error from NORMINV or CONFIDENCE Using TINV? Careful need to standardize

t - Distribution Using TINV? Careful need to standardize # spaces on number line Need to work in to use TINV

t - Distribution distribution So want: i.e. want:

t - Distribution Terminology: TINV(0.05,n-1) is called a critical value (from connection between CIs and Tests) HW: 7.19

t - Distribution Class Example 27, Part I Old text book problem 7.24: In a study of DDT poisoning, researchers fed several rats a measured amount. They measured the “absolutely refractory period” required for a nerve to recover after a stimulus. Measurements on 4 rats gave:

t - Distribution Class Example 27, Part I Old text book problem 7.24: Measurements on 4 rats gave: a)Find the mean refractory period, and the standard error of the mean b)Give a 95% CI for the mean “absolutely refractory period” for all rats of this strain

t - Distribution Confidence Interval HW: 7.5, 7.7

And now for something completely different… Two issues: What do professional statisticians think about EXCEL? Why are the EXCEL functions so poorly organized?

And now for something completely different… Professional Statisticians Dislike Excel: Very poor handling of numerics Unacceptable?!? Jeff Simonoff Example:

And now for something completely different… A similar example: Class Example 28: Problem 1: Excel doesn’t keep enough significant digits (relative to other software) [single precision vs. double precision]

And now for something completely different… Problem 2: Excel doesn’t warn when troubles are encountered… All software has this problem sometimes But is easy to provide warnings… “Competent software does this…”

And now for something completely different… More discussion of Excel accuracy issues: By Erhan Erkut, University of Alberta:

And now for something completely different… Why are the EXCEL functions so poorly organized? E.g. NORMDIST uses left areas TDIST uses right or 2-sided areas E.g. NORMINV uses left areas TINV uses 2-sided areas More to come…

And now for something completely different… Why are the EXCEL functions so poorly organized? Looks like programmer was handed a statistics text, and told “turn these into functions”… Problem: organization was good for table look ups, but looks clunky now…

And now for something completely different… Fun personal story: Colin Bell AT Microsoft heard about “complaints from statisticians on EXCEL” Decided to “try to fix these” Contacted Jeff Simonoff about numerics Asked Jeff to work with him Jeff refused, doesn’t like or use EXCEL

And now for something completely different… Fun personal story: Jeff told Colin about me Colin asked me I agreed about numerical problems, but said I had bigger objections about organization Colin asked me to write these up

And now for something completely different… Fun personal story: I said I was too busy, but… I would teach (similar course) soon. I offered to send an , every time I noted an organizational inconsistency Over the semester, I sent around 30 s about all of these

And now for something completely different… Fun personal story: Colin agreed with each of the points made Colin approached the statistical people at Microsoft They agreed that organization could have been done better

And now for something completely different… Fun personal story: But for “backwards compatibility” reasons, refused to change anything Colin apologetically archived all my s…

And now for something completely different… How much should we worry: Organization is a pain, but you can live with it (OK to complain when you feel like it) Usually (except for weird rounding) numerical issues don’t arise, but need to be aware of potential!

t - Distribution Application 2: Hypothesis Tests Idea: Calculate P-values using TDIST

t – Distribution Hypo Testing E.g. Old Textbook Example 7.26 For the above DDT poisoning example, Suppose that the mean “absolutely refractory period” is known to be 1.3. DDT poisoning should slow nerve recovery, and so increase this period. Do the data give good evidence for this supposition?

t – Distribution Hypo Testing E.g. Old Textbook Example 7.26 Let = population mean absolutely refractory period for poisoned rats. (from before)

t – Distribution Hypo Testing E.g. Old Textbook Example 7.26 P-value = P{what saw or more conclusive | H 0 – H A Bdry}

t – Distribution Hypo Testing E.g. Old Textbook Example 7.26 From Class Example 27, part 2: = Interpretation: very strong evidence, for either yes-no or gray-level

t – Distribution Hypo Testing Variations: For “opposite direction” hypotheses: P-value = Then use symmetry, i.e. put - into TDIST.

t – Distribution Hypo Testing Variations: For 2-sided hypotheses: Use 2-tailed version of TDIST.

t – Distribution Hypo Testing HW: (0.04), 7.17, 7.21 a, f Interpret P-values: (i)yes-no (ii)gray-level