Stat 31, Section 1, Last Time Statistical Inference Confidence Intervals: –Range of Values to reflect uncertainty –Bracket true value in 95% of repetitions.

Slides:



Advertisements
Similar presentations
Hypothesis Testing making decisions using sample data.
Advertisements

Stat 31, Section 1, Last Time Hypothesis Tests –H -, H 0 or H + version of testing –P-values – small for strong evidence –1-sided (strong evidence for.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
QM Spring 2002 Business Statistics Introduction to Inference: Hypothesis Testing.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Chapter 9 Hypothesis Testing.
Ch. 9 Fundamental of Hypothesis Testing
BCOR 1020 Business Statistics
Inference about Population Parameters: Hypothesis Testing
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Statistical Techniques I
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
14. Introduction to inference
Stat 31, Section 1, Last Time T distribution –For unknown, replace with –Compute with TDIST & TINV (different!) Paired Samples –Similar to above, work.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Stor 155, Section 2, Last Time Hypothesis Testing –Assess strength of evidence with P-value P-value interpretation: –Yes – No –Gray – level –1 - sided.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Significance Tests: THE BASICS Could it happen by chance alone?
Essential Statistics Chapter 131 Introduction to Inference.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.1 Significance Tests: The Basics.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Stor 155, Section 2, Last Time Review…. Stat 31 Final Exam: Date & Time: Tuesday, May 8, 8:00-11:00 Last Office Hours: Thursday, May 3, 12:00 - 5:00 Monday,
Chapter 20 Testing hypotheses about proportions
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Stat 155, Section 2, Last Time Binomial Distribution –Normal Approximation –Continuity Correction –Proportions (different scale from “counts”) Distribution.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
AP STATISTICS LESSON 10 – 2 DAY 2 MORE DETAIL: STATING HYPOTHESES.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 9 Testing a Claim
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Last Time Binomial Distribution Political Polls Hypothesis Testing
Rejecting Chance – Testing Hypotheses in Research Thought Questions 1. Want to test a claim about the proportion of a population who have a certain trait.
Chapter 9: Hypothesis Tests Based on a Single Sample 1.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Math 3680 Lecture #13 Hypothesis Testing: The z Test.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
A significance test or hypothesis test is a procedure for comparing our data with a hypothesis whose truth we want to assess. The hypothesis is usually.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Tests of Significance: Stating Hypothesis; Testing Population Mean.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Section 9.1 First Day The idea of a significance test What is a p-value?
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
+ Testing a Claim Significance Tests: The Basics.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
+ Homework 9.1:1-8, 21 & 22 Reading Guide 9.2 Section 9.1 Significance Tests: The Basics.
Module 10 Hypothesis Tests for One Population Mean
Stat 31, Section 1, Last Time Choice of sample size
Unit 5: Hypothesis Testing
Hypothesis Testing: Hypotheses
Essential Statistics Introduction to Inference
Significance Tests: The Basics
Presentation transcript:

Stat 31, Section 1, Last Time Statistical Inference Confidence Intervals: –Range of Values to reflect uncertainty –Bracket true value in 95% of repetitions Choice of sample size –Choose n to get desired error Hypothesis Testing –Yes – No questions, under uncertainty

Reading In Textbook Approximate Reading for Today’s Material: Pages , Approximate Reading for Next Class: Pages ,

Hypothesis Tests E.g. A fast food chain currently brings in profits of $20,000 per store, per day. A new menu is proposed. Would it be more profitable? Test: Have 10 stores (randomly selected!) try the new menu, let = average of their daily profits.

Hypothesis Testing Note: Can never make a definite conclusion, Instead measure strength of evidence. Reason: have to deal with uncertainty But: Can quantify uncertainty

Hypothesis Testing Approach I: (note: different from text) Choose among 3 Hypotheses: H + : Strong evidence new menu is better H 0 : Evidence in inconclusive H - : Strong evidence new menu is worse

Caution!!! Not following text right now This part of course can be slippery I am “breaking this down to basics” Easier to understand (If you pay careful attention) Will “tie things together” later And return to textbook approach later

Fast Food Business Example Base decision on best guess: Will quantify strength of the evidence using probability distribution of E.g.  Choose H +  Choose H 0  Choose H -

Fast Food Business Example How to draw line? (There are many ways, here is traditional approach) Insist that H + (or H - ) show strong evidence I.e. They get burden of proof (Note: one way of solving gray area problem)

Fast Food Business Example Suppose observe:, based on Note, but is this conclusive? or could this be due to natural sampling variation? (i.e. do we risk losing money from new menu?)

Fast Food Business Example Assess evidence for H + by: H + p-value = Area

Fast Food Business Example Computation in EXCEL: Class Example 22, Part 1: P-value = i.e. About 10% Is this “small”? (where do we draw the line?)

Fast Food Business Example View 1: Even under H 0, just by chance, see values like, about 10% of the time, i.e. 1 in 10, so not “terribly convincing”??? Could be a “fluke”? But where is the boundary line?

P-value cutoffs View 2: Traditional (and even “legal”) cutoff, called here the yes-no cutoff: Say evidence is strong, when P-value < 0.05 i.Just a commonly agreed upon value, but very widely used: –Drug testing –Publication of scientific papers

P-value cutoffs ii.Say “results are statistically significant” when this happens, i.e. P-value < 0.05 iii.Can change cutoff value 0.05, to some other level, often called Greek “alpha” E.g. your airplane safe to fly, want E.g. often called strongly significant

P-value cutoffs View 3: Personal idea about cutoff, called gray level (vs. yes-no above) P-value < 0.01: “quite strong evidence” 0.01 < P-value < 0.1: “weaker evidence but stronger for smaller P-val.” 0.1 < P-value: “very weak evidence, at best”

Gray Level Cutoffs View 3: gray level (vs. yes-no above) Note: only about interpretation of P-value E.g.: When P-value is given: HW: 6.40 & (d) give gray level interp. (no, no, relatively weak evidence) 6.41 & (d) give gray level interp. (yes, not, moderately strong evidence)

Caution!!! Gray level viewpoint not in text Will see it is more sensible Hence I teach this Suggest you use this later in life Will be on HW & exams

Fast Food Business Example P-value of for H +, Is “quite weak evidence for H + ”, i.e. “only a mild suggestion” This happens sometimes: not enough information in data for firm conclusion

Fast Food Business Example Flip side: could also look at “strength of evidence for H - ”. Expect: very weak, since saw Quantification: H - P-value = $20,000 $21,000

Fast Food Business Example EXCEL Computation: Class Example 24, part 1 H - P-value = >> ½, so no evidence at all for H - (makes sense)

Fast Food Business Example A practical issue: Since, May want to gather more data… Could prove new menu clearly better (since more data means more information, which could overcome uncertainty)

Fast Food Business Example Suppose this was done, i.e. n = 10 is replaced by n = 40, and got the same: Expect: 4 times the data  ½ of the SD Impact on P-value? Class Example 24, Part 2

Fast Food Business Example How did it get so small, with only ½ the SD? mean = $20,000, observed $21,000 P-value = P-value = 0.004

Hypothesis Testing HW: C20 For each of the problems: a)A box label claims that on average boxes contain 40 oz. A random sample of 12 boxes shows on average 39 oz., with s = 2.2. Should we dispute the claim?

Hypothesis Testing b)We know from long experience that Farmer A’s pigs average 570 lbs. A sample of 16 pigs from Farmer B averages 590 lbs, with an SD of 110. Is it safe to say B’s pigs are heavier on average? c)Same as (b) except “lighter on average”. d)Same as (b) except that B’s average is 630 lbs.

Hypothesis Testing Do: i.Define the population mean of interest. ii.Formulate H +, H 0, and H -, in terms of mu. iii.Give the P-values for both H + and H -. (a , 0.058, b , 0.766, c , 0.766, d , 0.985) iv.Give a yes-no answer to the questions. (a. H -  don’t dispute b. H -  not safe c. H -  not safe d. H -  safe)

Hypothesis Testing v.Give a gray level answer to the questions. (a. H -  moderate evidence against b. H -  no strong evidence c. H -  seems to go other way d. H -  strong evidence, almost very strong)

And now for something completely different…. An amazing movie clip: Thanks to Trent Williamson

Hypothesis Testing Hypo Testing Approach II: 1-sided testing (more conventional & is version in text) Idea: only one of H + and H - is usually relevant, so combine other with H 0

Attention!!! Now return to textbook presentation H -, H 0, and H + ideas are building blocks Will combine these In two different ways To get more conventional hypothesis As developed in text

Hypothesis Testing Approach II: New Hypotheses Null Hypothesis: H 0 = “H 0 or ” Alternate Hypothesis: H A = opposite of Note: common notation for H A is H 1 Gets “burden of proof”, I might accidentally put this i.e. needs strong evidence to prove this

Hypothesis Testing Weird terminology: Firm conclusion is called “rejecting the null hypothesis” Basics of Test: P-value = Note: same as H 0 in H +, H 0, H - case, so really just same as above

Fast Food Business Example Recall: New menu more profitable??? Hypo testing setup: P-val = Same as before. See: Class Example 24, part 3:

Hypothesis Testing HW: 6.55, 6.61 Interpret with both yes-no and gray level AlternateTerminology: “Significant at the 5% level” = = P-value < 0.05 “Test Statistic z” = N(0,1) cutoff

Hypothesis Testing Hypo Testing Approach III: 2-sided tests Main idea: when either of H + or H - is conclusive, then combine them E.g. Is population mean equal to a given value, or different? Note either bigger or smaller is strong evidence

Hypothesis Testing Hypo Testing Approach III: “Alternative Hypothesis” is: H A = “H + or H - ” General form: Specified Value

Hypothesis Testing, III Note: “ ” always goes in H A, since cannot have “strong evidence of =”. i. e. cannot be sure about difference between and while can have convincing evidence for “ ” (recall H A gets “burden of proof”)

Hypothesis Testing, III Basis of test: (now see why this distribution form is used) observed value of “more conclusive” is the two tailed area

Fast Food Business Example Two Sided Viewpoint: $1,000 $1,000 P-value = $20,000 $21,000 mutually exclusive “or” rule

Fast Food Business Example P-value = =NORMDIST… See Class Example 24, part 4 = So no strong evidence, Either yes-no or gray-level

Fast Food Business Example Shortcut: by symmetry 2 tailed Area = 2 x Area See Class Example 24, part 4

Hypothesis Testing, III HW: interpret both yes-no & gray-level (-2.20, , rather strong evidence)

Hypothesis Testing, III A “paradox” of 2-sided testing: Can get strange conclusions (why is gray level sensible?) Fast food example: suppose gathered more data, so n = 20, and other results are the same

Hypothesis Testing, III One-sided test of: P-value = … = Part 5 of Two-sided test of: P-value = … = 0.062

Hypothesis Testing, III Yes-no interpretation: Have strong evidence But no evidence !?! (shouldn’t bigger imply different?)

Hypothesis Testing, III Notes: i.Shows that yes-no testing is different from usual logic (so be careful with it!) ii.Reason: 2-sided admits more uncertainty into process (so near boundary could make a difference, as happened here) iii.Gray level view avoids this: (1-sided has stronger evidence, as expected)

Hypothesis Testing, III Lesson: 1-sided vs. 2-sided issues need careful: 1.Implementation (choice does affect answer) 2.Interpretation (idea of being tested depends on this choice) Better from gray level viewpoint

Hypothesis Testing, III CAUTION: Read problem carefully to distinguish between: One-sided Hypotheses - like: Two-sided Hypotheses - like:

Hypothesis Testing Hints: Use 1-sided when see words like: –Smaller –Greater –In excess of Use 2-sided when see words like: –Equal –Different Always write down H 0 and H A –Since then easy to label “more conclusive” –And get partial credit….

Hypothesis Testing E.g. Text book problem 6.34: In each of the following situations, a significance test for a population mean, is called for. State the null hypothesis, H 0 and the alternative hypothesis, H A in each case….

Hypothesis Testing E.g. 6.34a An experiment is designed to measure the effect of a high soy diet on bone density of rats. Let = average bone density of high soy rats = average bone density of ordinary rats (since no question of “bigger” or “smaller”)

Hypothesis Testing E.g. 6.34b Student newspaper changed its format. In a random sample of readers, ask opinions on scale of -2 = “new format much worse”, -1 = “new format somewhat worse”, 0 = “about same”, +1 = “new a somewhat better”, +2 = “new much better”. Let = average opinion score

Hypothesis Testing E.g. 6.34b (cont.) No reason to choose one over other, so do two sided. Note: Use one sided if question is of form: “is the new format better?”

Hypothesis Testing E.g. 6.34c The examinations in a large history class are scaled after grading so that the mean score is 75. A teaching assistant thinks that his students have a higher average score than the class as a whole. His students can be considered as a sample from the population of all students he might teach, so he compares their score with 75. = average score for all students of this TA

Hypothesis Testing E.g. Textbook problem 6.36 Translate each of the following research questions into appropriate and Be sure to identify the parameters in each hypothesis (generally useful, so already did this above).

Hypothesis Testing E.g. 6.36a A researcher randomly divides 6-th graders into 2 groups for PE Class, and teached volleyball skills to both. She encourages Group A, but acts cool towards Group B. She hopes that encouragement will result in a higher mean test for group A. Let = mean test score for Group A = mean test score for Group B

Hypothesis Testing E.g. 6.36a Recall: Set up point to be proven as H A

Hypothesis Testing E.g. 6.36b Researcher believes there is a positive correlation between GPA and esteem for students. To test this, she gathers GPA and esteem score data at a university. Let = correlation between GPS & esteem

Hypothesis Testing E.g. 6.36c A sociologist asks a sample of students which subject they like best. She suspects a higher percentage of females, than males, will name English. Let: = prop’n of Females preferring English = prop’n of Males preferring English

Hypothesis Testing HW on setting up hypotheses: 6.35, 6.37