The Fine Art of Knowing How Wrong You Might Be. To Err Is Human Humans make an infinitude of mistakes. I figure safety in numbers makes it a little more.

Slides:



Advertisements
Similar presentations
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Chapter 20 Testing Hypotheses about Proportions. Hypothesis Testing: used to assess the evidence provided by data in favor of some claim about the population.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
+ Chapter 10 Section 10.4 Part 2 – Inference as Decision.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Chapter 19 Rocks!.  In our formula, fractionwise, z-star is in the top and n is in the bottom.  The terms for these are directly proportional (for z-star)
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 21 More About Tests.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Testing Hypotheses About Proportions
More About Tests and Intervals Chapter 21. Zero In on the Null Null hypotheses have special requirements. To perform a hypothesis test, the null must.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Significance Toolbox 1) Identify the population of interest (What is the topic of discussion?) and parameter (mean, standard deviation, probability) you.
Copyright © 2009 Pearson Education, Inc. Chapter 21 More About Tests.
Chapter 20 Testing hypotheses about proportions
Hypotheses tests for means
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Chapter 20 Testing Hypothesis about proportions
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Chapter 21: More About Test & Intervals
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
CHAPTER 9 Testing a Claim
Slide 21-1 Copyright © 2004 Pearson Education, Inc.
Chapter 21: More About Tests
Welcome to MM570 Psychological Statistics
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
Statistical Techniques
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
A 10 Step Quest Every Time!. Overview When we hypothesis test, we are trying to investigate whether or not a sample provides strong evidence of something.
Chapter 20 Testing Hypotheses about Proportions. Hypothesis Testing: used to assess the evidence provided by data in favor of some claim about the population.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
6.2 Large Sample Significance Tests for a Mean “The reason students have trouble understanding hypothesis testing may be that they are trying to think.”
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Hypothesis Tests for 1-Proportion Presentation 9.
More about tests and intervals CHAPTER 21. Do not state your claim as the null hypothesis, instead make what you’re trying to prove the alternative. The.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
How Bad Is Oops?. When we make a decision while hypothesis testing (to reject or to do not reject the H O ) we determine which kind of error we have made.
Chapter 9: Testing a Claim
Unit 5: Hypothesis Testing
CHAPTER 9 Testing a Claim
Chapter 21 More About Tests.
CHAPTER 9 Testing a Claim
More about Tests and Intervals
Chapter 9: Testing a Claim
P-value Approach for Test Conclusion
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Significance Tests: The Basics
Chapter 9: Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 9: Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Presentation transcript:

The Fine Art of Knowing How Wrong You Might Be

To Err Is Human Humans make an infinitude of mistakes. I figure safety in numbers makes it a little more okay. When we do a hypothesis test, there are only so many options for what we decide to do. More specifically, there are only four possible outcomes.

Four?!? Some of you might be thinking that our only two possible options are to reject H 0 or to not reject H 0. You are correct. In each of these cases, though, we might be right or we might be wrong. That is where the number four comes from.

Four We might find sufficient evidence, and it turns out the evidence is telling us the truth. We might find sufficient evidence, but it turns out the evidence was naughty, bad evidence that was misleading us. (A frame job.) We might not find sufficient evidence, but it turns out that there really is the difference we were looking for. We might not find sufficient evidence, and it turns out that is correct and that nothing is going on.

So Then What? No matter what decision we make, we might be wrong. When you’re right, you’re right. When you’re wrong, you don’t know it, so act like you’re right…it projects an attractive level of confidence. In statistics we have no real way to find the truth without taking a census. Do not administer a census if you can help it. A census is dumb.

Where Am I Going With This? If we reject H 0 (which means there was sufficient evidence), but we were wrong, this is called a Type I Error (Roman Numeral 1). The probability of making this kind of error is alpha. So, normally, 5%. We call this percentage significance level, and use the letter α for it. We can set it as high or low as we want. Normally we go with 5%.

Why Not Make It 0%? If we do not reject H 0 (which means there was not sufficient evidence to draw a conclusion), but something really is going on, this is called a Type II Error (Roman Numeral 2). The probability of this kind of error is called β and it is a real pain to calculate, so we are not going to. What you need to know about it is that the lower α is, the higher β is.

5% Is A Good Balance There are really only 2 things we can do to make β smaller. First, we can make α higher. This is usually counterproductive, as it just makes us more likely to be wrong the other direction. Second, we can make n (the sample size) larger. Larger sample sizes make everything better. Make the sample size too large, though, and you violate the less than 10% condition. Also, if the sample is too large, you are pretty much doing a census.

A Pair of Complements For a 2-sided test (One where the alternative is a ≠ kind), the complement of α is the confidence level. And the compliment of the confidence level is the significance level. So, 95% confidence matches with 5% (for a ≠ test). The complement of β is called statistical power. You don’t need the formula for this, and only need to know the term in a general way.

Significance Remember how we discussed the difference between statistical significance and practical significance? I remember it. Hopefully we can all remember it. If the p-value is lower than α, we found sufficient evidence. Another way to say this is that a “statistically significant difference was found”. Statistically significant means the H 0 got rejected.

Nothing To Prove Remember how I said earlier that Statistics is never used to prove anything ever? I remember it. Hopefully we can all remember it. We never prove the null or the alternative hypothesis. We only support the alternative strongly enough or do not support it strongly enough.

A Truckload of Apples On a recent homework problem you were asked to find out how likely it was that a sample of 150 apples were taken from a whole truckload would have less than 5% damaged. We were able to do this math because we were told that the true percentage of damaged apples was 8%.

A Truckload of Apples In the real world we would not know that 8% of the whole truckload was damaged. They just told us that so we could do the math. So what should we do in the real world? Since we don’t know what the true percent is, we will do the next best thing…we will make a wild assumption about the truth.

Wild Assumption In practice we will make a sensible assumption, actually. Our assumption will usually be based on something that is known. This could be past data. This could be the value from a similar group. This could be.5 in the case of fair coins. The purpose in making this assumption is, primarily, to allow us to do the math.

Alternatives The alternative hypothesis will be based on whatever we are trying to prove. We use the null hypothesis so we can do the math, but really the alternative hypothesis is what we are trying to focus on when we run a hypothesis test.

Foreshadowing FTW Tomorrow we will discuss inferior ways to run a hypothesis test. I say inferior, but they are perfectly acceptable. Mr. J.P. Damron, in fact, almost always favored a confidence interval method for running hypothesis tests. It is worth noting that he never once got a hypothesis test wrong with this method, and seriously did over 30 of them.

Assignments You can read chapter 21, but I do not think it is necessary, especially if you have limited free time. Test 4 is next week on Thursday. There will be another Quiz Retake Bonanza on February 19 th (the day after the test).