10.3: Use and Abuse of Tests Carrying out a z-confidence interval calculation is simple. On a TI-83, it’s STAT-->TESTS-->7 Similarly, carrying out a z-signficance.

Slides:



Advertisements
Similar presentations
Decision Errors and Power
Advertisements

CHAPTER 9 Testing a Claim
2 nd type of inference Assesses the evidence provided by the data in favor of some claim about the population Asks how likely an observed outcome would.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Significance Tests: THE BASICS Could it happen by chance alone?
Concept of Power ture=player_detailpage&v=7yeA7a0u S3A.
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
Lesson Significance Tests: The Basics. Vocabulary Hypothesis – a statement or claim regarding a characteristic of one or more populations Hypothesis.
CHAPTER 16: Inference in Practice ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
CHAPTER 9 Testing a Claim
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
The z test statistic & two-sided tests Section
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Inference Toolbox! To test a claim about an unknown population parameter: Step 1: State Identify the parameter (in context) and state your hypotheses.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Issues concerning the interpretation of statistical significance tests.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 9 Testing a Claim
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
9.3/9.4 Hypothesis tests concerning a population mean when  is known- Goals Be able to state the test statistic. Be able to define, interpret and calculate.
Section 9.1(re-visited)  Making Sense of Statistical Significance  Inference as Decision.
BPS - 5th Ed. Chapter 151 Thinking about Inference.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Math 3680 Lecture #13 Hypothesis Testing: The z Test.
PHANTOMS: A Method of Testing Hypotheses
A significance test or hypothesis test is a procedure for comparing our data with a hypothesis whose truth we want to assess. The hypothesis is usually.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Lesson Use and Abuse of Tests. Knowledge Objectives Distinguish between statistical significance and practical importance Identify the advantages.
AP Statistics Chapter 21 Notes
Don’t be wordy; mind your style! “The results that were taken from a sample of people go to show some results that tell us some facts about how women and.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.2 Tests About a Population.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.3 Tests About a Population.
AP STATISTICS LESSON (DAY 1) MAKING SENSE OF SATISTICAL SIGNIFICANCE.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics Third Edition Chapter 11: Testing a Claim Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
AP Statistics Section Statistical significance is valued because it points to an effect that is __________ to occur simply by chance. Carrying out.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Statistical Inference What can go wrong?. Tests of significance are an important part of much of scientific study today. The validity of the conclusions.
Chapter Use and Abuse of Tests. Choosing a Level of Significance How plausible is H 0 ? What are the consequences of rejecting H 0 ? There is.
CHAPTER 9 Testing a Claim
Unit 5: Hypothesis Testing
Tests of Significance The reasoning of significance tests
Warm Up Check your understanding P. 586 (You have 5 minutes to complete) I WILL be collecting these.
CHAPTER 9 Testing a Claim
CHAPTER 18: Inference in Practice
TESTs about a population mean
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 16: Inference in Practice
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

10.3: Use and Abuse of Tests Carrying out a z-confidence interval calculation is simple. On a TI-83, it’s STAT-->TESTS-->7 Similarly, carrying out a z-signficance test is simple. On a TI-83, it’s STAT-->TESTS-->1 The trick is knowing when it’s APPROPRIATE to use these tools. Don’t get calculator happy! Thinking is required.

Choosing a Level of Significance Recall that the P -value gives you the strength of the evidence against the null hypothesis. We often decide whether or not to reject the null hypothesis based on the chosen significance level, a. However, a should not be treated as some sort of universal standard. Different situations require difference significance levels.

Choosing a Level of Significance The significance levels of 1%, 5%, 10% come from a time before the help of computers or software. They were born from convenience, not scientific rigor. Because of this, no matter what the signifiance level is, YOU SHOULD ALWAYS REPORT THE P -value!!!

Choosing a Level of Significance Another thing to consider: There is NEVER an exact cut-off between significant and insignificant, only increasing evidence against the null hypothesis as the P -value decreases. For example, there is no practical difference between P -values of and 0.051, even though the first is significant at the 5% level and the second, technically, is not. You have to use your discretion!

What statistical significance doesn’t mean Suppose we perform a significance test and determine significance. (P-value ≤ a ). This means: there is good evidence that an effect is present. This DOES NOT mean: there is good evidence that the effect is large.

What statistical significance doesn’t mean Given a large enough sample, even small deviations from the mean will yield significance. For example, if n = 1000, a correlation of r = 0.08 is significant at the 1% level given a null hypothesis of r = 0. But r = 0.08 is an extremely small correlation! So what we’ve shown is: There DOES appear to be a positive association. What we HAVEN’T shown is: The association is large.

What statistical significance doesn’t mean Q: So what do we do? A: PLOT YOUR DATA! Look for outliers, influentials, or any systematic deviations (bias). A significance test is a cool thing, but it’s overused. While a significance level says NOTHING about the size of an effect, a Confidence Interval DOES help you determine the size of an effect. IT NEVER HURTS TO USE BOTH!

Proper Steps of Data Analysis Use graphical analysis - NQPs, Boxplots, and Histograms. Use significance tests to determine if there is an effect. Use confidence intervals to determine the size of the effect.

Don’t ignore lack of significance It is easy to get obsessed with trying to show that an effect exists. Upon discovering there isn’t one, many researchers have discarded their work. But keep in mind: not finding an effect where we expected to see one is important too! Moral: ALWAYS REPORT YOUR RESULTS For one, you don’t want to doom others to doing your experiment. For another, it is worthwhile to analyze WHY no effect was detected.

Statistical Inference is not valid for all data sets Remember, badly designed experiments produce worthless results. Even math can’t save the day. Remember also that we sometimes don’t always know WHAT caused a result, even in a seemingly well-designed experiment. This raises the issues of confounding and lurking variables.

Statistical Inference is not valid for all data sets For example, there exists statistical significance between the English vocabulary scores of high school seniors who have studied a foreign language and those who have not. Does this mean taking a foreign language cause higher English vocabulary scores? Or does it mean that taking a foreign language is confounded by the type of student who takes a foreign language during their senior year? Statistical significance tells us a difference exists, but it doesn’t tell us HOW or WHY the difference exists.

Beware of searching for significance Statistical Significance has its basis in probability. But even randomness can show a pattern if you look long enough. This can lead to problems if you perform too many tests on the same data.

Beware of searching for significance Example: There was a team of psychiatrists who performed SEVENTY-SEVEN significance tests on a set of data that contained 77 variables! Consider this, if you made 77 tests at the 5% level, you would expect a few to be significant by pure chance. After all, results significant at the 5% level happen 5 out of 100 times (in the long run) EVEN WHEN THE NULL HYPOTHESIS IS TRUE! So it wasn’t really meaningful when the psychiatrists achieved significance for two of the tests.

Morals of the Story A significance level should be used as a guideline, not a rigid barrier. Significance levels should vary based on the severity of the experiment.

Morals of the Story Significance tests DO NOT tell you whether an effect is large or small. They only tell you whether an effect exists. Use Confidence Intervals to determine the size of an effect.

Morals of the Story Don’t ignore lack of significance. Sometimes you learn more from failing than you do from succeeding! Beware of confounding and lurking variables before drawing any conclusions from a significance test. Searching for significance should NOT be the goal. It is simply a tool which can yield helpful information for a properly designed experiment.