Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.

Slides:



Advertisements
Similar presentations
Introducing Hypothesis Tests
Advertisements

CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Hypothesis Testing: Intervals and Tests
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Hypothesis Testing: Hypotheses
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Stat 512 – Day 8 Tests of Significance (Ch. 6). Last Time Use random sampling to eliminate sampling errors Use caution to reduce nonsampling errors Use.
Section 4.4 Creating Randomization Distributions.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Rejecting Chance – Testing Hypotheses in Research Chapter 22.
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 101 Dr. Kari Lock Morgan 9/25/12 SECTION 4.2 Randomization distribution.
Randomization Tests Dr. Kari Lock Morgan PSU /5/14.
How Can We Test whether Categorical Variables are Independent?
Synthesis and Review 3/26/12 Multiple Comparisons Review of Concepts Review of Methods - Prezi Essential Synthesis 3 Professor Kari Lock Morgan Duke University.
More Randomization Distributions, Connections
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
Hypothesis Testing: p-value
Statistics: Unlocking the Power of Data Lock 5 Afternoon Session Using Lock5 Statistics: Unlocking the Power of Data Patti Frazer Lock University of Kentucky.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 9/18/12 Confidence Intervals: Bootstrap Distribution SECTIONS 3.3, 3.4 Bootstrap.
Introducing Inference with Simulation Methods; Implementation at Duke University Kari Lock Morgan Department of Statistical Science, Duke University
AP Statistics Section 11.1 A Basics of Significance Tests
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 101 Dr. Kari Lock Morgan 10/18/12 Chapter 5 Normal distribution Central limit theorem.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.1 Significance Tests: The Basics.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Chapter 20 Testing hypotheses about proportions
Hypotheses tests for means
Statistics: Unlocking the Power of Data Lock 5 Bootstrap Intervals Dr. Kari Lock Morgan PSU /12/14.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 9 Testing a Claim
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/6/12 Synthesis Big Picture Essential Synthesis Bayesian Inference (continued)
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
AP Statistics Section 11.1 B More on Significance Tests.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan SECTION 4.1 Hypothesis test Null and alternative.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 p-value.
Hypothesis Tests for 1-Proportion Presentation 9.
Statistics: Unlocking the Power of Data Lock 5 Section 4.1 Introducing Hypothesis Tests.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Randomization Tests PSU /2/14.
What Is a Test of Significance?
Introducing Hypothesis Tests
Unit 5: Hypothesis Testing
CHAPTER 9 Testing a Claim
Warm Up Check your understanding p. 541
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Introducing Hypothesis Tests
CHAPTER 9 Testing a Claim
Significance Tests: The Basics
Significance Tests: The Basics
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 9: Significance Testing
CHAPTER 9 Testing a Claim
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
CHAPTER 9 Testing a Claim
Presentation transcript:

Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative hypotheses Statistical significance

Statistics: Unlocking the Power of Data Lock 5 Review of Last Class The standard error of a statistic is the standard deviation of the sample statistic, which can be estimated from a bootstrap distribution Confidence intervals can be created using the standard error or the percentiles of a bootstrap distribution Confidence intervals can be created this way for any parameter, as long as the bootstrap distribution is approximately symmetric and continuous

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception Is there such a thing as extrasensory perception (ESP) or a “sixth sense”? Do you believe in ESP? a) Yes b) No

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception One way to test for ESP is with Zener cards: Subjects draw a card at random and telepathically communicate this to someone who then guesses the symbol

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception There are five cards with five different symbols If there is no such thing as ESP, what proportion of guesses should be correct? a) p = 0 b) p = 1/4 c) p = 1/5 d) p = 1/2 Because there are 5 cards, each person has a 1/5 chance of guessing correctly each time, if ESP does not exist.

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception As we’ve learned, statistics vary from sample to sample Even if the population proportion is 1/5, not every sample proportion will be exactly 1/5 How do we determine when a sample proportion is far enough above 1/5 to provide evidence of ESP?

Statistics: Unlocking the Power of Data Lock 5 Statistical Test A statistical test uses data from a sample to assess a claim about a population In the ESP experiment, we want to use sample data to determine whether the population proportion of correct guesses is really higher than 1/5

Statistics: Unlocking the Power of Data Lock 5 Statistical Evidence 3/4 is the highest, so provides the strongest evidence of ESP.

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception Let’s create our own sample proportion! Randomly choose a letter from A B C D E, and write it down (don’t show anyone!) Find a partner, telepathically communicate your letter (no auditory or visual clues!), and have them guess your letter. Switch roles. Did you guess correctly? a) Yes b) No

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception What is the sample proportion for our class? This provides Next class, we’ll learn how to quantify this evidence! a) Strong evidence for ESP b) Weak evidence for ESP c) No evidence for ESP d) Not sure

Statistics: Unlocking the Power of Data Lock 5 Statistical Hypotheses Null Hypothesis (H 0 ): Claim that there is no effect or difference. Alternative Hypothesis (H a ): Claim for which we seek evidence. Statistical tests are framed formally in terms of two competing hypotheses:

Statistics: Unlocking the Power of Data Lock 5 Statistical Hypotheses The alternative hypothesis is established by observing evidence (data) that contradicts the null hypothesis and supports the alternative hypothesis Hypotheses are always about population paramete rs H o : Null hypothesis H a : Alternative hypothesis Competing claims about a population

Statistics: Unlocking the Power of Data Lock 5 Statistical Hypotheses Null Hypothesis Alternative Hypothesis ALL POSSIBILITIES Can we reject the null hypothesis? Usually the null is a very specific statement ?

Statistics: Unlocking the Power of Data Lock 5 ESP Hypotheses For the ESP experiment: H o : p = 1/5 H a : p > 1/5 No “effect” or no “difference” Claim we seek “evidence” for Helpful hints: H 0 usually includes = H a usually includes >, <, or ≠ The inequality in H a depends on the question

Statistics: Unlocking the Power of Data Lock 5 Sleep versus Caffeine Mednick, Cai, Kanady, and Drummond (2008). “Comparing the benefits of caffeine, naps and placebo on verbal, motor and perceptual memory,” Behavioral Brain Research, 193, Students were given words to memorize, then randomly assigned to take either a 90 min nap, or a caffeine pill. 2 ½ hours later, they were tested on their recall ability. Explanatory variable: sleep or caffeine Response variable: number of words recalled Is sleep or caffeine better for memory?

Statistics: Unlocking the Power of Data Lock 5 Sleep versus Caffeine What is the parameter of interest in the sleep versus caffeine experiment? a) Proportion b) Difference in proportions c) Mean d) Difference in means e) Correlation The response variable (number of words recalled) is quantitative and the explanatory variable (sleep or caffeine) is categorical, so we are interested in a difference in means.

Statistics: Unlocking the Power of Data Lock 5 Sleep versus Caffeine Let  s and  c be the mean number of words recalled after sleeping and after caffeine. Is there a difference in average word recall between sleep and caffeine? What are the null and alternative hypotheses? a) H 0 :  s ≠  c, H a :  s =  c b) H 0 :  s =  c, H a :  s ≠  c c) H 0 :  s ≠  c, H a :  s >  c d) H 0 :  s =  c, H a :  s >  c e) H 0 :  s =  c, H a :  s <  c The null hypotheses is “no difference,” or that the means are equal. The alternative hypothesis is that there is a difference.

Statistics: Unlocking the Power of Data Lock 5 Difference in Hypotheses Note: the following two sets of hypotheses are equivalent, and can be used interchangeably: H 0 :  1 =  2 H a :  1 ≠  2 H 0 :  1 –  2 = 0 H a :  1 –  2 ≠ 0

Statistics: Unlocking the Power of Data Lock 5 Hypotheses Take a minute to write down the hypotheses for each of the following situations:  Does the proportion of people who support gun control differ between males and females?  Is the average hours of sleep per night for college students less than 7? p f : proportion of females who support gun control p m : proportion of males who support gun control H 0 : p f = p m H a : p f ≠ p m  : average hours of sleep per night for college students H 0 :  =7 H a :  < 7

Statistics: Unlocking the Power of Data Lock 5 Your Own Hypotheses Come up with a situation where you want to establish a claim based on data What parameter(s) are you interested in? What would the null and alternative hypotheses be? What type of data would lead you to believe the null hypothesis is probably not true?

Statistics: Unlocking the Power of Data Lock 5 Statistical Significance When results as extreme as the observed sample statistic are unlikely to occur by random chance alone (assuming the null hypothesis is true), we say the sample results are statistically significant If our sample is statistically significant, we have convincing evidence against H 0, in favor of H a If our sample is not statistically significant, our test is inconclusive

Statistics: Unlocking the Power of Data Lock 5 Statistical Significance

Statistics: Unlocking the Power of Data Lock 5 Note on Statistical Significance Statistical significance is a difficult concept, but also one of the most fundamental concepts of the course We return to this concept almost every class for the rest of the semester, so  it will get easier!  it’s worth thinking deeply about!

Statistics: Unlocking the Power of Data Lock 5 Sleep versus Caffeine a) there is a difference between sleep and caffeine for memory (and data show sleep is better) b) there is not a difference between sleep and caffeine for memory c) nothing

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception p = Proportion of correct guesses H 0 : p = 1/5 H a : p > 1/5 If results are statistically significant…  the sample proportion of correct guesses is higher than is likely just by random chance (if ESP does not exist and p = 1/5)  we have evidence that the true proportion of correct guesses really is higher than 1/5, and thus have evidence of ESP

Statistics: Unlocking the Power of Data Lock 5 Extrasensory Perception p = Proportion of correct guesses H 0 : p = 1/5 H a : p > 1/5 If results are NOT statistically significant…  the sample proportion of correct guesses could easily happen just by random chance (if ESP does not exist and p = 1/5)  we do not have enough evidence to conclude that p > 1/5, or that ESP exists

Statistics: Unlocking the Power of Data Lock 5 Key Question If it is very unusual, we have statistically significant evidence against the null hypothesis How do we measure how unusual a sample statistic is, if H 0 is true? How unusual is it to see a sample statistic as extreme as that observed, if H 0 is true? SIMULATE what would happen if H 0 were true!

Statistics: Unlocking the Power of Data Lock 5 ESP How could we simulate what would happen, just by random chance, if the null hypotheses were true for the ESP experiment? Roll a die. 1 = “correct letter” 2-5 = “wrong letter” 6 = roll again Did you get the correct letter? (a) Yes (b) No

Statistics: Unlocking the Power of Data Lock 5 ESP – Random Chance

Statistics: Unlocking the Power of Data Lock 5 ESP Based on the distribution below, do you think the results of our class ESP experiment are statistically significant? a) Yes b) No

Statistics: Unlocking the Power of Data Lock 5 ESP What does this imply about ESP? a) Evidence that ESP exists b) Evidence that ESP does not exist c) Impossible to tell

Statistics: Unlocking the Power of Data Lock 5 Summary Statistical tests use data from a sample to assess a claim about a population Statistical tests are usually formalized with competing hypotheses:  Null hypothesis (H 0 ): no effect or no difference  Alternative hypothesis (H a ): what we seek evidence for If it would be unusual to get results as extreme as that observed, just by random chance, if the null were true, then the data is statistically significant If data are statistically significant, we have convincing evidence against the null hypothesis, and in favor of the alternative

Statistics: Unlocking the Power of Data Lock 5 Project 1 Pose a question that you would like to investigate. If possible, choose something related to your major! Find or collect data that will help you answer this question (you may need to edit your question based on available data) You can choose either a single variable or a relationship between two variables See Finding Data for help finding dataFinding Data

Statistics: Unlocking the Power of Data Lock 5 Project 1 The result will be a five page paper including  Description of the data collection method, and the implications this has for statistical inference  Descriptive statistics (summary stats, visualization)  Confidence interval  Hypothesis test Proposal due next Monday, 2/17  If using existing data: include link to data, relevant summary statistic  If collecting your own data, proposal should include a detailed data collection plan

Statistics: Unlocking the Power of Data Lock 5 Sample or Population? If your data represents a sample from a population, inference makes sense If you have data on the entire population, you will be asked to take a random sample and do inference

Statistics: Unlocking the Power of Data Lock 5 To Do Read Section 4.1 Project 1 Proposal (due Monday 2/17)