Making Data-Based Decisions

Slides:



Advertisements
Similar presentations
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Advertisements

CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Quantitative Skills 4: The Chi-Square Test
Using Statistics to Analyze your Results
Chapter 10: Hypothesis Testing
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
 Once you know the correlation coefficient for your sample, you might want to determine whether this correlation occurred by chance.  Or does the relationship.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.
8-2 Basics of Hypothesis Testing
Chapter 8: Hypothesis Testing and Inferential Statistics What are inferential statistics, and how are they used to test a research hypothesis? What is.
Descriptive Statistics
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Overview of Statistical Hypothesis Testing: The z-Test
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Hypothesis Testing.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Results and Discussion Let’s revisit this assignment!
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Elementary Statistical Methods André L. Souza, Ph.D. The University of Alabama Lecture 22 Statistical Power.
Significance Testing Statistical testing of the mean (z test)
AP STATISTICS LESSON 10 – 4 ( DAY 1 ) INFERENCE AS DECISION.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Hypothesis Testing State the hypotheses. Formulate an analysis plan. Analyze sample data. Interpret the results.
Inferential Statistics Body of statistical computations relevant to making inferences from findings based on sample observations to some larger population.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Scientific Method Probability and Significance Probability Q: What does ‘probability’ mean? A: The likelihood that something will happen Probability.
Introduction to Hypothesis Testing
Statistical Techniques
Section 10.2: Tests of Significance Hypothesis Testing Null and Alternative Hypothesis P-value Statistically Significant.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
15 Inferential Statistics.
Introduction to Hypothesis Testing: The Binomial Test
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Introduction to Hypothesis Testing: The Binomial Test
9.3 Hypothesis Tests for Population Proportions
Unit 5: Hypothesis Testing
Is this quarter fair?. Is this quarter fair? Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of.
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Introduction to Hypothesis Testing: The Binomial Test
Chapter 8: Hypothesis Testing and Inferential Statistics
Central Limit Theorem, z-tests, & t-tests
Overview and Basics of Hypothesis Testing
Two-sided p-values (1.4) and Theory-based approaches (1.5)
Hypothesis Testing.
Section 10.2: Tests of Significance
Statistical Tests P Values.
Statistical Inference
Chapter 9: Hypothesis Tests Based on a Single Sample
Tests of Significance.
P-VALUE.
Hypothesis Testing.
CHAPTER 9 Testing a Claim
Testing Hypotheses I Lesson 9.
Is this quarter fair?. Is this quarter fair? Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of.
1 Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing The general goal of a hypothesis test is to rule out chance (sampling error) as.
Section 11.1: Significance Tests: Basics
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
STAT 1301 Tests of Significance about Averages.
Presentation transcript:

Making Data-Based Decisions Let’s flip a coin

Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?

The Research Question… Hypotheses: Null hypothesis: A coin toss will results in 50 % head and 50% tails. Expected data: Equal numbers of heads and tails Alternative hypothesis 1: Heads will occur more often compared to tails. More heads than tails Alternative hypothesis 2: Heads will occur less often compared to tails. Fewer heads than tails

Testing: Flip coin 10 times

Results - out of 10 flips What is the minimum number of heads that you would expect if: the null hypothesis is correct? Why? (A coin toss will results in 50 % head and 50% tails.) alternative hypothesis 1 is correct? Why? (Heads will occur more often compared to tails.) alternative hypothesis 2 is correct? Why? (Heads will occur less often compared to tails.)

Results, coin flipped 10 times 8 or 80% of the flips were heads Which hypothesis does this support? null hypothesis: A coin toss will results in 50 % head and 50% tails. alternative hypothesis 1: Heads will occur more often compared to tails. alternative hypothesis 2: Heads will occur less often compared to tails.

Results - out of 10 flips 8 or 80% of the flips were heads. Is it possible that we could have gotten 8 heads if the other hypotheses were correct, too?

What if we could actually calculate the likelihood of getting at least 8 heads? For instance If null hypothesis fails to be rejected – what is the probability that we could have actually gotten 8? If alternative hypothesis 1 is supported, what is the probability that we could have actually gotten 8? Testing the null hypothesis is the easiest…why?

The problem is…. Null hypothesis – equal numbers of predicted flips Alternative 1 – how many more heads would we expect? WE KNOW EXACTLY WHAT TO EXPECT FOR THE NULL, BUT HAVE NO IDEA WHAT VALUES TO EXPECT FOR THE ALTERNATIVE

Let’s set up a simulation…. Let’s flip a coin 10 times as one sample How many heads would you expect to get? Explain TO DETERMINE PROBABILITY REPEAT A LOT OF TIMES AND SEE HOW OFTEN WE GET AT LEAST 8 HEADS

Number of “correct flips” Simulation – 20 samples Count of Samples   X  X X  1 2 3 4 5 6 7 8 9 10 Number of “correct flips” Number Heads 1 2 3 4 5 6 7 8 9 10 Chance of getting this many heads 0.0 0.05  0.15 0.15 0.25 0.2 0.1   0

Question… Why don’t we get 5 heads every time we flip a coin 10 times? Why are some values not represented? We didn’t get any samples with 0, 2, or 10 heads?

The Impact of Sampling We are sampling We don’t expect every sample to look exactly like the population. There is going to be variability because of chance

Simulation – 20 samples Simulation – 10000 samples 1 2 3 4 5 6 7 8 9 Number Heads 1 2 3 4 5 6 7 8 9 10 Total Times occurred Probability of Occurring 0.0 0.05  0.15 0.15 0.25 0.2 0.1   0 Simulation – 10000 samples Number Heads 1 2 3 4 5 6 7 8 9 10 Total Times occurred 92 461 1154 1981 2537 2063 1117 479 101 Probability of Occurring 0.0005 0.009 0.046  0.115 0.198 0.245 0.206 0.117 0.048 0.010  0.0006

My simulation – 10000 samples The Big Question… What is the likelihood (probability) of having AT LEAST 8 heads in our sample (getting 8, 9, or 10 heads)? My simulation – 10000 samples Number Heads 1 2 3 4 5 6 7 8 9 10 Total Times occurred 92 461 1154 1981 2537 2063 1117 479 101 Probability of Occurring 0.0005 0.009 0.046  0.115 0.198 0.245 0.206 0.117 0.048 0.010  0.0006

My simulation – 10000 samples The Big Question… What is the likelihood (probability) of having AT LEAST 8 heads in our sample? p= 0.048 +0.01 + 0.006 so p=0.0586 (8) (9) (10) My simulation – 10000 samples Number Heads 1 2 3 4 5 6 7 8 9 10 Total Times occurred 92 461 1154 1981 2537 2063 1117 479 101 Probability of Occurring 0.0005 0.009 0.046  0.115 0.198 0.245 0.206 0.117 0.048 0.010  0.0006

My simulation – 10000 samples The Big Question… What is the likelihood (probability) of having AT LEAST 8 heads in our sample? p=0.0586 or the likelihood of this occurring is 6 times out of 100. My simulation – 10000 samples Number Heads 1 2 3 4 5 6 7 8 9 10 Total Times occurred 92 461 1154 1981 2537 2063 1117 479 101 Probability of Occurring 0.0005 0.009 0.046  0.115 0.198 0.245 0.206 0.117 0.048 0.010  0.0006

Logic of Statistical Testing Inferring from samples – INFERENTIAL STATISTICS Scientists collect data from a sample and determine whether or not that sample provides EVIDENCE AGAINST the null hypothesis. If the null hypothesis is true, what is the probability we would have randomly chosen a sample with the values we observed? Analysis: By looking at our probability of obtaining 80% or 8 heads in a sample of 10 flips, we can make a decision. PROBABILITY IS OFTEN CALLED THE P value

Our Example Likelihood of getting 8 heads out of ten in our sample if the null hypothesis were actually true is p=0.0586 meaning it would occur roughly 6 times out of 100. Do you consider this value low or high? Do you think it provides enough evidence against the null hypothesis?

Statistical Significance Need a cut point for the p-value Common “cut points”: 0.05, 0.01, .001 If P value < 0.05, you say the result is “statistically significant” and you reject the null hypothesis. If the null hypothesis is true, the probability of randomly getting the observed sample is unlikely. This provides evidence against the null hypothesis and we would REJECT the null hypothesis, suggesting one of the alternative hypotheses were correct.

Statistical Significance If P value > 0.05, You say the results were “not statically significant” If the null hypothesis is true, the probability of randomly getting the observed sample is likely. This does not provides evidence against the null hypothesis and we would FAIL TO REJECT the null hypothesis, allowing us to reject the alternative hypotheses.

Statistical Tests/Hypothesis Testing/Inferential Test: All statistical tests provide a P-value that is the probability that your results would have occurred if the null hypothesis were true. They use information from your data (mean, standard deviation, etc.) to figure out a probability based upon a population that meets the null hypothesis (much like our coin simulation). You use the p-value to make a data-driven decision

Question: 16 heads out of 20? p = 0.01 40 heads out of 50? p<0.0001 What do you think would happen to the probability of getting 80% heads if we had flipped more: 16 heads out of 20? p = 0.01 40 heads out of 50? p<0.0001 Increasing your sample size decreases the chance that your results will be impacted by errors or chance factors that might mask differences.

Example Hypotheses and P value Null Hypothesis P-Value Decision Interpretation The mean life-span is 15 years. 0.078 The correlation between amount of nutrient and growth is 0. The mean height of plants exposed to sunlight equals the mean height of plants not exposed to light. Cut-off value is 0.05

Example Hypotheses and p-value Null Hypothesis P-Value Decision Interpretation The mean life-span is 15 years. 0.078 Do not reject the null hypothesis There is no evidence to suggest the mean life-span is not 15 years. The correlation between amount of nutrient and growth is 0. The mean height of plants exposed to sunlight equals the mean height of plants not exposed to light. Cut-off value is 0.05

Example Hypotheses and p-value Null Hypothesis P-Value Decision Interpretation The mean life-span is 15 years. 0.078 Do not reject the null hypothesis There is no evidence to suggest the mean life-span is not 15 years. The correlation between amount of nutrient and growth is 0. 0.010 The mean height of plants exposed to sunlight equals the mean height of plants not exposed to light. Cut-off value is 0.05

Example Hypotheses and p-value Null Hypothesis P-Value Decision Interpretation The mean life-span is 15 years. 0.078 Do not reject the null hypothesis (P > 0.05) There is no evidence to suggest the mean life-span is not 15 years. The correlation between amount of nutrient and growth is 0. 0.010 Reject the null hypothesis (P < 0.05) There is evidence to suggest the correlation is not zero. The mean height of plants exposed to sunlight equals the mean height of plants not exposed to light. Cut-off value is 0.05

Example Hypotheses and p-value Null Hypothesis P-Value Decision Interpretation The mean life-span is 15 years. 0.078 Do not reject the null hypothesis (P > 0.05) There is no evidence to suggest the mean life-span is not 15 years. The correlation between amount of nutrient and growth is 0. 0.010 Reject the null hypothesis (P < 0.05) There is evidence to suggest the correlation is not zero. The mean height of plants exposed to sunlight equals the mean height of plants not exposed to light. 0.0001 Cut-off value is 0.05

Example Hypotheses and p-value Null Hypothesis P-Value Decision Interpretation The mean life-span is 15 years. 0.078 Do not reject the null hypothesis (P > 0.05) There is no evidence to suggest the mean life-span is not 15 years. The correlation between amount of nutrient and growth is 0. 0.010 Reject the null hypothesis (P < 0.05) There is evidence to suggest the correlation is not zero. The mean height of plants exposed to sunlight equals the mean height of plants not exposed to light. 0.0001 There is evidence to suggest light makes a difference on plan growth. Cut-off value is 0.05