Summary.

Slides:



Advertisements
Similar presentations
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Hypothesis testing Week 10 Lecture 2.
Review: What influences confidence intervals?
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
Statistics for Managers Using Microsoft® Excel 5th Edition
Hypothesis Testing: Two Sample Test for Means and Proportions
Probability Population:
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Overview of Statistical Hypothesis Testing: The z-Test
Overview Definition Hypothesis
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Descriptive statistics Inferential statistics
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Tue, Oct 23, 2007.
Statistical inference: confidence intervals and hypothesis testing.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Fundamentals of Hypothesis Testing: One-Sample Tests
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Chapter 8 Hypothesis Testing. Section 8-1: Steps in Hypothesis Testing – Traditional Method Learning targets – IWBAT understand the definitions used in.
14. Introduction to inference
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Overview Basics of Hypothesis Testing
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
SUMMARY Hypothesis testing. Self-engagement assesment.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Chapter 6 USING PROBABILITY TO MAKE DECISIONS ABOUT DATA.
Chapter 10 AP Statistics St. Francis High School Fr. Chris.
Section 9.2 Hypothesis Testing Proportions P-Value.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Correct decisions –The null hypothesis is true and it is accepted –The null hypothesis is false and it is rejected Incorrect decisions –Type I Error The.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
26134 Business Statistics Tutorial 11: Hypothesis Testing Introduction: Key concepts in this tutorial are listed below 1. Difference.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
SUMMARY. Central limit theorem Statistical inference If we can’t conduct a census, we collect data from the sample of a population. Goal: make conclusions.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Fri, Nov 12, 2004.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Wed, Nov 1, 2006.
© Copyright McGraw-Hill 2004
Formulating the Hypothesis null hypothesis 4 The null hypothesis is a statement about the population value that will be tested. null hypothesis 4 The null.
SUMMARY Hypothesis testing. Self-engagement assesment.
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Aim: What is the P-value method for hypothesis testing? Quiz Friday.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
9.3 Hypothesis Tests for Population Proportions
Presentation transcript:

summary

Central limit theorem

Statistical inference If we can’t conduct a census, we collect data from the sample of a population. Goal: make conclusions about that population

Confidence interval for 𝑛≥30: 𝑥 ±𝑍× 𝑠 𝑛 for 𝑛<30: 𝑥 ± 𝑡 𝑛−1 × 𝑠 𝑛 critical value kritická hodnota for 𝑛≥30: 𝑥 ±𝑍× 𝑠 𝑛 margin of error možná odchylka for 𝑛<30: 𝑥 ± 𝑡 𝑛−1 × 𝑠 𝑛

The margin of error depends on margin of error=neco× 𝑠 𝑛 The margin of error depends on neco – the confidence level (common is 95%) s or σ – the variability of data the sample size 𝑛 Margin of error does not measure anything else than chance variation. It doesn’t measure any bias or errors that happen during the process. It does not tell anything about the correctness of your data!!!

hypothesis testing

Aim of hypothesis testing decision making

Engagement self-assessment Hopefully, you like this course so far. How to measure this? At the scale between 1 and 10, self-report how engaged you think you’re during the lecture (1 is the lowest value, 10 is the highest value).

Engagement distribution 1 2 3 4 5 6 7 8 9 10

Engagement distribution 𝜇=7.8 𝜎=0.76 1 2 3 4 5 6 7 8 9 10

Engagement distribution 𝑛=30 𝑀=𝜇=7.8 𝑆𝐸= 𝜎 𝑛 = 0.76 30 =0.14 𝜇=7.8 𝜎=0.76 1 10 7.8

Engagement distribution 𝑛=30 𝑀=𝜇=7.8 𝑆𝐸= 𝜎 𝑛 = 0.76 30 =0.14 𝜇=7.8 𝜎=0.76 1 10 7.8

Hypothesis testing song 𝑛=30 𝑥 =8.2 8.2 1 7.8

Did the song help? 𝑛=30, 𝑀=7.8, 𝑆𝐸=0.14 Mean engagement of randomly chosen 30 students from the population of 100 students can lie anywhere on the blue curve. What is the probability of getting the value of at least 8.2? Use Z-tables. 8.2−7.8 0.14 =2.85=𝑍 corresponding probability is 0.0022 8.2 1 7.8 𝑛=30, 𝑀=7.8, 𝑆𝐸=0.14

Situation on the battlefield From the no-song population, we have 0.22% chance that randomly sampled data (n=30) will have the mean engagement of 8.2 or more. 30 students that were subjected to the song testing showed mean engagement of 8.2. If there was no difference between no-song and song populations, it is rather unlikely (0.22% is really not that much) that we have chosen 30 students with the mean engagement of 8.2. Conclusion: Because of such a low probability, we interpret 8.2 as a significat increase over 7.8 caused by undeniable pedagogical qualities of the 'Hypothesis testing song'.

Hypothesis testing So, where are the hypotheses? Null hypothesis H0 The song did not cause any observable effect. H0 generally – there is no significant difference between current population parameters and new population parameters after some sort of intervention (𝜇= 𝜇 𝐼 ). Alternative (research) hypothesis H1, Ha The song improved students' engagement H1 generally – there is a significant difference between current population parameters and new population parameters after an intervention.

Null hypothesis H0 states that nothing happened, there is no change, no difference. Does the song improve students' engagement? H0: students' engagement is the same regardless the song. Does diet coke tastes differently than whole coke? H0: there is no difference in the taste between diet and whole coke. Does this drug increases the blood pressure? H0: this drug causes no increase in the blood pressure. Is this a fair coin? H0: the coin is fair.

Alternative hypothesis The alternative hypothesis states the opposite to the null and is usually the hypothesis you are trying to prove. Does the song improve students' engagement? Ha: song improved students' engagement. Does diet coke tastes differently than whole coke? Ha: diet coke tastes much worse/better than whole coke. Does this drug increases the blood pressure? Ha: this drug leads to increase in the blood pressure. Is this a fair coin? Ha: the coin is unfair.

Hypothesis testing Formulate the null hypothesis. There is no statistical testing without the null. You assume that H0 is true. I.e., you assume that the hypothesis song did not influence students' engagement. Then, you must find an evidence for rejecting or not rejecting the null. Collect the data. Population data, no song: μ = 7.8, 𝜎 = 0.76 Sample data, song fired: n = 30, 𝑥 =8.2

Hypothesis testing Then, you will calculate the probability of observing your sample results (or more extreme) given that the null hypothesis is true. Null hypothesis which we consider true: teaching with and without song produces the same effect. Higher value in the mean of the song sample ( 𝑥 =8.2) compared to the no-song mean (μ = 7.8) is caused only by the variability in random sampling.

Hypothesis testing If there is really no difference between the two teaching methods (with and without song) in the population (i.e., given that the null hypothesis is true), how likely would it be to see a difference in the mean engagement between the two teaching methods as large as (or larger than) that which has been observed in our sample? Given the null is true (both populations are the same), there is only 0.22% chance that the random sampling would lead to the mean engagement of 8.2. The value of 0.0022 is referred to as p-value.

Levels of likelihood Conclusion: 8.2 is a statistically significant increase over 7.8. We want a crisp decision, i.e. which probability is still considered 'unlikely' and which is considered as 'likely'?

Levels of likelihood - 𝛼 levels 0.05 (5%) 0.01 (1%) 0.001 (0.1%) If the probability of getting sample mean is less than 0.05 – 0.01 – 0.001 then it is usually considered unlikely. These are called the 𝜶 levels. Or significance levels (hladiny významnosti). 𝛼 level is our criteria for deciding if something is likely or unlikely.

Lingo Probability by which we decide if the result is likely or unlikely is called p-value. And we compare the p-value to the α level: if the p-value is less than the α level then such a result is considered to be unlikely. Or, alternatively, we can calculate Z-score of the result and compare it with the Z-score corresponding to the α level (so-called Z-critical value).

Z-critical value If the probability of obtaining a particular sample mean is less than alpha level then it will fall in this tail which is called the critical region. Z* Z-critical value If the Z-score of the sample mean is greater than the Z-critical value we have an evidence that this mean is different from the regular population (the population that had not watched the musical lesson).

Critical regions What is the Z-critical value for 𝛼=0.05? Using Z-table you find Z-value for 0.95 probability. Which is 1.65. What is the Z-critical value for 𝛼=0.01? 2.33 What is the Z-critical value for 𝛼=0.001? 3.08

We take mean from the sample of size 𝑛. Then we calculate its Z-score 𝑍= 𝑥 −𝜇 𝜎 𝑛 And we get the Z-score of 1.82. We say that this is significant at 𝑝<0.05. 1.82 is somewhere in the red region in the previous picture. It is less than 0.05, but not less than 0.01. It means that a probability of obtaining this sample mean is less than 5%, but is not less than 1%. And remember, 0.05 is the alpha level. Test statistic

Significance quiz Z-score Significant at 3.14 p < 2.07 2.57 14.31 𝛼 level Z-critical value 0.05 1.65 0.01 2.32 0.001 3.08 Significance quiz Z-score Significant at 3.14 p < 2.07 2.57 14.31

Significance quiz Z-score Significant at 3.14 p < 0.001 2.07 𝛼 level Z-critical value 0.05 1.65 0.01 2.32 0.001 3.08 Significance quiz Z-score Significant at 3.14 p < 0.001 2.07 p < 0.05 2.57 p < 0.01 14.31

Sampling distribution Quiz Sampling distribution Focus on 𝛼=0.05 Which of the following are true, if the null hypothesis were true? If the probability of getting a particular sample mean is less than 𝛼, it is unlikely to occur. If a sample mean has a Z-score greater than Z*, it is “unlikely” to occur. If the probability of getting a particular sample mean is “unlikely”, the sample mean is in he orange region. The alpha level corresponds to the orange region. Z*

Another engagement score After the sample of 30 students heard the „Hypotheses testing song“, their mean engagement score is 7.13. Just to remind you. The population parameters are 𝜇=7.8, 𝜎=0.76 Z-score of this sample mean: 𝑍= 7.13−7.8 0.76 30 =−4.83 Z-score is -4.83, what does it mean?

Two-tailed test (oboustranný test) mean engagement score of 7.13 is significant at p < 0.05 𝛼=0.05 Z=??

One-tailed Two-tailed 𝛼 2 𝛼 2 𝜶 level Z-critical value 0.05 ±1.65 0.01 critical region probability = 𝛼 critical region probability = 𝛼 𝛼 2 𝛼 2 The critical region can also be on the left. 𝜶 level Z-critical value 0.05 ±1.65 0.01 ±2.32 0.001 ±3.08 𝜶 level Z-critical value 0.05 ±1.96 0.01 ±2.57 0.001 ±3.27 In the critical region, we (most likely) did not get sample mean by chance.

One-tailed and two-tailed one-tailed (directional) test two-tailed (non-directional) test

One-tailed or two-tailed In general, we use two-tailed tests. One exception to this general rule is when we’re comparing a new treatment with an established treatment. In such cases we often only care if the new treatment is better than the old one. And we would use a one-tailed directional test.

Alternative hypothesis 𝜇≠ 𝜇 𝐼 𝜇< 𝜇 𝐼 𝜇> 𝜇 𝐼 two-tailed test one-tailed test

Quiz – reject the null What does it mean to reject the null using two-tailed test? Our sample mean falls within/outside the critical region. The Z-score of our sample mean is less than/greater than the Z-critical value. The probability of obtaining the sample mean is less than/greater than the alpha level.

Four steps of hypothesis testing Formulate the null and the alternative (this includes one- or two-directional test) hypothesis. Select the significance level α – a criterion upon which we decide that the claim being tested is true or not. --- COLLECT DATA --- Compute the p-value. The p-value is the probability that the data would be at least as extreme as those observed, if the null hypothesis were true. Compare the p-value to the α-level. If p ≤ α, the observed effect is statistically significant, the null is rejected, and the alternative hypothesis is valid.