SUMMARY. Central limit theorem Statistical inference If we can’t conduct a census, we collect data from the sample of a population. Goal: make conclusions.

Slides:



Advertisements
Similar presentations
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Summary.
Hypothesis testing Week 10 Lecture 2.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
Review: What influences confidence intervals?
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
BCOR 1020 Business Statistics
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Hypothesis Testing: Two Sample Test for Means and Proportions
Probability Population:
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Overview of Statistical Hypothesis Testing: The z-Test
STATISTICS ELEMENTARY MARIO F. TRIOLA Chapter 7 Hypothesis Testing
Overview Definition Hypothesis
Confidence Intervals and Hypothesis Testing - II
Presented by Mohammad Adil Khan
Descriptive statistics Inferential statistics
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Tue, Oct 23, 2007.
Statistical inference: confidence intervals and hypothesis testing.
Mid-semester feedback In-class exercise. Chapter 8 Introduction to Hypothesis Testing.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Introduction to Hypothesis Testing
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Overview Basics of Hypothesis Testing
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Chapter 10 Hypothesis Testing
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
SUMMARY Hypothesis testing. Self-engagement assesment.
Chapter 10 AP Statistics St. Francis High School Fr. Chris.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Section 9.2 Hypothesis Testing Proportions P-Value.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Correct decisions –The null hypothesis is true and it is accepted –The null hypothesis is false and it is rejected Incorrect decisions –Type I Error The.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Fri, Nov 12, 2004.
© Copyright McGraw-Hill 2004
Formulating the Hypothesis null hypothesis 4 The null hypothesis is a statement about the population value that will be tested. null hypothesis 4 The null.
SUMMARY Hypothesis testing. Self-engagement assesment.
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Last lecture summary Population 2015.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
9.3 Hypothesis Tests for Population Proportions
Presentation transcript:

SUMMARY

Central limit theorem

Statistical inference If we can’t conduct a census, we collect data from the sample of a population. Goal: make conclusions about that population

Confidence interval Point estimate Interval estimate margin of error možná odchylka critical value kritická hodnota interval spolehlivosti hladina spolehlivosti

HYPOTHESIS TESTING

Aim of hypothesis testing decision making

Engagement self-assessment Hopefully, you like this course so far. How to measure this? At the scale between 1 and 10, self-report how engaged you think you’re during the lecture (1 is the lowest value, 10 is the highest value).

Engagement distribution

Engagement distribution

Hypothesis testing song

Null hypothesis no song song Null hypothesis: I assume that populations without and with song are same.

Sampling from population 7.8

Sampling from population 7.8 Mean engagement of randomly chosen 30 students from student population can lie anywhere on the blue curve.

Hypothesis testing song

Did the song help? If we assume that the null hypothesis is true, what is the probability that we randomly chose such 30 students that their mean engagement is at least 8.2? Use Z-tables

Conclusion If we assume that two populations are same (i.e., we assume that the song does not influence the engagement, null hypothesis is true), than the observed sample mean of 8.2 is only due to the fact that we chose 30 students that are substantially more engaged than it would be expected (the expectation value of the population, i.e. population mean, is 7.8). However, it is very unlikely that we select such 30 students just by chance. The probability of selecting such 30 students only by chance is really very low (0.22%).

Conclusion But, we observed the mean of 8.2 in our sample! How is it possible, if the probability of observing this result is only 0.22%? The only possible explanation is that our assumption about the identity of two populations is not correct. We should reject the null hypothesis. Conclusion: Because of such a low probability, we interpret 8.2 as a significant increase over 7.8 caused by undeniable pedagogical qualities of the 'Hypothesis testing song'. HURRAY, THE SONG WORKS

Clear-cut decision We decided that the song works because the probability of 0.22% is so low that it is very unlikely that we observed the mean of 8.2 just by chance. However, what if the probability would be 1%? Is it likely or unlikely? Or 3%? Or 5%? Or 8%? Or 15%? Where is the line between likely and unlikely? If you have such a line, you can make a clear-cut decision. If the probability is lower than this line, null hypothesis is rejected. If the probability is higher than this line, we failed to reject the null hypothesis.

Lingo Probability by which we decide if the result is likely or unlikely is called the p-value. In our example, the p-value is A line between likely and unlikely is called the α level (hladina významnosti). If the p-value is less than the α level then such result is considered to be unlikely.

α levels α level (or significance level, hladina významnosti) is our criteria for deciding if something is likely or unlikely. If the probability of getting sample mean is less than 0.05 – 0.01 – then it is considered unlikely (5%) 0.01 (1%) (0.1%)

Hypothesis testing

Null hypothesis H 0 states that nothing happened, there is no change, no difference. Does the song improve students' engagement? H 0 : students' engagement is the same regardless the song. Does a diet coke taste differently than a full-fat coke? H 0 : there is no difference in the taste between diet and full-fat cokes. Does this drug increases the blood pressure? H 0 : this drug causes no increase in the blood pressure. Is this a fair coin? H 0 : the coin is fair.

Alternative hypothesis The alternative hypothesis states the opposite to the null and is usually the hypothesis you are trying to prove. Does the song improve students' engagement? H a : song improved students' engagement. Does a diet coke tastes differently than a full-fat coke? H a : a diet coke tastes much worse/better than a full-fat coke. Does this drug increases the blood pressure? H a : this drug leads to an increase in the blood pressure. Is this a fair coin? H a : the coin is unfair.

Hypothesis testing Formulate the null hypothesis. There is no statistical testing without the null. You assume that H 0 is true. Then, you collect data. If you find enough evidence in your data for rejecting H 0, you do so and accept H a. Otherwise, you failed to reject H 0. If H 0 is rejected at the given significance level (i.e., H a is accepted), the result is called statistically significant.

Z-critical value If the Z-score of the sample mean is greater than the Z- critical value we have an evidence that this mean is different from the regular population (the population that had not watched the musical lesson). If the probability of obtaining a particular sample mean is less than alpha level then it will fall in this tail which is called the critical region. Z* Z-critical value

Critical regions

Test statistic

Quiz Sampling distribution Z*

Another engagement score

Two-tailed test (oboustranný test) Z=?? mean engagement score of 7.13 (Z = -4.83) is significant at p < 0.05

One-tailed Z-critical value 0.05± ± ±3.08 Two-tailed Z-critical value 0.05± ± ±3.27 In the critical region, we (most likely) did not get sample mean by chance. The critical region can also be on the left.

One-tailed and two-tailed one-tailed (directional) test two-tailed (non-directional) test

One-tailed or two-tailed In general, we use two-tailed tests. One exception to this general rule is when we’re comparing a new treatment with an established treatment. In such cases we often only care if the new treatment is better than the old one. And we would use a one-tailed directional test.

Alternative hypothesis two-tailed test one-tailed test

Quiz – reject the null What does it mean to reject the null using two-tailed test? Our sample mean falls within/outside the critical region. The Z-score of our sample mean is less than/greater than the Z- critical value. The probability of obtaining the sample mean is less than/greater than the alpha level.

Four steps of hypothesis testing 1. Formulate the null and the alternative (this includes one- or two-directional test) hypothesis. 2. Select the significance level α – a criterion upon which we decide that the claim being tested is true or not. --- COLLECT DATA Compute the p-value. The p-value is the probability that the data would be at least as extreme as those observed, if the null hypothesis were true. 4. Compare the p-value to the α-level. If p ≤ α, the observed effect is statistically significant, the null is rejected, and the alternative hypothesis is valid.