Is it statistically significant?

Slides:



Advertisements
Similar presentations
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Advertisements

Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
Inferential Statistics
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Chapter Seventeen HYPOTHESIS TESTING
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
10 Hypothesis Testing. 10 Hypothesis Testing Statistical hypothesis testing The expression level of a gene in a given condition is measured several.
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Chapter 11: Inference for Distributions
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Hypothesis Testing Methodology Z Test for the Mean (  Known) p-Value Approach to Hypothesis Testing.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Hypothesis Testing:.
Experimental Statistics - week 2
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Overview of Statistical Hypothesis Testing: The z-Test
Overview Definition Hypothesis
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 9 Introduction to Hypothesis Testing.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
Lesson 11 - R Review of Testing a Claim. Objectives Explain the logic of significance testing. List and explain the differences between a null hypothesis.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Chapter 4 Introduction to Hypothesis Testing Introduction to Hypothesis Testing.
Chapter 8 Introduction to Hypothesis Testing
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
STA Statistical Inference
Significance testing and confidence intervals Col Naila Azam.
Chapter 20 Testing hypotheses about proportions
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 11 Introduction to Hypothesis Testing Sir Naseer Shahzada.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
1 Hypothesis Testing A criminal trial is an example of hypothesis testing. In a trial a jury must decide between two hypotheses. The null hypothesis is.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
26134 Business Statistics Tutorial 12: REVISION THRESHOLD CONCEPT 5 (TH5): Theoretical foundation of statistical inference:
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Welcome to MM207 Unit 7 Seminar Dr. Bob Hypothesis Testing and Excel 1.
Created by Erin Hodgess, Houston, Texas Section 7-1 & 7-2 Overview and Basics of Hypothesis Testing.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
HYPOTHESIS TESTING E. Çiğdem Kaspar, Ph.D, Assist. Prof. Yeditepe University, Faculty of Medicine Biostatistics.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
1 Nonparametric Statistical Techniques Chapter 18.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
Ex St 801 Statistical Methods Part 2 Inference about a Single Population Mean (HYP)
Chapter Nine Hypothesis Testing.
Review of Testing a Claim
Keller: Stats for Mgmt & Econ, 7th Ed
Keller: Stats for Mgmt & Econ, 7th Ed Hypothesis Testing
Math 4030 – 10a Tests for Population Mean(s)
Hypothesis Testing: Hypotheses
Last Update 12th May 2011 SESSION 41 & 42 Hypothesis Testing.
Presentation transcript:

Is it statistically significant? Hypothesis testing Is it statistically significant? Most of the slides are adopted from: http://wweb.uta.edu/insyopma/baker/STATISTICS/Keller7/Keller%20PP%20slides-7/Chapter11.ppt

Example In a trial a jury must decide between two hypotheses. Null hypothesis H0: The defendant is innocent Alternative hypothesis H1: The defendant is guilty The jury must make a decision on the basis of evidence presented. In the language of statistics convicting the defendant is called rejecting the null hypothesis in favor of the alternative hypothesis. That is, the jury is saying that there is enough evidence to conclude that the defendant is guilty. If the jury acquits, then it is stating that there is not enough evidence to support the alternative hypothesis.

Goal Assess the evidence provided by data in favor of some claim (null hypothesis) about the population. The procedure begins with the assumption that the null hypothesis is true. The goal is to determine whether there is enough evidence to infer that the alternative hypothesis is true, or the null is not likely to be true. Results of the test are expressed in terms of a probability This probability measures how well the data and the hypothesis agree.

Example usage in our work Usually we need to find if the performance of our proposed algorithm is statistically superior than that of a base line algorithm. General procedure For i=1:n Generate a random train/test dataset. Run both algorithms and get performance numbers. Record the difference between the performance of the proposed and base line algorithms. Let it be x. Null hypothesis H0 : x <= 0 i.e. base line better than proposed algorithm. Alternate hypothesis H1 : x > 0 Our algorithm is better. We want to reject H0 hypothesis in favor of H1 with certain confidence value.

Statistical Example Consider mean demand for computers during assembly lead time. The operations manager wants to know whether the mean is different from 350 units. Our test statistics is over mean demand. Null hypothesis: H0 = 350 Our alternative hypothesis becomes: H1 ≠ 350 Assume σ = 75 and the sample size n = 25, and the sample mean is calculated to be 370.16. A large value of (say, 600) provides enough evidence. If is close to 350 (say, 355), then this does not provide a great deal of evidence to infer that the population mean is different than 350.

Is the Sample Mean in the Guts of the Sampling Distribution? Statistical Example Assume H0 : μ = 350 is true. Determine the sampling distribution of sample mean . Assume has mean 350. Variance of is 75/sqrt(25) = 15 Is the Sample Mean in the Guts of the Sampling Distribution?

Hypothesis Test 1: Unstandardized test For the guts to be the center 95% of the distribution [α = 0.05]. The critical values that define the guts will be 1.96 standard deviations of X-Bar UCV = 350 + 1.96*15 = 379.4 LCV = 350 – 1.96*15 = 320.6

Hypothesis Test 2: Z-score Calculate z score of the sample mean. Z = ( - )/ = (370.16 – 350)/15 = 1.344 Is this Z-Score in the guts of the sampling distribution?

Hypothesis Test 3: p-value Increase “Rejection Region” until it “captures” sample mean. P( > 370.16) = P(Z > 1.344) = 0.0901 p-value = double of this area for two tailed test = 0.1802 Since rejection region is defined to be 5% and our sample mean is in the 18.02% region, it in not in rejection region.

Statistical Conclusions Unstandardized Test Statistic Since LCV (320.6) < (370.16) < UCV (379.4), we reject the null hypothesis at a 5% level of significance. Standardized Test Statistic Since -Z/2(-1.96) < Z(1.344) < Z/2 (1.96), we fail to reject the null hypothesis at a 5% level of significance. P-value Since p-value (0.1802) > 0.05 [], we fail to reject the null hypothesis at a 5% level of significance

Interpreting the p-value Overwhelming Evidence (Highly Significant) Strong Evidence (Significant) Weak Evidence (Not Significant) No Evidence (Not Significant) 0 .01 .05 .10

Assumptions Distribution of test statistics follows standard deviation. Generally one appeals to the central limit theorem to justify assuming that a test statistic varies normally. Central limit theorem (CLT): mean (x bar) of a sufficiently large number of independent random variables (x), each with finite mean and variance, will be approximately normally distributed. If the variation of the test statistic is strongly non-normal, a Z-test should not be used. should be known or can be estimated reliably. when is same for each x and all x are independent from each other. [Follows from var(c.x) = c2 var(x) and var (x1 + x2) = var (x1)+ var (x2) if x1 and x2 are independent]. A Z-test is appropriate when you are handling moderate to large samples (n > 30).

Hypothesis Test 4: Student’s t-test The test statistic (x bar) follows a Student’s T-distribution if the null hypothesis is true. T-test is appropriate for small sample size (n < 30). Great if the populations’ standard deviation is unknown. s= sample standard deviation. Once a t value is determined, a p-value can be found using a table of values from Student's t-distribution The Student t-distribution is symmetric and bell-shaped, like the normal distribution, but has heavier tails

Student’s t-test Two sample t-test can be used to compare means of two independent and identically distributed samples.

Hypothesis Test 5: Wilcoxon Signed Test A non-parametric statistical hypothesis test. Alternative to the paired Student's t-test when the population cannot be assumed to be normally distributed. H0 : μ = 0 Rank xis based on their magnitude. Let their rank be Ri. Wilcoxon signed rank statistics φi = I(xi > 0) Score Find the critical value for the given sample size n and the wanted confidence level. Compare S to the critical value, and reject H0 if S is less than or is equal to the critical value.

Thank you 

Two types of Errors Type I error Type II error H0 T F I II When we reject a true null hypothesis. That is, jury convicts an innocent person. P(Type 1 error) = α [usually 0.05 or 0.01] Type II error When we don’t reject a false null hypothesis. That is, when a guilty defendant is acquitted. P(Type 2 error) = β H0 T F Reject I II The two probabilities are inversely related. Decreasing one increases the other, for a fixed sample size.