Statistical Hypothesis Testing. Suppose you have a random variable X ( number of vehicle accidents in a year, stock market returns, time between el nino.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Is it statistically significant?
1 Hypothesis testing. 2 A common aim in many studies is to check whether the data agree with certain predictions. These predictions are hypotheses about.
1 Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Type I and Type II Errors One-Tailed Tests About a Population Mean: Large-Sample.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Elementary hypothesis testing
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Chapter 6 Hypotheses texts. Central Limit Theorem Hypotheses and statistics are dependent upon this theorem.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Inferences About Process Quality
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
BCOR 1020 Business Statistics
5-3 Inference on the Means of Two Populations, Variances Unknown
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Estimation and Hypothesis Testing. The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Chapter 7 Hypothesis testing. §7.1 The basic concepts of hypothesis testing  1 An example Example 7.1 We selected 20 newborns randomly from a region.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Confidence intervals and hypothesis testing Petter Mostad
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Random Variables A random variable is a variable whose value is determined by the outcome of a random experiment. Example: In a single die toss experiment,
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
© Copyright McGraw-Hill 2004
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 Hypothesis Testing.
Chapter 5 STATISTICAL INFERENCE: ESTIMATION AND HYPOTHESES TESTING
Sample Mean Distributions
Hypothesis Testing: Hypotheses
Chapter 9 Hypothesis Testing.
Chapter 9 Hypothesis Testing.
Discrete Event Simulation - 4
Presentation transcript:

Statistical Hypothesis Testing

Suppose you have a random variable X ( number of vehicle accidents in a year, stock market returns, time between el nino events, etc.) from some unknown distribution D. You formulate a hypothesis that a function f:X ->  of this random variable satisfy the condition H 0 : f(X) = c vs H 1 : f(X) > c ( or f(X) < c ) Some usual examples are f(X) = E(X), i.e. the population mean is equal to some known value c=µ 0, or f(X) = Var(X) and c=  0 2. For historical reasons, H 0 is called the null hypothesis, and H 1 the alternative. Given some samples from the distribution D, we want to make a decision whether to accept or reject H 0, based on statistics from the samples. Statistical hypothesis testing is a statement about probabilities. Nothing is guaranteed. A good test should lead to a correct decision “most of the time”.

What you need: 1)A good sample statistic or estimator for f(X), call it m(X 1,X 2,…,X n ). What is a good sample statistic. Example, f(X) = E(X) = expected value of X. A bad statistic: you have 100 samples, and you use the sample statistic m = (X 1 + X 2 + … + X 10 )/10 for your sample mean. A better statistic: m = (X 1 + X 2 + … + X n )/n. It is good since it is an unbiased statistic (E(m) = theoretical mean of D, in the above example) and have minimum variance. Other criteria for goodness are consistency and linearity 2)Large enough sample size The more data, the better the sample estimate is. The law of large numbers states the sample mean (X 1 + X 2 + … + X n )/n converges to the true mean of the distribution D as n -> ∞, provided that the population mean exists, X i ’s are independent and has the same distribution as D.

3) f(X) exists and is finite Implicit in the previous statements is the assumption that f(X) exists and has a finite value. For example, if f(X) = E(X), then we’re assuming that the distribution D has a finite mean. This may not be the case, say when D is Cauchy distributed. The simplest Cauchy distribution has pdf (probability density function) It can be seen that the mean is not defined and neither is the variance, so it does not make sense to test for E(X) =µ 0. Be careful what you’re testing for. By the way, the ratio of two independent normal N(0,1) random variables is Cauchy distributed.

No assumption is made about the normality of the distribution D, and it does not have to be so. However, it is known that if D has finite mean and finite variance, and if the sample size n is large (~100 or more), then the statistic m(X 1,X 2,…,X n ) = (X 1 + X 2 + … + X n )/n approximates a normal distribution. This is the so-called Central Limit Theorem of probability. This result is often applied to justify the use of normal distribution statistics in large datasets. It is important to observe that the distribution D’ (not identical to D) of the test statistic m is known completely, given that the null hypothesis is true.

How does it work: Let m=m(X 1,X 2,…,X n ) be our test statistic. m itself is a random variable, with probability distribution D’. By assuming that the null hypothesis is true, the distribution D’ is determined completely, and we can find the value q (either directly or using lookup tables) such that Prob( Z > m ) = q where Z is a standard random variable with distribution D’. We then select a number α (called the significance level, usually =.05), and reject H 0 : if q < α accept H 0 : if q > α

What can happen: H 0 True False p β (type II error) 1- p (type I error) 1-β Accept Reject α=1-p is the significance level of the test. This is usually set to a value of This means that we want the probability of making a type I error (false positives) to be small, ie. Prob(Reject H 0 | H 0 is true) = 0.05

Another way of interpreting this: when we reject H 0 using the test statistic, there is only a 5% chance that we could be wrong. Note that accepting the null hypothesis H 0 is a weaker conclusion than rejecting H 0. It only means that the test provide no evidence to contradict our assumption that H 0 is true. In some cases, it may be desirable to minimize type II errors as well. The power of a test is defined by Prob(Reject H 0 | H 0 is false) = 1- β The more powerful a test, the less chance of making a type II error. The function H(u,m) = Prob(Z > m | f(X)=u) is known as the power function of the test.

Example: Annual rainfall data for 8 years in inches at some locality 34.1, 33.7,27.4,31.1,30.9,35.2,28.4,32.1 Historically known to be normally distributed with mean=30 inches but with unknown variability. It is hypothesized that the mean annual rainfall has increased as of late. H 0 : µ = 30 H 1 : µ > 30 Sample statistic m = = s 2 = sample variance = 7.5. Note has Student’s t distribution with 8-1=7 degrees of freedom, therefore Prob(Z > 31.6 | µ =30) = Prob( ) =.072 >.05 So we accept the null hypothesis at.05 significance level. No evidence to support µ > 30 inches with this test. Suppose we have evidence from another dataset collected nearby that the mean annual rainfall is actually 31 in. The power of our test is then Prob(Z > 31.6 | µ =31) = Prob( ) =.28 with β=1-.28 =.72.

Conclusion: Hypothesis testing is just another tool for analyzing data. It cannot be relied upon by itself as proof or disproof of an assumption. It may not even be practical. Independent verification of the test result using other methods is always necessary. “Absence of evidence is not the same as evidence of absence.” - Carl Sagan