Chapter 2 HYPOTHESIS TESTING

Slides:



Advertisements
Similar presentations
Hypothesis testing Another judgment method of sampling data.
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inferential Statistics & Hypothesis Testing
Becoming Acquainted With Statistical Concepts CHAPTER CHAPTER 12.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Chapter Seventeen HYPOTHESIS TESTING
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Sample size computations Petter Mostad
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Chapter 7 Hypothesis Testing 7-1 Overview 7-2 Fundamentals of Hypothesis Testing.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Choosing Statistical Procedures
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
1.State your research hypothesis in the form of a relation between two variables. 2. Find a statistic to summarize your sample data and convert the above.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
McGraw-Hill/Irwin Business Research Methods, 10eCopyright © 2008 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 17 Hypothesis Testing.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Chapter 3 INTERVAL ESTIMATES
More on Inference.
BAE 6520 Applied Environmental Statistics
BAE 5333 Applied Water Resources Statistics
Lecture Nine - Twelve Tests of Significance.
Chapter 3 INTERVAL ESTIMATES
Hypothesis Testing: One Sample Cases
Part Four ANALYSIS AND PRESENTATION OF DATA
Chapter 9 Multiple Linear Regression
Chapter 4 Comparing Two Groups of Data
Assumptions For testing a claim about the mean of a single population
How Psychologists Ask and Answer Questions Statistics Unit 2 – pg
Hypothesis Tests: One Sample
Hypothesis Testing: Hypotheses
Statistical Process Control
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Chapter 9 Hypothesis Testing.
More on Inference.
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Chapter 9: Hypothesis Testing
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter 9: Hypothesis Tests Based on a Single Sample
Hypothesis Testing.
Hypothesis Testing.
Chapter 24 Comparing Two Means.
Nonparametric Statistics
Analyzing and Interpreting Quantitative Data
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Chapter 2 HYPOTHESIS TESTING BAE 5333 Applied Water Resources Statistics Biosystems and Agricultural Engineering Department Division of Agricultural Sciences and Natural Resources Oklahoma State University Source Dr. Dennis R. Helsel & Dr. Edward J. Gilroy 2006 Applied Environmental Statistics Workshop and Statistical Methods in Water Resources

Which Statistical Test to Use? Type of Variables Grouped and Continuous Qualitative and Quantitative Distribution of Data Normal Non-normal

Statistical Test Based on Variable Type

Parametric vs. Nonparametric Parametric Test Assumes Specific underlying distribution for the data Or invoke the Central Limit Theorem (see next slide) Computed with mean, standard deviation (i.e. uses parameters) Normal Distribution most common assumption. Data should look symmetric.

Central Limit Theorem (CLT) Given a distribution with a mean (μ) and variance (σ2), the sampling distribution of the mean approaches a normal distribution with a mean (μ) and a variance σ2/N as N, the sample size, increases. No matter what the shape of the original distribution, the sampling distribution of the mean approaches a normal distribution. For most distributions, a normal distribution is approached very quickly as N increases. Parametric tests use the sample mean, which the CLT tells us will be approximately normally distributed. Source: http://davidmlane.com/hyperstat/A14043.html

Central Limit Theorem Example Computer sampled N values from a uniform distribution and computed the mean. This procedure was performed 500 times for samples sizes of 1, 4, 7, and 10. Mean N=Sample Size Value (x) a b Probability Uniform Distribution Source: http://davidmlane.com/hyperstat/A14461.html

Central Limit Theorem Examples Central Limit Theorem Applet http://www.stat.sc.edu/~west/javahtml/CLT.html Demonstrates the CLT using dice rolling Large number of rolls  bell-shaped curve ONE Die  approximate uniform distribution The Central Limit Theorem - How to Tame Wild Populations http://intuitor.com/statistics/CentralLim.html

Central Limit Theorem, cont. A sampling distribution always has less variability than the population. σs = σp / n1/2 σs = standard deviation of sampling distribution σp = standard deviation of the population n = sample size n = 100  1/10 the variability of the population n = 10000  1/100 the variability of the population The sampling distribution will act more like a normal distribution as the sample size increases, for any type distribution. If the CLT did not exist, we could not use parametric statistics since we could not reliably estimate a parameter like the mean by using an average derived from a much smaller sample. Source: http://intuitor.com/statistics/CentralLim.html

Parametric vs. Nonparametric Nonparametric Test Does not assume a specific distribution for the data (Distribution Free) Uses median, percentiles Information extracted from positions (ranks) of the data Data may be symmetric or skewed

Choice Based on Data Distribution Close to a Normal Distribution? Yes Use nonparametric test or transform data Parametric or nonparametric test

How Hypothesis Tests Work Choose a test procedure based on type of variable and data distribution Establish hypothesis Null (Ho) Alternative (Ha) Select alpha (significance level) Compute test statistic and obtain p-value (measure of how much evidence we have against the null hypothesis) Compare p-value to alpha Reject Ho if p-value < alpha

Null vs. Alternative Hypothesis Null Hypothesis (Ho) Assumed true until found unlikely by data The null state: no difference between groups, no correlation, no trend, etc. Alternative Hypothesis (Ha) Is chosen if null hypothesis is not supported by data Two Types: one-sided, two-sided http://www.itl.nist.gov/

Total Suspended Solids (TSS) in the Cimarron River Null Hypothesis (Ho): TSS is the same in the Cimarron River and the Arkansas River Alternative Hypothesis (Ha) Two-sided: TSS is different in the two rivers One-sided: TSS is higher in the Cimarron River One-sided: TSS is lower in the Cimarron River Reject Null Hypothesis when p < alpha

Alternative Hypothesis One or Two-sided? Two Sided (not equal to) Change can be high or low “Has TSS changed?” “Are values different between the Cimarron River and the Arkansas River?” One Sided (< or >) Change is directional “Has TSS increased?” “Are values higher between the Cimarron River and the Arkansas River?”

Outliers (p value) in Red Two-sided Test Outliers (p value) in Red

Outliers (p value) in Red One-sided Test Outliers (p value) in Red

How Do You Decide an Alpha Level? Alpha (significance level) is a management decision What percent error can I live with? Set by scientist, legislation, or management Common: 5% (alpha=0.05) Alpha is the probability of rejecting the null hypothesis when it is actually true (Type I Error)

How Do You Decide an Alpha Level? Continued… Choosing an alpha level: strength of evidence needed depends on study objectives, and cost of rejecting null hypothesis Small alpha 0.05 Preponderance of evidence Smaller alpha 0.03 Clear and convincing Very small alpha 0.01 Beyond a reasonable doubt

Type I and Type II Errors Type I Error (α) – occurs if one rejects the null hypothesis when it is true Type II Error (β) – occurs if one does NOT reject the null hypothesis when it is false Ho True Ho False Type I Error α Correct Decision 1- α Type II β 1- β Reject Ho Do Not Reject Ho Source: Elementary Statistics, 6th ed., Allan Bluman, McGraw-Hill

Type III Error Correctly rejecting the null hypothesis for the wrong reason. Correctly rejecting the null hypothesis, but incorrectly attributing the cause. Correctly identifying an effect, but incorrectly attributing the cause of the effect. Source: http://en.wikipedia.org/wiki/Type_III_error

Reading Assignment Chapter 4 Hypothesis Testing (pages 97 to 116) Statistical Methods in Water Resources by D.R. Helsel and R.M. Hirsch