Chapters 9 (NEW) Intro to Hypothesis Testing

Slides:



Advertisements
Similar presentations
June 14. In Chapter 9: 9.1 Null and Alternative Hypotheses 9.2 Test Statistic 9.3 P-Value 9.4 Significance Level 9.5 One-Sample z Test 9.6 Power and Sample.
Advertisements

Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
HS 167Basics of Hypothesis Testing1 (a)Review of Inferential Basics (b)Hypothesis Testing Procedure (c)One-Sample z Test (σ known) (d)One-sample t test.
Chapter 9 Hypothesis Tests. The logic behind a confidence interval is that if we build an interval around a sample value there is a high likelihood that.
8 - 10: Intro to Statistical Inference
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
7/2/2015Basics of Significance Testing1 Chapter 15 Tests of Significance: The Basics.
Basic Biostat8: Intro to Statistical Inference1. In Chapter 8: 8.1 Concepts 8.2 Sampling Behavior of a Mean 8.3 Sampling Behavior of a Count and Proportion.
Chapter 9 Hypothesis Testing.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Probability Population:
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Descriptive statistics Inferential statistics
Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
14. Introduction to inference
7 Elementary Statistics Hypothesis Testing. Introduction to Hypothesis Testing Section 7.1.
September 15. In Chapter 11: 11.1 Estimated Standard Error of the Mean 11.2 Student’s t Distribution 11.3 One-Sample t Test 11.4 Confidence Interval for.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
October 15. In Chapter 9: 9.1 Null and Alternative Hypotheses 9.2 Test Statistic 9.3 P-Value 9.4 Significance Level 9.5 One-Sample z Test 9.6 Power and.
Chapter 9: Testing Hypotheses
Hypothesis testing Chapter 9. Introduction to Statistical Tests.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
Chapter 14Introduction to Inference1 Chapter 14 Introduction to Inference.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
May 2004 Prof. Himayatullah 1 Basic Econometrics Chapter 5: TWO-VARIABLE REGRESSION: Interval Estimation and Hypothesis Testing.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
1 When we free ourselves of desire, we will know serenity and freedom.
Essential Statistics Chapter 141 Thinking about Inference.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
1 When we free ourselves of desire, we will know serenity and freedom.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
© Copyright McGraw-Hill 2004
1 Chapter 14 Introduction to Inference Part B. 2 Hypothesis Tests of Statistical “Significance” Test a claim about a parameter Often misunderstood elaborate.
Introduction to Hypothesis Testing
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.2 Tests About a Population.
 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
Chapter 9 Hypothesis Testing.
Test of Hypothesis Art of decision making.
Intro to Confidence Intervals Introduction to Inference
S.M.JOSHI COLLEGE,HADAPSAR Basics of Hypothesis Testing
Presentation transcript:

Chapters 9 (NEW) Intro to Hypothesis Testing Ch 8 - Ch 10 Summary 4/17/2017 Chapters 9 (NEW) Intro to Hypothesis Testing Basic Biostat Chapter 9 (new PPTs) Basic Biostat

Statistical Inference Ch 8 - Ch 10 Summary 4/17/2017 Statistical Inference Statistical inference is the act of generalizing from a sample to a population with calculated degree of certainty. using statistics calculated in the sample We want to learn about population parameters… Statistical inference is the logical process by which we make sense of numbers. It is how we generalize from the particular to the general. This is an important scientific activity. Consider a study that wants to learn about the prevalence of a asthma in a population. We draw a SRS from the population and find that 3 of the 100 individuals in the sample have asthma. The prevalence in the sample is 3%. However, are still uncertain about the prevalence of asthma in the population; the next SRS from the same population may have fewer or more cases. How do we deal with this element of chance introduced by sampling? A similar problem occurs when we do an experiment Each time we do an experiment, we expect different results due to chance. How do we deal with this chance variability? Our first task is to distinguish between parameters and statistics. Parameters are the population value. Statistics are from the sample. The statistics will vary from sample to sample: they are random variables. In contrast, the parameters are invariable; they are constants. To help draw this distinction, we use different symbols for parameters and statistics. For example, we use “mu” to represent the population mean (parameter) and “xbar” to represent the sample mean (statistic). Basic Biostat Chapter 9 (new PPTs) Basic Biostat

Parameters and Statistics We MUST draw a distinction between parameters and statistics Parameters Statistics Source Population Sample Calculated? No Yes Constants? Examples μ, σ, p Basic Biostat Chapter 9 (new PPTs)

Statistical Inference There are two forms of statistical inference: Hypothesis testing Confidence interval estimation We introduce hypothesis testing concepts with the most basic testing procedure: the one-sample z test Basic Biostat Chapter 9 (new PPTs)

One Sample z Test Objective: to test a claim about population mean µ Study conditions: Simple Random Sample (SRS) Population Normal or sample large The value of σ is known The value of μ is NOT known Basic Biostat Chapter 9 (new PPTs)

Sampling distribution of a mean What is the mean weight µ of a population of men? Sample n = 64 and calculate sample mean “x-bar” If we sampled again, we get a different x-bar Repeated samples from the same population yield different sample means Basic Biostat Chapter 9 (new PPTs)

Sampling distribution of the Mean (SDM) We form a hypothetical probability model based on the differing sample means. This distribution is called the sampling distribution of the mean i.e., the sampling distribution model used for inference Basic Biostat Chapter 9 (new PPTs)

The nature of the SDM (probability model) is predictable We use this Normal model when making inference about population mean µ when σ is known Will tend to be Normal Will be centered on population mean µ Will have standard deviation µ Basic Biostat Chapter 9 (new PPTs)

Ch 8 - Ch 10 Summary Chapter 9 4/17/2017 4/17/2017 Hypothesis Testing Objective: To test a claim about a population parameter Hypothesis testing steps Hypothesis statements Test statistic P-value and interpretation Significance level (optional) Hypothesis testing (also called significance testing) uses a quasi-deductive procedure to judge claims about parameters. Before testing a statistical hypothesis it is important to clearly state the nature of the claim to be tested. We are then going to use a four step procedure (as outlined in the last bullet) to test the claim. Basic Biostat Chapter 9 (new PPTs) Basic Biostat Basic Biostat 9

Ch 8 - Ch 10 Summary Chapter 9 4/17/2017 4/17/2017 Step A: Hypotheses Convert research question to null and alternative hypotheses The null hypothesis (H0) is a claim of “no difference” The alternative hypothesis (Ha) says “H0 is false” The hypotheses address the population parameter (µ), NOT the sample statistic (x-bar) The first step in the procedure is to state the hypotheses null and alternative forms. The null hypothesis (abbreviate “H naught”) is a statement of no difference. The alternative hypothesis (“H sub a”) is a statement of difference. Seek evidence against the claim of H0 as a way of bolstering Ha. The next slide offers an illustrative example on setting up the hypotheses. Basic Biostat Chapter 9 (new PPTs) Basic Biostat Basic Biostat 10

Step A: Hypotheses Null hypothesis H0: μ = 170 Ch 8 - Ch 10 Summary Chapter 9 4/17/2017 4/17/2017 Step A: Hypotheses Research question: Is mean body weight of a particular population of men higher than expected? Expected norm: Prior research (before collecting data) has established that the population should have mean μ = 170 pounds with standard deviation σ = 40 pounds. Beware : Hypotheses are always based on research questions and expected norms, NOT on data! Null hypothesis H0: μ = 170 Alternative hypothesis : Ha: μ > 170 (one-sided) OR Ha: μ ≠ 170 (two-sided) In the late 1970s, the weight of U.S. men between 20- and 29-years of age had a log-normal distribution with a mean of 170 pounds and standard deviation of 40 pounds. As you know, the overweight and obese conditions seems to be more prevalent today, constituting a major public health problem. To illustrate the hypothesis testing procedure, we ask if body weight in this group has increased since 1970. Under the null hypothesis there is no difference in the mean body weight between then and now, in which case μ would still equal 170 pounds. Under the alternative hypothesis, the mean weight has increased Therefore, Ha: μ > 170. This statement of the alternative hypothesis is one-sided. That is, it looks only for values larger than stated under the null hypothesis. There is another way to state the alternative hypothesis. We could state it in a “two-sided” manner, looking for values that are either higher- or lower-than expected. For the current illustrative example, the two-sided alternative is Ha: μ ≠ 170. Although for the current illustrative example, this seems unnecessary, two-sided alternative offers several advantages and are much more common in practice. Basic Biostat Chapter 9 (new PPTs) Basic Biostat Basic Biostat 11

Ch 8 - Ch 10 Summary Chapter 9 4/17/2017 4/17/2017 Step B: Test Statistic For one sample test of µ when σ is known, use this test statistic: Basic Biostat Chapter 9 (new PPTs) Basic Biostat Basic Biostat 12

Step B: Test Statistic For our example, μ0 = 170 and σ = 40 Take an SRS of n = 64 Calculate a sample mean (x-bar) of 173 Basic Biostat Chapter 9 (new PPTs)

Step C: P-Value Convert z statistics to a P-value: For Ha: μ > μ0 P-value = Pr(Z > zstat) = right-tail beyond zstat For Ha: μ < μ0 P-value = Pr(Z < zstat) = left tail beyond zstat For Ha: μ ¹ μ0 P-value = 2 × one-tailed P-value Basic Biostat Chapter 9 (new PPTs) 14

Step C: P-value (example) Use Table B to determine the tail area associated with the zstat of 0.6 One-tailed P = .2743 Two-tailed P = 2 × one-tailed P = 2 × .2743 = .5486 Basic Biostat Chapter 9 (new PPTs)

Step C: P-values P-value answer the question: What is the probability of the observed test statistic … when H0 is true? Smaller and smaller P-values provide stronger and stronger evidence against H0 Basic Biostat Chapter 9 (new PPTs)

Step C: P-values Conventions* P > 0.10  poor evidence against H0 0.05 < P  0.10  marginally evidence against H0 0.01 < P  0.05  good evidence against H0 P  0.01  very good evidence against H0 Examples P =.27  poor evidence against H0 P =.01  very good evidence against H0 * It is unwise to draw firm borders for “significance” Basic Biostat Chapter 9 (new PPTs)

Basics of Significance Testing Summary Basic Biostat Basics of Significance Testing Chapter 9 (new PPTs) 18

Step D (optional) Significance Level Let α ≡ threshold for “significance” If P-value ≤ α  evidence is significant If P-value > α  evidence not significant Example: If α = 0.01 and P-value = 0.27  evidence not significant If α = 0.01 and P-value = 0.0027  evidence is significant Basic Biostat Chapter 9 (new PPTs) 19

§9.6 Power and Sample Size Two types of decision errors: Type I error = erroneous rejection of true H0 Type II error = erroneous retention of false H0 Truth Decision H0 true H0 false Retain H0 Correct retention Type II error Reject H0 Type I error Correct rejection α ≡ probability of a Type I error β ≡ Probability of a Type II error Basic Biostat Chapter 9 (new PPTs)

Power β ≡ probability of a Type II error β = Pr(retain H0 | H0 false) (the “|” is read as “given”) 1 – β = “Power” ≡ probability of avoiding a Type II error 1– β = Pr(reject H0 | H0 false) Basic Biostat Chapter 9 (new PPTs)

Power of a z test where Φ(z) ≡ cumulative probability of Standard Normal value z μ0 ≡ population mean under H0 μa ≡ population mean under Ha with . Basic Biostat Chapter 9 (new PPTs)

Calculating Power: Example A study of n = 16 retains H0: μ = 170 at α = 0.05 (two-sided); σ is 40. What was the power of test to identify a population mean of 190?  look up cumulative probability on Table B  Basic Biostat Chapter 9 (new PPTs)

Reasoning of Power Calculation Competing “theories” Top curve (next page) assumes H0 is true Bottom curve assumes Ha is true α set to 0.05 (two-sided) Reject H0 when sample mean exceeds 189.6 (right tail, top curve) Probability of a value greater than 189.6 on the bottom curve is 0.5160, corresponding to the power of the test Basic Biostat Chapter 9 (new PPTs)

Basic Biostat Chapter 9 (new PPTs)

Sample Size Requirements Sample size for one-sample z test: where 1 – β ≡ desired power α ≡ desired significance level (two-sided) σ ≡ population standard deviation Δ = μ0 – μa ≡ the difference worth detecting Basic Biostat Chapter 9 (new PPTs)

Example: Sample Size Requirement How large a sample is needed to test H0: μ = 170 versus Ha: μ = 190 with 90% power and α = 0.05 (two-tailed) when σ = 40? Note: Δ = μ0 − μa = 170 – 190 = −20 Round up to 42 to ensure adequate power. Basic Biostat Chapter 9 (new PPTs)

Basic Biostat Chapter 9 (new PPTs)

Illustration: conditions for 90% power. Basic Biostat Chapter 9 (new PPTs)