8 - 10: Intro to Statistical Inference

Slides:



Advertisements
Similar presentations
AP Statistics Section 11.2 B. A 95% confidence interval captures the true value of in 95% of all samples. If we are 95% confident that the true lies in.
Advertisements

Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
HS 167Basics of Hypothesis Testing1 (a)Review of Inferential Basics (b)Hypothesis Testing Procedure (c)One-Sample z Test (σ known) (d)One-sample t test.
Chapters 9 (NEW) Intro to Hypothesis Testing
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
7/2/2015Basics of Significance Testing1 Chapter 15 Tests of Significance: The Basics.
Basic Biostat8: Intro to Statistical Inference1. In Chapter 8: 8.1 Concepts 8.2 Sampling Behavior of a Mean 8.3 Sampling Behavior of a Count and Proportion.
BPS - 3rd Ed. Chapter 131 Confidence intervals: the basics.
Chapter 9 Hypothesis Testing.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Inference about Population Parameters: Hypothesis Testing
AM Recitation 2/10/11.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview Definition Hypothesis
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
14. Introduction to inference
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
BPS - 3rd Ed. Chapter 141 Tests of Significance: The Basics.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
October 15. In Chapter 9: 9.1 Null and Alternative Hypotheses 9.2 Test Statistic 9.3 P-Value 9.4 Significance Level 9.5 One-Sample z Test 9.6 Power and.
Section 9.2 Testing the Mean  9.2 / 1. Testing the Mean  When  is Known Let x be the appropriate random variable. Obtain a simple random sample (of.
Stat 1510 Statistical Inference: Confidence Intervals & Test of Significance.
Essential Statistics Chapter 131 Introduction to Inference.
INTRODUCTION TO INFERENCE BPS - 5th Ed. Chapter 14 1.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
Chapter 14Introduction to Inference1 Chapter 14 Introduction to Inference.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
CHAPTER 17: Tests of Significance: The Basics
June 16. In Chapter 8: 8.1 Concepts 8.2 Sampling Behavior of a Mean 8.3 Sampling Behavior of a Count and Proportion.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
Chapter 10: Basics of Confidence Intervals
Essential Statistics Chapter 141 Thinking about Inference.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 Chapter 9: Introduction to Inference. 2 Thumbtack Activity Toss your thumbtack in the air and record whether it lands either point up (U) or point down.
AP Statistics Section 11.1 B More on Significance Tests.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
BPS - 5th Ed. Chapter 151 Thinking about Inference.
© Copyright McGraw-Hill 2004
1 Chapter 14 Introduction to Inference Part B. 2 Hypothesis Tests of Statistical “Significance” Test a claim about a parameter Often misunderstood elaborate.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
More on Inference.
More on Inference.
Essential Statistics Introduction to Inference
Chapter 10: Basics of Confidence Intervals
Intro to Confidence Intervals Introduction to Inference
Presentation transcript:

8 - 10: Intro to Statistical Inference Ch 8 - Ch 10 Summary 4/17/2017 Chapters 8 – 10 (Summary) Basic Biostat 8 - 10: Intro to Statistical Inference Basic Biostat

Statistical Inference Ch 8 - Ch 10 Summary 4/17/2017 Statistical Inference Statistical inference is the act of generalizing from a sample to a population with calculated degree of certainty. using statistics calculated in the sample We want to learn about population parameters… Statistical inference is the logical process by which we make sense of numbers. It is how we generalize from the particular to the general. This is an important scientific activity. Consider a study that wants to learn about the prevalence of a asthma in a population. We draw a SRS from the population and find that 3 of the 100 individuals in the sample have asthma. The prevalence in the sample is 3%. However, are still uncertain about the prevalence of asthma in the population; the next SRS from the same population may have fewer or more cases. How do we deal with this element of chance introduced by sampling? A similar problem occurs when we do an experiment Each time we do an experiment, we expect different results due to chance. How do we deal with this chance variability? Our first task is to distinguish between parameters and statistics. Parameters are the population value. Statistics are from the sample. The statistics will vary from sample to sample: they are random variables. In contrast, the parameters are invariable; they are constants. To help draw this distinction, we use different symbols for parameters and statistics. For example, we use “mu” to represent the population mean (parameter) and “xbar” to represent the sample mean (statistic). Basic Biostat 8 - 10: Intro to Statistical Inference Basic Biostat

Parameters and Statistics We MUST draw distinctions between parameters and statistics Parameters Statistics Source Population Sample Calculated? No Yes Constants? Examples μ, σ, p Basic Biostat 8 - 10: Intro to Statistical Inference

Statistical Inference There are two forms of statistical inference: Hypothesis (“significance”) tests Confidence intervals Basic Biostat 8 - 10: Intro to Statistical Inference

Example: NEAP math scores Young people have a better chance of good jobs and wages if they are good with numbers. NAEP math scores Range from 0 to 500 Have a Normal distribution Population standard deviation σ is known to be 60 Population mean μ not known We sample n = 840 young mean Sample mean (“x-bar”) = 272 Population mean µ unknown We want to estimate population mean NEAP score µ Reference: Rivera-Batiz, F. L. (1992). Quantitative literacy and the likelihood of employment among young adults. Journal of Human Resources, 27, 313-328. Basic Biostat 8 - 10: Intro to Statistical Inference

Conditions for Inference Data acquired by Simple Random Sample Population distribution is Normal or large sample The value of σ is known The value of μ is NOT known Basic Biostat 8 - 10: Intro to Statistical Inference

8 - 10: Intro to Statistical Inference The distribution of potential sample means: The Sampling Distribution of the Mean Sample means will vary from sample to sample In theory, the sample means form a sampling distribution The sampling distribution of means is Normal with mean μ and standard deviation equal to population standard deviation σ divided by the square root of n: This relationship is known as the square root law This statistic is known as the standard error Basic Biostat 8 - 10: Intro to Statistical Inference

Standard Error of the mean For our example, the population is Normal with σ = 60 (given). Since n = 840, Standard error Basic Biostat 8 - 10: Intro to Statistical Inference

Margin of Error m for 95% Confidence The 68-95-99.7 rule says 95% of x-bars will fall in the interval μ ± 2∙SExbar More accurately, 95% will fall in μ ± 1.96∙SExbar 1.96∙SExbar is the margin of error m for 95% confidence Basic Biostat 8 - 10: Intro to Statistical Inference

In repeated independent samples: We call these intervals Confidence Intervals (CIs) Basic Biostat 8 - 10: Intro to Statistical Inference

How Confidence Intervals Behave Basic Biostat 8 - 10: Intro to Statistical Inference

Other Levels of Confidence Confidence intervals can be calculated at various levels of confidence by altering coefficient z1-α/2 α (“lack of confidence level”) .10 .05 .01 Confidence level (1–α)100% 90% 95% 99% z1-α/2 1.645 1.960 2.576 Basic Biostat 8 - 10: Intro to Statistical Inference

(1–α)100% Confidence Interval for for μ when σ known Basic Biostat 8 - 10: Intro to Statistical Inference

8 - 10: Intro to Statistical Inference Examples of CIs for μ For 99% confidence: z1 –(α/2) = z1 –(.01/2) = z.995 = 2.576 For 90% confidence, z1-α/2 = z1 –.10/2 = z.95 = 1.645 Basic Biostat 8 - 10: Intro to Statistical Inference

8 - 10: Intro to Statistical Inference Margin of Error (m) Margin of error (m) quantifies the precision of the sample mean as an estimatir of μ. The direct formula for m is: Note that m is a function of confidence level 1 – α (as confidence goes up, z increase and m increases) population standard deviation σ (this is a function of the variable and cannot be altered) sample size n (as n goes up, m decreases; to decrease m, increase n!) Basic Biostat 8 - 10: Intro to Statistical Inference

8 - 10: Intro to Statistical Inference Tests of Significance Recall: two forms of statistical inference Confidence intervals Hypothesis tests of statistical significance Objective of confidence intervals: to estimate a population parameter Objective of a test of significance: to weight the evidence against a “claim” Basic Biostat 8 - 10: Intro to Statistical Inference

Tests of Significance: Reasoning As with confidence intervals, we ask what would happen if we repeated the sample or experiment many times Let X ≡ weight gain Assume population standard deviation σ = 1 Take an SRS of n = 10, SExbar = 1 / √10 = 0.316 Ask: Has there weight gain in the population? Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 17

Tests of Statistical Significance: Procedure A. The claim is stated as a null hypothesis H0 and alternative hypothesis Ha A test statistic is calculated from the data The test statistic is converted to a probability statement called a P-value The P-value is interpreted Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 18

Test for a Population Mean – Null Hypothesis HS 67 Ch 8 - Ch 10 Summary Monday, April 17, 2017 4/17/2017 Test for a Population Mean – Null Hypothesis Example: We want to test whether data in a sample provides reliable evidence for a population weight gain The null hypothesis H0 is a statement of “no weight gain” In our the null hypothesis is H0: μ = 0 The first step in the procedure is to state the hypotheses null and alternative forms. The null hypothesis (abbreviate “H naught”) is a statement of no difference. The alternative hypothesis (“H sub a”) is a statement of difference. Seek evidence against the claim of H0 as a way of bolstering Ha. The next slide offers an illustrative example on setting up the hypotheses. Basic Biostat 8 - 10: Intro to Statistical Inference 9: Basics of Hypothesis Testing 19 Basic Biostat The Basics of Significance Testing 19

Alternative Hypothesis The alternative hypothesis Ha is a statement that contradicts the null. In our weight gain example, the alternative hypothesis can be stated in one of two ways One-sided alternative Ha: μ > 0 (“positive weight change in population”) Two-sided alternative Ha: μ ≠ 0 (“weight change in the population”) Basic Biostat 8 - 10: Intro to Statistical Inference

Test Statistic Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 21

Test Statistic, Example Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 22

P-Value from z table Convert z statistics to a P-value: For Ha: μ > μ0 P-value = Pr(Z > zstat) = right-tail beyond zstat For Ha: μ < μ0 P-value = Pr(Z < zstat) = left tail beyond zstat For Ha: μ ¹ μ0 P-value = 2 × one-tailed P-value Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 23

P-value: Interpretation P-value (definition) ≡ the probability the sample mean would take a value as extreme or more extreme than observed test statistic when H0 is true P-value (interpretation) Smaller-and-smaller P-values → stronger-and-stronger evidence against H0 P-value (conventions) .10 < P < 1.0  evidence against H0 not significant .05 < P ≤ .10  marginally significant .01 < P ≤ .05  significant 0 < P ≤ .01  highly significant Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 24

P-value: Example zstat = 3.23 One-sided P-value = Pr(Z > 3.23) = 1 − 0.9994 = 0.0006 Two-sided P-value = 2 × one-sided P = 2 × 0.0006 = 0.0012 Conclude: P = .0012 Thus, data provide highly significant evidence against H0 Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 25

Significance Level α ≡ threshold for “significance” If we choose α = 0.05, we require evidence so strong that it would occur no more than 5% of the time when H0 is true Decision rule P-value ≤ α  evidence is significant P-value > α  evidence not significant For example, let α = 0.01 P-value = 0.0006 Thus, P < α  evidence is significant Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 26

Summary Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 27

Relation Between Tests and CIs The value of μ under the null hypothesis is denoted μ0 Results are significant at the α-level of when μ0 falls outside the (1–α)100% CI For example, when α = .05  (1–α)100% = (1–.05)100% = 95% confidence When we tested H0: μ = 0, two-sided P = 0.0012. Since this is significant at α = .05, we expect “0” to fall outside that 95% confidence interval Basic Biostat 28

Relation Between Tests and CIs Data: x-bar = 1.02, n = 10, σ = 1 Notice that 0 falls outside the 95% CI, showing that the test of H0: μ = 0 will be significant at α = .05 Basic Biostat Basics of Significance Testing 8 - 10: Intro to Statistical Inference 29