Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.

Slides:



Advertisements
Similar presentations
Tests of Hypotheses Based on a Single Sample
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Inference Sampling distributions Hypothesis testing.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Review: What influences confidence intervals?
Class Handout #3 (Sections 1.8, 1.9)
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
Business Statistics for Managerial Decision
Inference about a Mean Part II
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Statistics for Managers Using Microsoft® Excel 5th Edition
Probability Population:
Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview Definition Hypothesis
Confidence Intervals and Hypothesis Testing - II
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Fundamentals of Hypothesis Testing: One-Sample Tests
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
Topic 5 Statistical inference: point and interval estimate
Significance Tests: THE BASICS Could it happen by chance alone?
LECTURE 19 THURSDAY, 14 April STA 291 Spring
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Chapter 21: More About Tests “The wise man proportions his belief to the evidence.” -David Hume 1748.
Stat 1510 Statistical Inference: Confidence Intervals & Test of Significance.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Statistical Inference
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
CHAPTER 17: Tests of Significance: The Basics
Chapter 10 AP Statistics St. Francis High School Fr. Chris.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
CHAPTER 9 Testing a Claim
Section 10.1 Confidence Intervals
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Chapter 221 What Is a Test of Significance?. Chapter 222 Thought Question 1 The defendant in a court case is either guilty or innocent. Which of these.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Chapter 10: Confidence Intervals
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
AP Statistics Section 11.1 B More on Significance Tests.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
AP Statistics.  If our data comes from a simple random sample (SRS) and the sample size is sufficiently large, then we know that the sampling distribution.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
CHAPTER 9 Testing a Claim
Problems: Q&A chapter 6, problems Chapter 6:
Presentation transcript:

Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed lots of SRSs, the “average sample frequency of visits” would be centered around the true “average population frequency of visits.”

Inference Suppose that instead we interviewed an SRS of n = 400. Our estimates will be more reliable because estimates from other SRSs would be similar … that is, our estimates would be less variable.

Because we didn’t have money for 16 separate samples, we actually only collected data from the first sample, whose sample mean is = Is the true number actually 50? Is the difference between 50 and purely a fluke? Does this result exclude 50 as a possibility?

The Central Limit Theorem says that if the entire population has a mean  and a standard deviation , then in repeated samples of size n the sample mean approximately follows a Normal distribution

The first sample had a mean = and a standard deviation = nStrd Dev of = = Sample A = /sqrt(10) Sample B = /sqrt(400)

We know that 95% of all observations fall within ± two standard deviations of the mean. Likewise, 95% of all sample means fall within ± two standard deviations of the observed sample mean. So, for 1900 out of 2,000 samples, the interval will contain the true population mean.

2 x (0.0513)

Now there are two possibilities. Either 1. the true population mean is contained in the interval 2. or this is one of those 5% of samples whose interval does not contain the true value. ( , )

C is typically set at 95%, but it’s sometimes chosen to be 90% or 99%. STATA Exercise 1

z* = 1.96 if C=95%-z*= if C=95%

So don’t use 2 when constructing a 95% CI: use 1.96.

If the margin of error is too large… Reduce  –  is determined by the population: a population with a lot of variability will increase the chance that a sample contain observations very far from the true mean. – This is easier to say than to do.

16 samples. The  of the population increases from 1 to 4, increasing the spread of the sample and the likelihood of getting  wrong.

If the margin of error is too large… Increase the sample size (larger n)

If the margin of error is too large… Be less confident of your estimate … Use a lower confidence level (make C smaller, hence a smaller z*)

If the margin of error is too large… “We’re 99% sure that the President will receive 51.5% of the votes, with a ±5% margin of error.” “We’re 95% sure that the President will receive 51.5% of the votes, with a ±3% margin of error.” “We’re 90% sure that the President will receive 51.5% of the votes, with a ±1% margin of error.”

Cautions 1. Is it an SRS? 2. Is the data unbiased (or do we know the bias)? 3. Are there no outliers that influence the sample mean? 4. Is n large? If not, is the underlying population Normally distributed? 5. Do you know the true  ? Theorems of mathematical statistics are true; statistical methods are effective only when used with skill.

Cautions FALSE: “The probability that the true mean falls within is 95%” – This is false because either the interval contains the true population mean (which is not a random variable), with Pr=1, or it doesn’t, with Pr=0. TRUE: “The probability that the interval is one of the ones that contain the true mean is 95%”

Tests of Significance

Making claims about the population parameters In our sample, we observed a mean of visits to the mall per year. – Assuming that the true population mean is 50, how likely is it that we observe a sample mean as small as , or even smaller? – if the true population mean were 45, how likely is it that we observe a sample mean as large as , or even larger?

Making claims about the population parameters

x if the true population mean were 45, how likely is it that we observe a sample mean as large as ? Pr=0.68% If the true population mean were 50, how likely is it that we observe a sample mean at least as small as ? Pr=49% Making claims about the population parameters

We found that if the true mean is 45, the Pr of observing a sample mean as large as is 0.68%. Either – we’ve observed a very rare event (our sample is really unusual) – the true mean is not 45. There’s another number that makes the observed sample more likely. Making claims about the population parameters A sample outcome that would be extreme if a hypothesis were true is evidence that they hypothesis is not true.

H 0 :  =45 H a :  45 This is a two- sided alternative hypothesis These are hypothesis about the population.

Test Statistics A test statistic measures compatibility between the null hypothesis and the data. The z-score can be used as a test statistic because we can compare it against 1.96, the z-score that delimits a 0.95 area under the Normal curve. – 1.96 is called the appropriate “critical value”.

Test Statistics The Student’s t Distribution is used when n is small. It approximates the Standard Normal, z- distribution as n gets large.

Test Statistics We know that 95% of all values are between 2 standard deviations of the mean. That is, 95% of all values are between the z- score of 1.96 and the z-score of So if we get a sample outcome whose z- score is greater than 1.96 (in absolute value), we know that it it is unlikely to belong to the population of which the null hypothesis is a parameter.

Suppose – n = 110 –  = 26.4 – x = 8.1 – H 0 :  = 0 – H a :   0

Exercise A company makes cellphones using components from two countries: Ecuador and Canadaguay. Here are data on days of cellphone durability. Your retail shop buys 100 cellphones because the manufacturer claims they were made in Ecuador. On average, they stop working after 279 days of use. Is this difference (279 days versus 300 days) significant? Is it a fluke or does it mean something? # days till broken  Ecuador Canadaguay10050

Exercise The null hypothesis is that the phone typically lasts 300 days. Alternatively, it’s a lower quality phone. The z-score can tell us how far this observation is from the mean. Look up in table A the probability of observing a z-score as small as this or smaller.

Exercise Suppose the parameters were, instead Now, is this difference (279 days versus 300 days) significant? Is it a fluke or does it mean something? # days till broken  Ecuador300200

Exercise Suppose average durability of the 100 cellphones was, instead, 90 days. Now, is this difference (90 days versus 300 days) significant? Is it a fluke or does it mean something? # days till broken  Ecuador300200

We found that if the true mean is 45, the Pr of observing a sample mean as large as is 0.68%. Notice that here H 0 :  = 45 H a :  > 45 This is a one- sided alternative hypothesis

Look this up in Table D, 20-1 degrees of freedom. We have to use the Student’s t because n is small.

Tests for Population Mean 1. State the hypothesis 2. Calculate the test statistics 3. Find the P-value 4. State your conclusion in the context of your specific setting

C = 1-  for two-sided tests

 = x = H 0 :  = 0.86 n = 3 Look in Table D for the z-score on a two-tailed 1% significance level (look in the column) for df = 3-1. Is it smaller (in absolute value) than ?

 = x = H 0 :  = 0.86 n = 3 The 99% CI is ( , ) cii , level(99) Look up the t* for df=3-1, upper tail probability 0.005

P-values versus a fixed  If the z-score is , the corresponding p- value is The p-value is the smallest level of  at which the data are significant. Remember that C = 1-  for two-sided tests, and that bigger Confidence means wider CI. “The smallest level of  ” then mean the largest C and widest CI that will still contain the hypothesized value.

H 0 :  x p-value

If the P-value is larger than the chosen significance level , we say that the statistic is not significant. If the P-value is smaller than the chosen significance level , we say that the statistic is not significant.

Using Significance Tests

. tabstat guess grade diff if position<8 stats | guess grade diff mean | tabstat guess grade diff if position>=8 stats | guess grade diff mean | Is it true that, on average, people who finish earlier tend to do better? (Notice causality is not determined).

Significance Tests H 0 is our hypothesis: how plausible is it, given the data, our statistic, and its sampling variation? – If a priori H 0 seems true, very small p-values will be needed to convince people that H 0 are wrong. A small p-value means that your estimated statistic is so far from H 0 that it’s unlikely that your statistics is derived from a population where H 0 is true. H0H0

Significance Tests H 0 is our hypothesis: what are the consequences of rejecting H 0. – If rejecting H 0 led to huge changes in our behavior, with large costs, we’ll need to be very convinced. H0H0

Significance Tests Decide on a significance level, . – Remember  = 1 - C, where C is the confidence level Check if the P-value is below your pre- decided significance level. H 0 :  x p-value H 0 :  x p-value

Significance Tests Check for the practical significance (the actual size of the number) of a statistic that is statistically significant. Do exploratory data analysis. – Check for outliers. – Check for the Normality of the data. Report confidence intervals. Excel and icosahedron exercise 1

Power and Inference as a Decision

A more powerful test will be more likely to reject a false H 0 in favor of true alternatives.