"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Significance Tests Hypothesis - Statement Regarding a Characteristic of a Variable or set of variables. Corresponds to population(s) –Majority of registered.
Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Is it statistically significant?
Inferential Statistics & Hypothesis Testing
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Hypothesis testing Some general concepts: Null hypothesisH 0 A statement we “wish” to refute Alternative hypotesisH 1 The whole or part of the complement.
Elementary hypothesis testing
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Chapter 11 Multiple Regression.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Chapter 2 Simple Comparative Experiments
8-2 Basics of Hypothesis Testing
Chapter 11: Inference for Distributions
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Chapter 9 Hypothesis Testing.
Ch. 9 Fundamental of Hypothesis Testing
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Warm-up Day of 8.1 and 8.2 Quiz and Types of Errors Notes.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Chapter 10 Hypothesis Testing
Lecture Slides Elementary Statistics Twelfth Edition
Overview Definition Hypothesis
1/2555 สมศักดิ์ ศิวดำรงพงศ์
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Testing of Hypothesis Fundamentals of Hypothesis.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
1 When we free ourselves of desire, we will know serenity and freedom.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Correlation Assume you have two measurements, x and y, on a set of objects, and would like to know if x and y are related. If they are directly related,
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Slide Slide 1 Section 8-4 Testing a Claim About a Mean:  Known.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
© Copyright McGraw-Hill 2004
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright© 1998, Triola, Elementary Statistics by Addison Wesley Longman 1 Testing a Claim about a Mean: Large Samples Section 7-3 M A R I O F. T R I O.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
15 Inferential Statistics.
Chapter 9: Inferences Involving One Population
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
When we free ourselves of desire,
Chapter 9 Hypothesis Testing.
Testing Hypotheses I Lesson 9.
Presentation transcript:

"Classical" Inference

Two simple inference scenarios Question 1: Are we in world A or world B?

Possible worlds: World A World B Xnumberadded [-.5,.5]38 [-1, 1]6830 [-1.5, 1.5]8719 [-2, 2]958 [-2.5, 2.5]994 (- ∞, ∞)1001 Xnumberadded [4, 6]38 [3, 7]6830 [2, 8]8719 [1, 9]958 [0, 10]994 (- ∞, ∞)1001

Jerzy Neyman and Egon Pearson

Correct acceptance of H 0 pr(D= H 0 | T=H 0 ) = (1 –  ) Type I Error pr(D= H 1 | T=H 0 ) =  [aka size] Type II Error pr(D= H 0 | T=H 1 ) =  Correct acceptance of H 1 pr(D= H 1 | T=H 1 ) = (1 –  ) [aka power] D : Decision in favor of: H 1 : Alternative Hypothesis H 0 : Null Hypothesis T : The Truth of the matter: H 1 : Alternative Hypothesis

Definition. A subset C of the sample space is a best critical region of size α for testing the hypothesis H 0 against the hypothesis H 1 if and for every subset A of the sample space, whenever: we also have:

Neyman-Pearson Theorem: Suppose that for for some k > 0: Then C is a best critical region of size α for the test of H 0 vs. H 1.

9 When the null and alternative hypotheses are both Normal, the relation between the power of a statistical test (1 –  ) and  is given by the formula  is the cdf of N(0,1), and q  is the quantile determined by .  fixes the type I error probability, but increasing n reduces the type II error probability

Question 2: Does the evidence suggest our world is not like World A?

World A Xnumberadded [-.5,.5]38 [-1, 1]6830 [-1.5, 1.5]8719 [-2, 2]958 [-2.5, 2.5]994 (- ∞, ∞)1001

Sir Ronald Aymler Fisher

Fisherian theory Significance tests: their disjunctive logic, and p-values as evidence: ``[This very low p-value] is amply low enough to exclude at a high level of significance any theory involving a random distribution….. The force with which such a conclusion is supported is logically that of the simple disjunction: Either an exceptionally rare chance has occurred, or the theory of random distribution is not true.'' (Fisher 1959, 39)

Fisherian theory ``The meaning of `H' is rejected at level α' is `Either an event of probability α has occurred, or H is false', and our disposition to disbelieve H arises from our disposition to disbelieve in events of small probability.'' (Barnard 1967, 32)

Fisherian theory: Distinctive features Notice that the actual data x is used to define the event whose significance is evaluated. Also based on H 0 and H 1 Can only reject H 0, evidence cannot allow one to accept H 0. Many other theories besides H 0 could also explain the data.

Common philosophical simplification: Hypothesis space given qualitatively; H 0 vs. –H 0, Murderer was Professor Plum, Colonel Mustard, Miss Scarlett, or Mrs. Peacock More typical situation: Very strong structural assumptions Hypothesis space given by unknown numeric `parameters' Test uses: a transformation of the raw data, a probability distribution for this transformation (≠ the original distribution of interest)

Three Commonly Used Facts Assume is a collection of independent and identically distributed (i.i.d.) random variables. Assume also that the X i s share a mean of μ and a standard deviation of σ.

Three Commonly Used Facts For the mean estimator : 1. 2.

Three Commonly Used Facts The Central Limit Theorem. If {X 1,…, X n } are i.i.d. random variables from a distribution with mean  and variance  2, then: 3. Equivalently:

Examples Data: January 2012 CPS Sample: PhD’s, working full time, age H 0 : mean income is 75k

Hyp. Value Probability H

Comments The background conditions (e.g., the i.i.d. condition behind the sample) are a clear example of `Quine-Duhem’ conditions. When background conditions are met, ``large samples’’ don’t make inferences ``more certain’’ Multiple tests Monitoring or ``peeking'‘ at data, etc.

Point estimates and Confidence Intervals

Many desiderata of an estimator: Consistent Maximum Likelihood Unbiased Sufficient Minimum variance Minimum MSE (mean squared error) (most) efficient

By CLT: approximately: Thus: By algebra: So:

Interpreting confidence intervals The only probabilistic component that determines what occurs is. Everything else are constants. Simulations, examples Question: Why ``center’’ the interval?

Confidence Intervals $68, ± $12, ``C.I. = mean ± m.o.e’’ = ($56,745.32, $81,051.01)

Using similar logic, but different computing formulae, one can extend these methods to address further questions e.g., for standard deviations, equality of means across groups, etc.

Equality of Means: BAs SexCountMeanStd. Dev All ValueProbability

Equality of Means: PhDs SexCountMeanStd. Dev All ValueProbability