Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

“Students” t-test.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Copyright © 2011 Pearson Education, Inc. Statistical Tests Chapter 16.
1 A REVIEW OF QUME 232  The Statistical Analysis of Economic (and related) Data.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Chapter 8 Estimation: Additional Topics
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Hypothesis Testing: One Sample Mean or Proportion
Point estimation, interval estimation
Introduction to Econometrics The Statistical Analysis of Economic (and related) Data.
4.1 All rights reserved by Dr.Bill Wan Sing Hung - HKBU Lecture #4 Studenmund (2006): Chapter 5 Review of hypothesis testing Confidence Interval and estimation.
Introduction to Econometrics The Statistical Analysis of Economic (and related) Data.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Inference about a Mean Part II
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Chapter 7 Estimation: Single Population
Inferences About Process Quality
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
5-3 Inference on the Means of Two Populations, Variances Unknown
“There are three types of lies: Lies, Damn Lies and Statistics” - Mark Twain.
CHAPTER 23 Inference for Means.
Hypothesis Tests and Confidence Intervals in Multiple Regressors
Statistical inference: confidence intervals and hypothesis testing.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chapter 7 Estimation: Single Population
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 9-2 Inferences About Two Proportions.
Topic 5 Statistical inference: point and interval estimate
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Estimates and Sample Sizes Lecture – 7.4
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 16 Statistical Tests.
Confidence intervals and hypothesis testing Petter Mostad
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 9-1 Review and Preview.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
Copyright © Cengage Learning. All rights reserved. 9 Inferences Based on Two Samples.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.Copyright © 2010 Pearson Education Section 9-3 Inferences About Two Means:
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
© Pearson Education Limited 2015 Introduction to Econometrics, Third Edition Update Chapters 1, 2 and 3 The statistical analysis of economic (and related)
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Review of Probability and Statistics
Chapter 5 STATISTICAL INFERENCE: ESTIMATION AND HYPOTHESES TESTING
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
Introduction to Econometrics
Review of Statistics.
Presentation transcript:

Review of Statistics

 Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots and Sample Correlation

Estimation of the Population Mean  One natural way to estimate the population mean,, is simply to compute the sample average from a sample of n i.i.d. observations. This can also be motivated by law of large numbers.  But, is not the only way to estimate. For example, Y 1 can be another estimator of.

Copyright © 2003 by Pearson Education, Inc. 3-4 Key Concept 3.1

 In general, we want an estimator that gets as close as possible to the unknown true value, at least in some average sense.  In other words, we want the sampling distribution of an estimator to be as tightly centered around the unknown value as possible.  This leads to three specific desirable characteristics of an estimator.

Three desirable characteristics of an estimator. Let denote some estimator of,  Unbiasedness:  Consistency:  Efficiency. Let be another estimator of, and suppose both and are unbiased. Then is said to be more efficient than if

Properties of  It can be shown that E( )= and (from law of large numbers), is both unbiased and consistent.  But, is efficient?

Examples of alternative estimators. Example 1: The first observation Y 1 ? Since E(Y 1 )=, Y 1 is an unbiased estimator of. But,  if n≥2, is more efficient than Y 1.

Example 2: where n is assumed to be an even number. The mean of is and its variance is Thus is unbiased and, because Var( ) → 0 as n→∞, is consistent. However, is more efficient than.

In fact, is the most efficient estimator of among all unbiased estimators that are weighted averages of Y 1, …, Y n. (Weighted average implies that the estimators are all unbiased.)

Hypothesis Testing The hypothesis testing problem (for the mean): make a provisional decision, based on the evidence at hand, whether a null hypothesis is true, or instead that some alternative hypothesis is true. That is, test

Copyright © 2003 by Pearson Education, Inc Key Concept 3.5

Terminology  Significance level; p-value; critical value  Confidence interval; acceptances region, rejection region  Size; power

 p-value = probability of drawing a statistic (e.g. ) at least as adverse to the null as the value actually computed with your data, assuming that the null hypothesis is true.  The significance level of a test is a pre-specified probability of incorrectly rejecting the null, when the null is true.

Calculating the p-value based on : where is the value of actually observed.  To compute the p-value, you need to know the distribution of.  If n is large, we can use the large-n normal approximation.

where denotes the standard deviation of the distribution of.

Calculating the p-value with  Y known

Type I and Type II Error  Type I Error (Red) , Type II Error (Blue) α /2 β 對立假設

Confidence Intervals A 95% confidence interval for is an interval that contains the true value of Y in 95% of repeated samples. Digression: What is random here? the confidence interval— it will differ from one sample to the next; the population parameter,, is not random.

In practice, is unknown—it must be estimated. Estimator of the variance of Y: Fact: If (Y 1, …, Y n ) are i.i.d. and E(Y 4 )< ∞, then  Why does the law of large numbers apply? Because is a sample average.  Technical note: we assume E(Y 4 )< ∞ because here the average is not of Y i, but of its square.

For the first term,  Define, then, and W 1, …, W n are i.i.d. .  Thus W 1, …, W n are i.i.d. and Var(W i ) < ∞, so  Therefore,  For the second term, because Therefore,

Computing the p-value with estimated:

The p-value and the significance level With a prespecified significance level (e.g. 5%):  reject if |t| >  equivalently: reject if p ≤  The p-value is sometimes called the marginal significance level.

What happened to the t-table and the degrees of freedom?  Digression: the Student t distribution If Y i, i = 1,…, n is i.i.d. N(  Y, ), then the t-statistic has the Student t-distribution with n – 1 degrees of freedom. The critical values of the Student t-distribution is tabulated in the back of all statistics books. Remember the recipe? Compute the t-statistic Compute the degrees of freedom, which is n – 1 Look up the 5% critical value If the t-statistic exceeds (in absolute value) this critical value, reject the null hypothesis.

Comments on this recipe and the Student t- distribution The theory of the t-distribution was one of the early triumphs of mathematical statistics. It is astounding, really: if Y is i.i.d. normal, then you can know the exact, finite-sample distribution of the t- statistic – it is the Student t. So, you can construct confidence intervals (using the Student t critical value) that have exactly the right coverage rate, no matter what the sample size. But…. 27

Comments on Student t distribution, ctd. If the sample size is moderate (several dozen) or large (hundreds or more), the difference between the t-distribution and N(0,1) critical values are negligible. Here are some 5% critical values for 2-sided tests: 28

Comments on Student t distribution, ctd.  So, the Student-t distribution is only relevant when the sample size is very small; but in that case, for it to be correct, you must be sure that the population distribution of Y is normal. In economic data, the normality assumption is rarely credible. Here are the distributions of some economic data. Do you think earnings are normally distributed? Suppose you have a sample of n = 10 observations from one of these distributions – would you feel comfortable using the Student t distribution? 29

30

Comments on Student t distribution, ctd. Consider the t-statistic testing the hypothesis that two means (groups s, l) are equal: Even if the population distribution of Y in the two groups is normal, this statistic doesn’t have a Student t distribution! There is a statistic testing this hypothesis that has a normal distribution, the “pooled variance” t-statistic – see SW (Section 3.6) – however the pooled variance t-statistic is only valid if the variances of the normal distributions are the same in the two groups. Would you expect this to be true, say, for men’s v. women’s wages? 31

The Student-t distribution – summary  The assumption that Y is distributed N(  Y, ) is rarely plausible in practice (income? number of children?)  For n > 30, the t-distribution and N(0,1) are very close (as n grows large, the t n–1 distribution converges to N(0,1))  The t-distribution is an artifact from days when sample sizes were small and “computers” were people  For historical reasons, statistical software typically uses the t- distribution to compute p-values – but this is irrelevant when the sample size is moderate or large.  For these reasons, in this class we will focus on the large-n approximation given by the CLT 32

Summary on The Student t-distribution If Y is distributed, then the t-statistic has the Student t- distribution (tabulated in back of all stats books) Some comments:  For n > 30, the t-distribution and N(0,1) are very close.  The assumption that Y is distributed is rarely plausible in practice (income? number of children?)  The t-distribution is an historical artifact from days when sample sizes were very small.  In this class, we won’t use the t distribution - we rely solely on the large-n approximation given by the CLT.

Confidence Intervals A 95% confidence interval for is an interval that contains the true value of Y in 95% of repeated samples. Digression: What is random here? the confidence interval— it will differ from one sample to the next; the population parameter,, is not random.

A 95% confidence interval can always be constructed as the set of values of not rejected by a hypothesis test with a 5% significance level.

This confidence interval relies on the large-n results that is approximately normally distributed and

Summary: From the assumptions of: (1) simple random sampling of a population, that is, (2) we developed, for large samples (large n): Theory of estimation (sampling distribution of ) Theory of hypothesis testing (large-n distribution of t-statistic and computation of the p-value). Theory of confidence intervals (constructed by inverting test statistic). Are assumptions (1) & (2) plausible in practice? Yes

Tests for Difference between Two Means Let be the mean hourly earning in the population of women recently graduated from college and let be population mean for recently graduated men. Consider the null hypothesis that earnings for these two populations differ by certain amount d, then

Replace population variances by sample variances, we have the standard error and the t-statistic is If both n m and n w are large, the t-statistic has a standard normal distribution.

40

Summarize the relationship between variables Scatterplots:

The population covariance and correlation can be estimated by the sample covariance and sample correlation. The sample covariance is The sample correlation is

It can be shown that under the assumptions that (X i, Y i ) are i.i.d. and that X i and Y i have finite fourth moments,

It is easy to see that the second term converges in probability to zero because so by Slutsky’s theorem.

By the definition of covariance, we have To apply the law of large numbers on the first term, we need to have which is satisfied since

 The second inequality follows by applying the Cauchy-Schwartz inequality, and the last inequality follows because of the finite fourth moments for (X i, Y i ). The Cauchy-Schwartz inequality is

 Applying the law of large numbers, we have  Also,, therefore

Scatterplots and Sample Correlation