usually unimportant in social surveys:

Slides:



Advertisements
Similar presentations
Mean, Proportion, CLT Bootstrap
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Sampling: Final and Initial Sample Size Determination
POINT ESTIMATION AND INTERVAL ESTIMATION
Chapter 7 Introduction to Sampling Distributions
Sampling Distributions
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Chapter 8 Estimation: Single Population
Chapter 7 Estimation: Single Population
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Standard error of estimate & Confidence interval.
Chapter 6 Sampling and Sampling Distributions
Sample Of size 2 Of size 3 1 A,B=3,1 2 A,B,C=3,1,5 3 A,C=3,5 4
Chapter 7 Estimation: Single Population
Chapter 11: Estimation Estimation Defined Confidence Levels
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Introduction to Biostatistics and Bioinformatics Estimation II This Lecture By Judy Zhong Assistant Professor Division of Biostatistics Department of Population.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Sampling Theory The procedure for drawing a random sample a distribution is that numbers 1, 2, … are assigned to the elements of the distribution and tables.
Determination of Sample Size: A Review of Statistical Theory
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Estimation Chapter 8. Estimating µ When σ Is Known.
Sampling Distributions. Sampling Distribution Is the Theoretical probability distribution of a sample statistic Is the Theoretical probability distribution.
What is a Confidence Interval?. Sampling Distribution of the Sample Mean The statistic estimates the population mean We want the sampling distribution.
Confidence Interval & Unbiased Estimator Review and Foreword.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Sampling Theory and Some Important Sampling Distributions.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
1 STK 4600: Statistical methods for social sciences. Survey sampling and statistical demography Surveys for households and individuals.
Chapter 9 Inferences Based on Two Samples: Confidence Intervals and Tests of Hypothesis.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Example Random samples of size n =2 are drawn from a finite population that consists of the numbers 2, 4, 6 and 8 without replacement. a-) Calculate the.
Statistics for Business and Economics 7 th Edition Chapter 7 Estimation: Single Population Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Sampling and Sampling Distributions
Chapter 6 Confidence Intervals.
Chapter 6 Confidence Intervals.
Sampling Distributions
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
STATISTICAL INFERENCE
Chapter 7 Sampling and Sampling Distributions
Basic Business Statistics (8th Edition)
Review 49% of all people in the world are male. Interested whether physics majors are more likely to be male than the general population, you survey 10.
Chapter 4. Inference about Process Quality
Section 9.2 – Sample Proportions
Representativeness The aim of any sample is to represent the characteristics of the sample frame. There are a number of different methods used to generate.
Inference: Conclusion with Confidence
Sample Mean Distributions
Parameter, Statistic and Random Samples
Random Sampling Population Random sample: Statistics Point estimate
CONCEPTS OF ESTIMATION
STAT 5372: Experimental Statistics
Econ 3790: Business and Economics Statistics
Chapter 6 Confidence Intervals.
Lecture 7 Sampling and Sampling Distributions
From Samples to Populations
Sampling Distributions (§ )
STK 4600: Statistical methods for social sciences.
Chapter 8 Estimation.
STA 291 Summer 2008 Lecture 12 Dustin Lueker.
Section 9.2: Sample Proportions
Presentation transcript:

usually unimportant in social surveys: n =10,000 and N = 5,000,000: 1- f = 0.998 n =1000 and N = 400,000: 1- f = 0.9975 n =1000 and N = 5,000,000: 1-f = 0.9998 effect of changing n much more important than effect of changing n/N

The estimated variance Usually we report the standard error of the estimate: Confidence intervals for m is based on the Central Limit Theorem:

Example n N = 341 residential blocks in Ames, Iowa yi = number of dwellings in block i 1000 independent SRS for different values of n n Proportion of samples with |Z| <1.64 Proportion of samples with |Z| <1.96 30 0.88 0.93 50 70 0.94 90 0.90 0.95

For one SRS with n = 90:

Absolute value of sampling error is not informative when not related to value of the estimate For example, SE =2 is small if estimate is 1000, but very large if estimate is 3 The coefficient of variation for the estimate: A measure of the relative variability of an estimate. It does not depend on the unit of measurement. More stable over repeated surveys, can be used for planning, for example determining sample size More meaningful when estimating proportions

Estimation of a population proportion p with a certain characteristic A p = (number of units in the population with A)/N Let yi = 1 if unit i has characteristic A, 0 otherwise Then p is the population mean of the yi’s. Let X be the number of units in the sample with characteristic A. Then the sample mean can be expressed as

So the unbiased estimate of the variance of the estimator:

Examples A political poll: Suppose we have a random sample of 1000 eligible voters in Norway with 280 saying they will vote for the Labor party. Then the estimated proportion of Labor votes in Norway is given by: Confidence interval requires normal approximation. Can use the guideline from binomial distribution, when N-n is large:

In this example : n = 1000 and N = 4,000,000 Ex: Psychiatric Morbidity Survey 1993 from Great Britain p = proportion with psychiatric problems n = 9792 (partial nonresponse on this question: 316) N @ 40,000,000

General probability sampling Sampling design: p(s) - known probability of selection for each subset s of the population U Actually: The sampling design is the probability distribution p(.) over all subsets of U Typically, for most s: p(s) = 0 . In SRS of size n, all s with size different from n has p(s) = 0. The inclusion probability:

Illustration U = {1,2,3,4} Sample of size 2; 6 possible samples Sampling design: p({1,2}) = ½, p({2,3}) = 1/4, p({3,4}) = 1/8, p({1,4}) = 1/8 The inclusion probabilities:

Some results

Estimation theory probability sampling in general Problem: Estimate a population quantity for the variable y For the sake of illustration: The population total

CV is a useful measure of uncertainty, especially when standard error increases as the estimate increases Because, typically we have that

Some peculiarities in the estimation theory Example: N=3, n=2, simple random sample

For this set of values of the yi’s:

Let y be the population vector of the y-values. This example shows that is not uniformly best ( minimum variance for all y) among linear design-unbiased estimators Example shows that the ”usual” basic estimators do not have the same properties in design-based survey sampling as they do in ordinary statistical models In fact, we have the following much stronger result: Theorem: Let p(.) be any sampling design. Assume each yi can take at least two values. Then there exists no uniformly best design-unbiased estimator of the total t

Proof: This implies that a uniformly best unbiased estimator must have variance equal to 0 for all values of y, which is impossible