Basic Sampling Theory for Simple and Cluster Samples

Slides:



Advertisements
Similar presentations
Sampling: Theory and Methods
Advertisements

Calculation of Sampling Errors MICS3 Regional Workshop on Data Archiving and Dissemination Alexandria, Egypt 3-7 March, 2007.
Calculation of Sampling Errors MICS3 Data Analysis and Report Writing Workshop.
Session 1: Introduction to Complex Survey Design
1 Cluster Sampling Module 3 Session 8. 2 Purpose of the session To demonstrate how a cluster sample is selected in practice To demonstrate how parameters.
Introduction Simple Random Sampling Stratified Random Sampling
Populations & Samples Objectives:
Statistical Sampling.
Estimating a Population Variance
SAMPLE DESIGN: HOW MANY WILL BE IN THE SAMPLE—DESCRIPTIVE STUDIES ?
© 2011 Pearson Education, Inc
Estimation in Sampling
Sampling: Final and Initial Sample Size Determination
Complex Surveys Sunday, April 16, 2017.
Topics: Inferential Statistics
Determining the Size of
Session 7.1 Bivariate Data Analysis
Ratio estimation with stratified samples Consider the agriculture stratified sample. In addition to the data of 1992, we also have data of Suppose.
Formalizing the Concepts: Simple Random Sampling.
Understanding sample survey data
Survey Methodology Sampling error and sample size EPID 626 Lecture 4.
Determining the Size of
Clt1 CENTRAL LIMIT THEOREM  specifies a theoretical distribution  formulated by the selection of all possible random samples of a fixed size n  a sample.
Standard error of estimate & Confidence interval.
Scot Exec Course Nov/Dec 04 Ambitious title? Confidence intervals, design effects and significance tests for surveys. How to calculate sample numbers when.
Review of normal distribution. Exercise Solution.
Sampling: Theory and Methods
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Chapter Twelve Census: Population canvass - not really a “sample” Asking the entire population Budget Available: A valid factor – how much can we.
Sampling and sampling distibutions. Sampling from a finite and an infinite population Simple random sample (finite population) – Population size N, sample.
LECTURE 3 SAMPLING THEORY EPSY 640 Texas A&M University.
Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.
Chapter 7 Estimation Procedures. Basic Logic  In estimation procedures, statistics calculated from random samples are used to estimate the value of population.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Lohr 2.2 a) Unit 1 is included in samples 1 and 3.  1 is therefore 1/8 + 1/8 = 1/4 Unit 2 is included in samples 2 and 4.  2 is therefore 1/4 + 3/8 =
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
Sampling Theory The procedure for drawing a random sample a distribution is that numbers 1, 2, … are assigned to the elements of the distribution and tables.
Populations and Samples Central Limit Theorem. Lecture Objectives You should be able to: 1.Define the Central Limit Theorem 2.Explain in your own words.
Determination of Sample Size: A Review of Statistical Theory
Estimation Chapter 8. Estimating µ When σ Is Known.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-5 Estimating a Population Variance.
Confidence Intervals (Dr. Monticino). Assignment Sheet  Read Chapter 21  Assignment # 14 (Due Monday May 2 nd )  Chapter 21 Exercise Set A: 1,2,3,7.
Learning Objective Chapter 12 Sample Size Determination Copyright © 2000 South-Western College Publishing Co. CHAPTER twelve Sample Size Determination.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
1 Mean Analysis. 2 Introduction l If we use sample mean (the mean of the sample) to approximate the population mean (the mean of the population), errors.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Copyright © 2012 by Nelson Education Limited. Chapter 5 Introduction to inferential Statistics: Sampling and the Sampling Distribution 5-1.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
Measuring change in sample survey data. Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Chapter 6 Test Review z area ararea ea
Variability. The differences between individuals in a population Measured by calculations such as Standard Error, Confidence Interval and Sampling Error.
Variability.
Dr. Unnikrishnan P.C. Professor, EEE
STANDARD ERROR OF SAMPLE
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Introduction to Sampling Distributions
Sec. 7-5: Central Limit Theorem
Central Limit Theorem General version.
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
Presentation transcript:

Basic Sampling Theory for Simple and Cluster Samples Malcolm Rosier Survey Design and Analysis Services Pty Ltd http://survey-design.com.au Copyright © 2000

Sample design The focus of the design for a sample must be on the magnitude of the standard errors of sampling not than on an arbitrary percentage of the target population. The standard errors are used to calculate confidence intervals around the sample data.

Standard errors The next sequence aims to explain standard errors, and how they relate to the underlying target population and a sample drawn from this population.

Graph: Target population Population: mean = , standard deviation = 

Graph: Sample Sample: mean = x, standard deviation = s

Graph: Means from many samples However we could get many different samples with different sample means from the population.

Graph: Distribution of sample means This gives us a sampling distribution of sample means:

Sampling distribution of sample means normal distribution mean =  = mean of underlying population distribution standard deviation =  / n

Standard error of a population mean The standard deviation of the sampling distribution of sample means is termed the standard error of a mean. standard error of population mean =  / n

Central limit theorem The central limit theorem states that the link between the mean of one sample and the population mean is given by: x =   z. se(popn mean) If z = 1.96 we produce a confidence interval where we find 95% of the sample means, relative to 

Estimated population mean However, we are not interested in finding the sample means given the population mean. Our aim is to locate the population mean given what we know about the sample.

Estimated population mean We start with a simple random sample and assume: s is a good estimate of  se(sample mean) is a good estimate of se(popn mean)

Estimated population mean Then instead of x =   z. se(popn mean) we can write  = x  z. se(sample mean) where se(sample mean) = s / n

Standard error of a proportion (srs) The standard error of a percentage (proportion) is: se(prop) =  [p(1-p)/n]

Standard error of a proportion = 0.50 (srs) For p=0.50 se(p50) =  [0.50(1-0.50)/n] The standard error may be multiplied by a finite population correction (FPC) of (N-n)/N

Confidence intervals Confidence intervals are usually expressed at the 95 per cent level (1.96 standard errors of sampling for a proportion)

Table: Effect of sample size on standard error

Two stage samples The most efficient method is usually sampling at the first stage with probability proportional to size (pps). This produces a self-weighting sample. Easier logistics for administration.

Two stage samples Stage 1 Primary sampling units (psu) are selected with a probability proportional to the size of the target population in the psu. Example of psu: schools

Two stage samples Stage 2 A random cluster of secondary sampling units (ssu) is selected at random from each of the psu. Example of ssu: students in schools

Deff Two-stage sampling is less efficient than a simple random sample (srs) of the same size. deff = (standard error of sampling for complex sample)2 / (standard error of sampling for srs)2

Deft The square root of deff is deft, which gives the ratio of the standard errors of sampling. deft = (standard error of sampling for complex sample) / (standard error of sampling for srs)

Simple equivalent sample The simple equivalent sample (ses) is the size of a simple random sample which has the same standard error as the complex sample. We sometime use the term effective sample (neff)

Simple equivalent sample The size of simple equivalent sample = size of complex sample / deff deff = 1 + (rho)(b-1) = 1 + (0.10)(20-1) = 2.9 where rho = intraclass correlation b = cluster size

Compare srs and ses For a simple random sample of n=1000, the 95 per cent confidence interval is given by: =  1.96 se(p50) =  1.96 [0.50(1-0.50)/1000] =  0.031 =  3.1%

Compare srs and ses For a simple equivalent sample of n=345 (corresponding to a complex sample of n=1000), the 95 per cent confidence interval is given by: =  1.96 se(p50) =  1.96 [0.50(1-0.50)/345] =  0.053 =  5.3%

Table: Values for deff and simple equivalent sample

Random clusters PPS sampling assumes a random cluster which is the same size for each ssu. In practice, we often draw an intact group at random. This usually increase the intraclass correlation for the sample.

Weighting Achieved samples are unlikely to properly represent the proportions of persons in the target populations for the strata. Weights are applied so that the achieved sample for each stratum represents its proportion in the total target population.

Weighting wh = Nh/nh where nh = the size of the achieved sample for the stratum Nh = the size of the target population for the stratum