Ch 5: Equal probability cluster samples

Slides:



Advertisements
Similar presentations
Multistage Sampling.
Advertisements

1 Cluster Sampling Module 3 Session 8. 2 Purpose of the session To demonstrate how a cluster sample is selected in practice To demonstrate how parameters.
Ch 5: Cluster sampling with equal probabilities
Section 1.3 Experimental Design.
Chapter 5 Stratified Random Sampling n Advantages of stratified random sampling n How to select stratified random sample n Estimating population mean and.
Sampling with unequal probabilities STAT262. Introduction In the sampling schemes we studied – SRS: take an SRS from all the units in a population – Stratified.
Ch 4: Stratified Random Sampling (STS)
AP Statistics C5 D2 HW: p.287 #25 – 30 Obj: to understand types of samples and possible errors Do Now: How do you think you collect data?
Dr. Chris L. S. Coryn Spring 2012
Fundamentals of Sampling Method
PROBABILITY SAMPLING: CONCEPTS AND TERMINOLOGY
Chapter 12 Sample Surveys
Ratio estimation with stratified samples Consider the agriculture stratified sample. In addition to the data of 1992, we also have data of Suppose.
A new sampling method: stratified sampling
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
PROBABILITY SAMPLING: CONCEPTS AND TERMINOLOGY
Sampling Procedures and sample size determination.
Key terms in Sampling Sample: A fraction or portion of the population of interest e.g. consumers, brands, companies, products, etc Population: All the.
17 June, 2003Sampling TWO-STAGE CLUSTER SAMPLING (WITH QUOTA SAMPLING AT SECOND STAGE)
Sampling: Design and Procedures
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Sampling.
1 Sampling for EHES Principles and Guidelines Johan Heldal & Susie Cooper Statistics Norway.
Sampling January 9, Cardinal Rule of Sampling Never sample on the dependent variable! –Example: if you are interested in studying factors that lead.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Near East Regional Workshop - Linking Population and Housing Censuses with Agricultural Censuses. Amman, Jordan, June 2012 Improving Efficiency.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
STATISTICS is about how to COLLECT, ORGANIZE,
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
Experimental Design 1 Section 1.3. Section 1.3 Objectives 2 Discuss how to design a statistical study Discuss data collection techniques Discuss how to.
1 Ratio estimation under SRS Assume Absence of nonsampling error SRS of size n from a pop of size N Ratio estimation is alternative to under SRS, uses.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
Chapter Twelve. Figure 12.1 Relationship of Sampling Design to the Previous Chapters and the Marketing Research Process Focus of This Chapter Relationship.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
1 Systematic Sampling (SYS) Up to now, we have only considered one design: SRS of size n from a population of size N New design: SYS DEFN: A 1-in-k systematic.
© 2009 Pearson Education, Inc publishing as Prentice Hall 12-1 Sampling: Design and Procedure Sampling Size.
Chapter Twelve. Defining some terms censusPopulation ElementsSample.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
Lohr 2.2 a) Unit 1 is included in samples 1 and 3.  1 is therefore 1/8 + 1/8 = 1/4 Unit 2 is included in samples 2 and 4.  2 is therefore 1/4 + 3/8 =
Sampling Methods. Probability Sampling Techniques Simple Random Sampling Cluster Sampling Stratified Sampling Systematic Sampling Copyright © 2012 Pearson.
7.1Sampling Methods 7.2Introduction to Sampling Distribution 7.0 Sampling and Sampling Distribution.
Chapter Eleven Sampling: Design and Procedures Copyright © 2010 Pearson Education, Inc
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
SAMPLING TECHNIQUES LECTURE - 2 GE 608 Experimental Methods and Analysis Oct 28, 2015 Muharrum 14, 1437.
Review HW: E1 A) Too high. Polltakers will never get in touch with people who are away from home between 9am and 5pm, eventually they will eventually be.
Rome, May 2014 Structural variables Weighting the Spanish annual subsample.
PEP-PMMA Training Session Sampling design Lima, Peru Abdelkrim Araar / Jean-Yves Duclos 9-10 June 2007.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Probability Sampling. Simple Random Sample (SRS) Stratified Random Sampling Cluster Sampling The only way to ensure a representative sample is to obtain.
Population vs. Sample. Population: a set which includes all measurements of interest to the researcher (The collection of all responses, measurements,
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
"Time is the coin of your life. It is the only coin you have, and only you can determine how it will be spent. Be careful lest you let other people spend.
1. 2 DRAWING SIMPLE RANDOM SAMPLING 1.Use random # table 2.Assign each element a # 3.Use random # table to select elements in a sample.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Chapter Eleven Sampling: Design and Procedures © 2007 Prentice Hall 11-1.
Statistics – Chapter 1 Data Collection
John Loucks St. Edward’s University . SLIDES . BY.
SAMPLE DESIGN.
Sampling: Design and Procedures
Sampling with unequal probabilities
Sampling: Theory and Methods
Slides by JOHN LOUCKS St. Edward’s University.
Sampling: Design and Procedures
Cluster Sampling STAT262.
Sampling Chapter 6.
Presentation transcript:

Ch 5: Equal probability cluster samples 4/19/2017 Cluster sampling DEFN: A cluster is a group of observation units (or “elements”) Stat 804

Cluster sample DEFN: A cluster sample is a probability sample in which a sampling unit is a cluster

Cluster sample – 2 1-stage cluster sampling Divide the population (of N elements) into NI clusters (of size Ni for cluster i) Cluster = group of elements An element belongs to 1 and only 1 cluster Sampling unit Cluster = group of elements = PSU = primary sampling unit Can use any design to select clusters (ST, PPS) Data collection Collect information on ALL elements in the cluster

1-stage CS ST Sample of 40 elements A block of cells is a cluster A block of cells is a stratum SU is a cluster Don’t sample from every cluster SU is an element (or OU) Sample from every stratum

Cluster vs. stratified sampling Cluster sample Divide N elements into NI clusters Cluster or PSU i has Ni elements Take a sample of nI clusters Stratified sampling N elements divided into H strata An element belongs to 1 and only 1 stratum Take a sample of n elements, consisting of nh elements from stratum h for each of the H strata

Cluster sample – 3 2-stage cluster sampling Process Select PSUs (stage 1) Select elements within each sampled PSU (stage 2) First stage sampling unit is a … PSU = primary sampling unit = cluster Second stage sampling unit is a … SSU = secondary sampling unit = element = OU Only collect data on the SSUs that were sampled from the cluster

1-stage vs. 2-stage cluster sampling 1-stage cluster sample (stop here) OR Stage 1 of 2-stage cluster sample (select PSUs) Stage 2 of 2-stage cluster sample (select SSUs w/in PSUs)

Why use cluster sampling? May not have a list of OUs for a frame, but a list of clusters may be available List of Lincoln phone numbers (= group of residents) is available, but a list of Lincoln residents is not available List of all NE primary and secondary schools (= group of students) is available, but a list of all students in NE schools is not available May be cheaper to conduct the study if OUs are clustered Occurs when cost of data collection increases with distance between elements Household surveys using in-person interviews (household = cluster of people) Field data collection (plot = cluster of plants, or animals)

Defining clusters due to frame limitations A cluster (or PSU) is a group of elements corresponding to a record (row) in the frame Example Population = employees in McDonald’s franchises Element = employee Frame = list of McDonald’s stores PSU = store = cluster of employees

Defining clusters to reduce travel costs A cluster (or PSU) is a group of nearby elements Example Population = all farms Element = farm Frame = list of sections (1 mi x 1 mi areas) in rural area PSU = section = cluster of farms

Cluster samples usually lead to less precise estimates Elements within clusters tend to be correlated due to exposure to similar conditions Members of a household Employees in a business Plants or soil within a field plot We are getting less information than if selected same number of unrelated elements Select sample of city blocks (clusters of households) Ask each household: Should city upgrade storm sewer system? PSU (city block) 1 No storm sewer  households will tend to say yes PSU (city block) 2 New development  households will tend to say no

Defining clusters for improved precision Define clusters for which within-cluster variation is high (rarely possible) Make each cluster as heterogeneous as possible Like making each cluster a mini-population that reflects variation in population Minimizes the amount of correlation among elements in the cluster Opposite of the approach to stratification Large variation among strata, homogeneous within strata Define clusters that are relatively small Extreme case is cluster = element Decreasing the number of correlated observations in the sample

Example for single-stage cluster sampling w/ equal prob (CSE1) Dorm has NI = 100 suites (clusters) Each suite has Ni = 4 students (4 elements in cluster i , i = 1, 2, … , NI) Note that there are Take SRS nI = 5 suites (clusters) Ask each student living in each of the 5 suites How many nights per week do you eat dinner in the dining hall? Will get observations from a sample of 20 students = 5 suites x 4 students/suite

Dorm example – 2 Stu-dent Suite 6 Suite 21 Suite 28 Suite 54 Suite 89 3 6 2 4 Total 20 14 19 21 10

Dorm example – 3 SRS of nI = 5 dorm rooms Data on each cluster (all students in dorm room) ti = total number of dining hall dinners for dorm room i t2 = 14 dining hall dinners for 4 students in dorm room 2 Estimated total number of dining hall nights for the dorm students HT estimator of total = pop size x sample mean (of cluster totals)

Notation Response variable for SSU j in PSU i yij e.g., age of j-th resident in household i e.g., whether or not dorm resident j in room i owns a computer

Cluster-level population parameters (for cluster i ) Cluster size = Cluster population total Note that we observe cluster population total (or mean or variance) for each sample cluster in 1-stage cluster sampling We will estimate cluster parameters in 2-stage cluster sampling Ni elements

Popuation 1-stage cluster sample

Data from cluster samples Work with element and cluster-level data Element data set will have columns for Cluster id Element id within cluster Variable (y) Will also summarize this data set to generate cluster parameters (1-stage) or estimates of cluster parameters (2-stage) Cluster total (or estimate) Cluster mean (or estimate) Cluster variance (or estimate)

1-stage cluster sample Element data Cluster summary i j yij 1 y11 2 y12 3 Y13 4 y14 y21 y22 y23 y31 … i ti 1 t1 2 t2 3 t3 …

CSE1 unbiased estimation under SI – total t Estimator for population total using data collected from a 1-stage cluster sample SI of clusters Estimator of variance of

Dorm example – 4 Estimated population total Estimated variance

Dorm example – 5 Inclusion probability for student j in dorm room i N = 100 dorm rooms n = 5 sample dorm rooms Take all 4 students in dorm room ij = nI / NI = 1/20 = 0.05