1 RTI International is a trade name of Research Triangle Institute 3040 Cornwallis Road P.O. Box 12194 Research Triangle Park, North Carolina, USA 27709.

Slides:



Advertisements
Similar presentations
High Resolution studies
Advertisements

Sampling: Theory and Methods
Multistage Sampling.
Multiple Indicator Cluster Surveys Survey Design Workshop
By: Saad Rais, Statistics Canada Zdenek Patak, Statistics Canada
The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,
Variance Estimation in Complex Surveys Third International Conference on Establishment Surveys Montreal, Quebec June 18-21, 2007 Presented by: Kirk Wolter,
Linearization Variance Estimators for Survey Data: Some Recent Work
NTTS conference, February 18 – New Developments in Nonresponse Adjustment Methods Fannie Cobben Statistics Netherlands Department of Methodology.
Page 1 Measuring Survey Quality through Representativity Indicators using Sample and Population based Information Chris Skinner, Natalie Shlomo, Barry.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
Addition Facts
Year 6 mental test 10 second questions Numbers and number system Numbers and the number system, fractions, decimals, proportion & probability.
Outline of talk The ONS surveys Why should we weight?
Multistage Sampling Module 3 Session 9.
1 Cluster Sampling Module 3 Session 8. 2 Purpose of the session To demonstrate how a cluster sample is selected in practice To demonstrate how parameters.
1 Session 10 Sampling Weights: an appreciation. 2 To provide you with an overview of the role of sampling weights in estimating population parameters.
SADC Course in Statistics Sampling weights: an appreciation (Sessions 19)
SADC Course in Statistics Sampling design using the Paddy game (Sessions 15&16)
SADC Course in Statistics General approaches to sample size determinations (Session 12)
STATISTICAL INFERENCE ABOUT MEANS AND PROPORTIONS WITH TWO POPULATIONS
Recent Improvements for Dual-Frame RDD Sampling Methodology 29 th Annual Meeting March 26, 2012.
Sampling techniques & sample size
Lectures 6&7: Variance Reduction Techniques
Data Imputation United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
What Is Cost Control? 1 Controlling Foodservice Costs OH 1-1.
Survey sampling Sampling & non-sampling error Bias
Labour Force Historical Review Sandra Keys, University of Waterloo DLI OntarioTraining University of Guelph, Guelph, ON April 12, 2006.
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 2 Planning the Project.
AADAPT Workshop South Asia Goa, December 17-21, 2009 Kristen Himelein 1.
Addition 1’s to 20.
Week 1.
Nonresponse Bias Correction in Telephone Surveys Using Census Geocoding: An Evaluation of Error Properties Paul Biemer RTI International and University.
©2006 Prentice Hall Business Publishing, Auditing 11/e, Arens/Beasley/Elder Audit Sampling for Tests of Controls and Substantive Tests of Transactions.
Weighting Methodology for the Private Landlords Survey Robert Bucknall, ONS.
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
GLOBAL TOBACCO SURVEILLANCE SYSTEM Global Youth Tobacco Survey Training Workshop Introduction to the GYTS Sample Design & Weights.
Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Sampling with unequal probabilities STAT262. Introduction In the sampling schemes we studied – SRS: take an SRS from all the units in a population – Stratified.
Multiple Indicator Cluster Surveys Survey Design Workshop
Formalizing the Concepts: STRATIFICATION. These objectives are often contradictory in practice Sampling weights need to be used to analyze the data Sampling.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
Sample Design.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
RTI International is a trade name of Research Triangle Institute 3040 Cornwallis Road ■ P.O. Box ■ Research Triangle Park, North Carolina, USA
Complexities of Complex Survey Design Analysis. Why worry about this? Many government studies use these designs – CDC National Health Interview Survey.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Near East Regional Workshop - Linking Population and Housing Censuses with Agricultural Censuses. Amman, Jordan, June 2012 Improving Efficiency.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Central egency for public mobilization and statistics.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.
Lecture 9 Prof. Development and Research Lecturer: R. Milyankova
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
SAMPLING TECHNIQUES AND METHODS ‘CHAR’ FMCB SEMINAR PRESENTER: DR KAYODE. A. ONAWOLA 03/07/2013.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
RTI International is a trade name of Research Triangle Institute Sampling U.S. Schools – Frame, Sampling & Field Issues International Conference on Establishment.
A Comparison of Variance Estimates for Schools and Students Using Taylor Series and Replicate Weighting Ellen Scheib, Peter H. Siegel, and James R. Chromy.
PEP-PMMA Training Session Sampling design Lima, Peru Abdelkrim Araar / Jean-Yves Duclos 9-10 June 2007.
Basic Business Statistics
CASE STUDY: NATIONAL SURVEY OF FAMILY GROWTH Karen E. Davis National Center for Health Statistics Coordinating Center for Health Information and Service.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Sampling and Power Slides by Jishnu Das.
Presentation transcript:

1 RTI International is a trade name of Research Triangle Institute 3040 Cornwallis Road P.O. Box Research Triangle Park, North Carolina, USA Applying Statistical Methodologies Originally Developed for Household Surveys to the Design and Analysis of Establishment Studies Angela Pitts, Marcus Berzofsky, and Michael Witt 6/19/07

2 Topics of Discussion Using Multistage Sampling Techniques Using Area Probability Sampling Techniques Selecting Units Proportional to a Composite Size Measure Selecting Units by Sequential PPS w/ Minimal Replacement Computing Weight Adjustments Using a Model-Based Approach Large Table Production Tasks

3 Multistage Sampling Refers to the process of sampling within a previous selected sample Used to decrease data collection cost Used when a sampling frame is not available Often one would stratify within stage depending on known information

4 Multistage Sampling (continued) Establishment study examples: O*NET Establishment is PSU; sampling unit is employee National survey of employees within occupations Selected establishments, then occupations, and then employees School-based studies National School Radon Survey – selected districts, then schools, and then classes Educational Longitudinal Study (ELS) – select schools and then students

5 Area Probability Sampling Closely related to multistage sampling Refers to the multistage design process of selecting geographic areas at the early stages of design Used to minimize data collection costs and address incomplete sampling frames May be useful for non-response follow-up or response-analysis studies Sometimes clustering can decrease the precision of estimates

6 Area Probability Sampling (continued) Establishment study examples: National Study of Assisted Living for the Frail Elderly Estab is SSU; sampling unit is resident Select geographic areas Used outside sources to create a list of assisted living facilities in each area – suitable sampling frame did not exist Select facilities then residents

7 Area Probability Sampling (continued) Establishment study examples (continued): National Postsecondary Student Aid Study (NPSAS pre-1994) Estab is SSU; sampling unit is student Select 3-digit zip code areas, then higher education institutions within areas, and then students Cash Payment Study (currently being designed) Estab is SSU; sampling unit is establishment Considering oversampling establishments in a sample of geographic areas for in-person non-response follow-up

8 Selecting Units Proportional to CSM Common size measure for establishments is the number of employees, however this may not be suitable for all target population(s) of interest Composite size measure accounts for varying subsampling fractions across subdomains (Folsom et al, 1980) In a sense, the composite size measure represents the value of a PSU relative to other PSUs on the frame and relative to the desired ultimate sample

9 Selecting Units Proportional to CSM (continued) Benefits of CSM methodology: Can help equalize between PSU workload (cost efficiency) Can help equalize final probabilities of selection within domains (variance reduction)

10 Selecting Units Proportional to CSM (continued) For example, suppose subdomain is part-time/full-time workers Let N i =100 and N pi =70 and N fi =30 Let f p = n p /N p = 0.25 for all i Let f f = n f /N f = 0.80 for all i S i = f p * N pi + f f * N fi = 41.5 Note that a commonly used size measure would be 100 (total number of employees in establishment)

11 Selecting Units Proportional to CSM (continued) Establishment study examples: O*NET Subdomain is occupation within establishment Could consider using this for hospital studies Subdomain could be medical personnel in different specialties within hospital

12 Selecting Units by Sequential PPS w/ Min. Replacement Application of sequential, with replacement sampling to the PPS environment, developed by Dr. Chromy (1979) Each PSU is guaranteed to be selected within 1 of its expected value (m*S i /S + ) If expected value is less than 1, then PSU i would be selected either 0 or 1 times If expected value is greater than 1, for example 3.2, then PSU i would be selected 3 or 4 times

13 Selecting Units by Sequential PPS w/ Min. Replacement (continued) Sorting of the file prior to selection can yield variance reduction and/or better control over domain sample sizes via the implicit stratification SAS has an option for Chromys method in their survey select procedures

14 Selecting Units by Sequential PPS w/ Min. Replacement (continued) Establishment study examples: National Inmate Survey Establishment is a PSU; sampling unit is inmate Survey of correctional facilities Size measure was number of inmates in facility Implicitly stratified by region and state to ensure that at least one facility was selected in each state O*NET Composite size measure used based on occupations within establishment Implicitly stratified by industry grouping and establishment size

15 Weight Adjustments Use model-based approach, such as the generalized exponential model (GEM), developed by RTI for household surveys (Folsom and Singh, 2000) Can be used with nonresponse, post-stratification, and extreme weight adjustments Allows for a larger number of statistically significant main effects and lower-order interaction terms in the adjustment compared to a weighting class adjustment Potential to decrease bias

16 Weight Adjustments (continued) Preset bounds on resulting adjustment can be applied – this minimizes unequal weighting thereby increasing precision Maintains marginal control totals unlike logistic regression (propensity scores) May need to produce multiple weights to perform analyses on different units

17 Weight Adjustments (continued) Establishment study examples: Cyanide Survey Estab is PSU; sampling unit is advanced life support providers Survey of ALS providers regarding emergency services related to cyanide Used GEM for both nonresponse and post-stratification adjustments NIOSH Fire Fighter Fatality Investigation and Prevention Program Evaluation Estab is PSU; sampling unit is also PSU Survey of fire departments Used GEM for nonresponse adjustments

18 Table Production Technique developed for HH surveys where a large number of tables were required Uses a collection of SAS and Visual Basic programs to create analysis tables Programs produce the analysis data and populate the tables Tables can be of varying sizes and formats Tables are formatted and printer-ready

19 Table Production (continued) Establishment study examples: National Inmate Study Generated response rate tables, analysis data tables, and nonresponse tables Cash Payments Study Establishment survey designed to collect information on the use of cash versus credit/debit cards, etc. Generated response rate tables, analysis data tables, and nonresponse tables

20 Conclusion HH methods shown have not been commonly used in establishment studies In some instances, these methods are beneficial for establishment studies When list frame not available When only a portion of the establishment is the domain of interest When data collection is done in the field (as opposed to telephone)