EVAULATION OF THE NSCRG SCHOOL SAMPLE Donsig Jang and Xiaojing Lin Third International Conference on Establishment Surveys Montreal, Canada, June 21, 2007.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis.
Presenter Name(s) Issue date National Student.
House Committee on Workforce and Technical Skills February 20, 2001.
Sampling.
Sampling A population is the total collection of units or elements you want to analyze. Whether the units you are talking about are residents of Nebraska,
Preliminary Results from the 2008 Oklahoma Health Care Insurance and Access Survey Presentation to the Oklahoma Health Care Authority Board November 13,
Roberta Spalter-Roth, Ph.D Director of Research American Sociological Association Enhancing Diversity in Science: Working Together to Develop Common Data,
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
Redesigning National School Surveys Coverage and Stratification Improvement using Multiple Datasets March 2014 FEDCASIC 2014 Prepared for:
GLOBAL TOBACCO SURVEILLANCE SYSTEM Global Youth Tobacco Survey Training Workshop Introduction to the GYTS Sample Design & Weights.
Chapter 8 Estimation: Additional Topics
Survey of Earned Doctorates National Science Foundation Division of Science Resources Statistics Mark Fiegener, Ph.D. Presentation to Clemson University.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
Texas Demographic Characteristics and Trends Texas Association of Mutual Insurance Companies October 7, 2010 Round Rock, TX 1.
11 Populations and Samples.
Sampling Designs and Techniques
Sampling Methods.
Marketing Research Bangor Transfer Abroad Programme Week Three Review.
Lecture 30 sampling and field work
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (13-15 May 2008) Sample results expected accuracy in the Italian Population and Housing.
Mental Health Survey 2015: Webinar 14 th January 2015.
Eurostat Repeated surveys. Presented by Eva Elvers Statistics Sweden.
Sample Resume Heading Your name Your address (can be a post office box) Your contact phone number (s) (it is a good idea to include your cell phone number.
RTI International is a trade name of Research Triangle Institute 3040 Cornwallis Road ■ P.O. Box ■ Research Triangle Park, North Carolina, USA
New York State’s Labor Force Drivers Presented by Kevin Jack, Statewide Labor Market Analyst August 2008.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Addressed Based Sampling as an Alternative to Traditional Sampling Approaches: An Exploration May 6, 2013.
Data to Action: Results and Next Steps for the Healthy Kids Colorado Survey (HKCS) HKCS Advisory Committee Overview October 2014.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Chapter 7 Estimates and Sample Sizes
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Creating An Account Returning users login Forgot your username and/or password? Click to create an account.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Lesli Scott Ashley Bowers Sue Ellen Hansen Robin Tepper Jacob Survey Research Center, University of Michigan Third International Conference on Establishment.
DRAFT - not for publication Nonresponse Bias Analysis in a Survey of Banks Carl Ramirez U.S. Government Accountability Office
Sampling strategy for the dual-system correction of the under-coverage in the Register Supported 2011 Italian Population Census Loredana Di Consiglio,
EvergreenEcon.com ESA 2011 Impact Evaluation Research Plan Public Workshop #1 February 20, 2013 Presented By: Steve Grover, President.
Brown, Suter, and Churchill Basic Marketing Research (8 th Edition) © 2014 CENGAGE Learning Basic Marketing Research Customer Insights and Managerial Action.
Panel Study of Entrepreneurial Dynamics Richard Curtin University of Michigan.
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
October 28-30, 2009 UNECE Geneva Quality Assessment of 2008 Integrated Census - Israel Pnina ZADKA Central Bureau of Statistics Israel.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Assessment of Misclassification Error in Stratification Due to Incomplete Frame Information Donsig Jang, Xiaojing Lin, Amang Sukasih Mathematica Policy.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Case Control Study Dr. Ashry Gad Mohamed MB, ChB, MPH, Dr.P.H. Prof. Of Epidemiology.
Understanding Sampling
Evaluating Ongoing Programs: A Chronological Perspective to Include Performance Measurement Summarized from Berk & Rossi’s Thinking About Program Evaluation,
Building Wave Response Rates in a Longitudinal Survey: Essential for Nonsampling Error Reduction or Last In - First Out? Steven B. Cohen Fred Rohde and.
RTI International is a trade name of Research Triangle Institute 6110 Executive Blvd. ■ Suite 902 ■ Rockville, Maryland, USA Phone
May 12-15, Evaluating the Integrated Census Israel Pnina ZADKA Central Bureau of Statistics Israel.
Building a database for children with disabilities using administrative data and surveys Adele D. Furrie September 27, 2011.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
The Use of Random Digit Dialing in Household Surveys: Challenges and Changes Chris Chapman 2008 IES Research Conference Washington, DC June 11, 2008
Statistics Canada Citizenship and Immigration Canada Methodological issues.
Onsite Quarterly Meeting SIPP PIPs June 13, 2012 Presenter: Christy Hormann, LMSW, CPHQ Project Leader-PIP Team.
Medical Eligibility Verifications 1. Medical Eligibility: Verifications Introduction After completing this course, you will be able to: Recognize shared.
CASE STUDY: NATIONAL SURVEY OF FAMILY GROWTH Karen E. Davis National Center for Health Statistics Coordinating Center for Health Information and Service.
1 Optimal Number of Replicates for Variance Estimation Mansour Fahimi, Darryl Creel, Peter Siegel, Matt Westlake, Ruby Johnson, and Jim Chromy Third International.
CEM – 599 GRADUATE SEMINAR 1 CONSTRUCTION ENGINEERING & MANAGEMENT DEPT. CEM 599 RESEARCH METHODS IN CONSTRUCTION RESEARCH METHODS IN CONSTRUCTIONBY RIZWAN.
BHS Methods in Behavioral Sciences I May 9, 2003 Chapter 6 and 7 (Ray) Control: The Keystone of the Experimental Method.
1 ERCOT COPS Round 2 Sample Design Review April 10, 2007.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Sampling Concepts Nursing Research. Population  Population the group you are ultimately interested in knowing more about “entire aggregation of cases.
Eurostat Overview of the project Meeting of the Expert Group on the integration of the European social surveys January 2015.
IMPACT EVALUATION PBAF 526 Class 5, October 31, 2011.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section B 1.
Sampling: Theory and Methods
Effect of Panel Length and Following Rules on Cross-Sectional Estimates of Income Distribution: Empirical Evidence from FI-SILC Marjo Pyy-Martikainen Workshop.
Presentation transcript:

EVAULATION OF THE NSCRG SCHOOL SAMPLE Donsig Jang and Xiaojing Lin Third International Conference on Establishment Surveys Montreal, Canada, June 21, 2007 Donsig Jang and Xiaojing Lin Third International Conference on Establishment Surveys Montreal, Canada, June 21, 2007

Outline Sampling options on repeated establishment surveys Reasons to keep the same sample in establishment surveys Issues in keeping the same sample Example: NSRCG school sample Summary Recommendation for 2008 NSRCG School Sample Sampling options on repeated establishment surveys Reasons to keep the same sample in establishment surveys Issues in keeping the same sample Example: NSRCG school sample Summary Recommendation for 2008 NSRCG School Sample

Sampling options on repeated establishment surveys Keep the same sample over time with supplemental samples for births –Efficient change estimates BUT –Response burden –Inefficient cross-sectional estimates An independent sample in each survey round Sample coordination to maximize overlaps between samples –Rotation samples (Sigman and Monsour 1995) –Permanent random number technique (Ohlsson 1995, 2001) –Keyfitz procedure (Keyfitz 1951) Keep the same sample over time with supplemental samples for births –Efficient change estimates BUT –Response burden –Inefficient cross-sectional estimates An independent sample in each survey round Sample coordination to maximize overlaps between samples –Rotation samples (Sigman and Monsour 1995) –Permanent random number technique (Ohlsson 1995, 2001) –Keyfitz procedure (Keyfitz 1951)

Reasons to keep the same sample in establishment surveys Difficulty in identifying point of contact Costly efforts in gaining participation Often requires nontrivial process to gather information – previous survey participation would help Difficulty in identifying point of contact Costly efforts in gaining participation Often requires nontrivial process to gather information – previous survey participation would help

Issues in keeping the same sample Can they be a representative sample of the current cross-sectional population? –Depending on how dynamic the population is over time coverage issues: births vs. deaths sample efficiency: distributional changes Alternatives –Independent sample from the most up-to-date sample frame –Coordination of samples E.g., Keyfitz procedure to maximize the sample overlap between the current and the previous ones Can they be a representative sample of the current cross-sectional population? –Depending on how dynamic the population is over time coverage issues: births vs. deaths sample efficiency: distributional changes Alternatives –Independent sample from the most up-to-date sample frame –Coordination of samples E.g., Keyfitz procedure to maximize the sample overlap between the current and the previous ones

National Survey of Recent College Graduates (NSRCG) Repeated every two or three years Collects education, demographic, and employment information from recent college graduates (bachelors and masters) majoring in science,engineering, and health fields Two stage sample design –1 st stage: select schools and obtain the list of graduates from selected schools –2 nd stage: select graduates from the list provided by schools NSF-sponsored survey Repeated every two or three years Collects education, demographic, and employment information from recent college graduates (bachelors and masters) majoring in science,engineering, and health fields Two stage sample design –1 st stage: select schools and obtain the list of graduates from selected schools –2 nd stage: select graduates from the list provided by schools NSF-sponsored survey

NSRCG List collection from schools Identify point of contact (usually institutional coordinator) Gather the list of graduates with key sampling and locating information including: –degree award dates –degree level –field of major –race/ethnicity –gender –date of birth –SSN –student ID –mailing addresses including parents addresses –phone numbers (land line, cell) – s, etc. Identify point of contact (usually institutional coordinator) Gather the list of graduates with key sampling and locating information including: –degree award dates –degree level –field of major –race/ethnicity –gender –date of birth –SSN –student ID –mailing addresses including parents addresses –phone numbers (land line, cell) – s, etc.

NSRCG List collection from schools (continued) Need a good understanding on the information requested and file format Time consuming and costly efforts –different schools have different issues A crucial part for the quality of the survey –strive to get almost perfect cooperation rate (99%) –Out of 300 schools, only four final refusals in 2003 only five refusals in 2006 Need a good understanding on the information requested and file format Time consuming and costly efforts –different schools have different issues A crucial part for the quality of the survey –strive to get almost perfect cooperation rate (99%) –Out of 300 schools, only four final refusals in 2003 only five refusals in 2006

NSRCG School sample selection For 1995, 1997, 1999, 2001 surveys –275 schools initially selected in 1995 and kept with 5 supplemental samples added over three survey rounds (to account for frame coverage) A new sample of 300 schools selected in 2003: –To reflect rapid changes of S&E populations in 1990s –Health field added to the survey as eligible field of study For 1995, 1997, 1999, 2001 surveys –275 schools initially selected in 1995 and kept with 5 supplemental samples added over three survey rounds (to account for frame coverage) A new sample of 300 schools selected in 2003: –To reflect rapid changes of S&E populations in 1990s –Health field added to the survey as eligible field of study

NSRCG School sample selection (continued) Probability proportional size (PPS) with composite size measure Composite size measures calculated to achieve equal weights within each of NSRCG analytic domains constructed by a combination of: –degree year, degree level, field of majors, race/ethnicity, and gender Population dynamics –new schools (birth), closed (death), no S&E graduates (temporarily ineligible), etc Coverage issue –distributions of schools changed (in terms of composite size measures) potential factor affecting the sample efficiency Probability proportional size (PPS) with composite size measure Composite size measures calculated to achieve equal weights within each of NSRCG analytic domains constructed by a combination of: –degree year, degree level, field of majors, race/ethnicity, and gender Population dynamics –new schools (birth), closed (death), no S&E graduates (temporarily ineligible), etc Coverage issue –distributions of schools changed (in terms of composite size measures) potential factor affecting the sample efficiency

2003 NSRCG school sample In both 2001 and 2003 NSRCG 170 (57%) Only in 2003 NSRCG 130 (43%) Total300 Excessive efforts (time and resources) to achieve 99% of RR (4 schools refused)

Distribution of list submission dates in 2003 NSRCG Days

School sample after 2003 NSRCG – 2006 NSRCG Frame evaluation 2003 Frame based on AY2001 IPEDS counts 2006 Frame based on AY2003 and AY2004 IPEDS counts

Graduate counts dropped from and added to the population

2006 NSRCG School Sample No significant change of the population –Kept the same school sample without any supplemental sample No significant change of the population –Kept the same school sample without any supplemental sample

Distribution of list submission dates in 2006 NSRCG Days

2008 NSRCG ? Evaluate the current sampling strategy (keeping the same sample) by doing –frame evaluation –comparisons with other sampling schemes Independent PPS Keyfitz procedure Evaluate the current sampling strategy (keeping the same sample) by doing –frame evaluation –comparisons with other sampling schemes Independent PPS Keyfitz procedure

2008 NSRCG Frame evaluation 2003 Frame based on AY2001 IPEDS counts 2008 Frame based on AY2006 IPEDS counts

Graduate counts dropped from and added to the population

Sample Evaluation Three sample selection methods considered –Keep the 2003 school sample with a supplemental sample of size 4 –Independent PPS with composite size measures based on updated frame information –Keyfitz procedure Three sample selection methods considered –Keep the 2003 school sample with a supplemental sample of size 4 –Independent PPS with composite size measures based on updated frame information –Keyfitz procedure

PPS sample selection procedure Define Size Measure: where m d is a sample size of domain d, M d is the population size of domain d M id is the population size of domain d in school i domain d is constructed from a combination of: graduate year, degree level, field of major, race/ethnicity, and gender

PPS sample selection procedure School i selected with probability (p i ) proportional to size S i Achieve equal weight within each domain d Distributional changes of the NSRCG graduate populations would cause unequal weight variations within domains Independent PPS with up-to-date frame data is desirable if weight variation is severe School i selected with probability (p i ) proportional to size S i Achieve equal weight within each domain d Distributional changes of the NSRCG graduate populations would cause unequal weight variations within domains Independent PPS with up-to-date frame data is desirable if weight variation is severe

Keyfitz procedure Maximize the overlap between two samples The first sample (2003 NSRCG) was selected with PPS The second sample inclusion probability is dependent upon: –updated size measures –the first sample inclusion probability –the actual sample realization in the first sample Maximize the overlap between two samples The first sample (2003 NSRCG) was selected with PPS The second sample inclusion probability is dependent upon: –updated size measures –the first sample inclusion probability –the actual sample realization in the first sample

Simulation of sampling procedures Generate 1000 school independent samples for each of the following options –Keep the same school sample with a supplemental sample of size 4 from the newly eligible schools (births) –Independent PPS sampling using MOS calculated from 2008 NSRCG frame –Keyfitz procedure Generate 1000 school independent samples for each of the following options –Keep the same school sample with a supplemental sample of size 4 from the newly eligible schools (births) –Independent PPS sampling using MOS calculated from 2008 NSRCG frame –Keyfitz procedure

Summary Keeping the same sample is a cost effective option Concern about statistical inefficiency due to the nature of dynamic population Frame coverage corrected by supplemental sample Evaluate the NSRCG school sample –Empirical frame evaluation –Samples simulated based on two methods Distribution changes (in terms of composite size measure) would make the final sample inefficient: –Weight variation within planned domains –Over or under estimation of graduates in some domains Keeping the same sample is a cost effective option Concern about statistical inefficiency due to the nature of dynamic population Frame coverage corrected by supplemental sample Evaluate the NSRCG school sample –Empirical frame evaluation –Samples simulated based on two methods Distribution changes (in terms of composite size measure) would make the final sample inefficient: –Weight variation within planned domains –Over or under estimation of graduates in some domains

Recommendation Keep the same school sample with supplemental sample of size 4 for 2008 NSRCG