INFO 7470/ECON 7400/ILRLE 7400 Universes, Populations, Frames, and Sampling John M. Abowd and Lars Vilhuber February 4, 2013.



Outline
A little more statistical infrastructure
What is a census?
Basic definitions
Demographic sampling frames
Economic sampling frames
Coverage and un-duplication
Basic relations connecting frames
2/4/2013 © John M. Abowd and Lars Vilhuber 2013, all rights reserved

Principal Statistical Agencies
Office of Management and Budget
Bureau of Economic Analysis (Commerce)
Bureau of Justice Statistics (Justice)
Bureau of Labor Statistics (Labor)
Bureau of Transportation Statistics (Transportation)
Census Bureau (Commerce)
Economic Research Service (Agriculture)
Energy Information Administration (Energy)
Environmental Protection Agency (Independent)
Internal Revenue Service, Statistics of Income (Treasury)
National Agricultural Statistics Service (Agriculture)
National Center for Education Statistics (Education)
National Center for Health Statistics (Centers for Disease Control and Prevention, HHS)
National Science Foundation, Science Resources Statistics (Independent)
Social Security Administration, Office of Policy (Independent)

Role of OMB
The Office of Management and Budget oversees the regulatory and budgeting environment
Home of the Chief Statistician of the United States (currently Katherine Wallman)
Budgets for all of the statistical agencies are reviewed by OMB
Regulations for reporting of Race, Ethnicity, Industry, Geography

Census
An attempt to enumerate every entity in a specified universe (target population)
“Perhaps the earliest type of survey is the census, generally conducted by governments. Censuses are systematic efforts to count an entire population, often for purposes of taxation or political representation.” (Groves et al., Survey Methodology, 2004, pp. 3-4)

Legal v. Statistical Concepts
In the United States, the decennial census of population must be conducted as an “enumeration”
Department of Commerce v. U.S. House of Representatives (1999) defined enumeration so that “The Census Act prohibits the proposed uses of statistical sampling to determine the population for congressional apportionment purposes.”

Legal v. Statistical Concepts
But Utah v. Evans (2002) allowed the use of “hot-deck imputation” to allocate individuals to vacant domiciles deemed inhabited on April 1, 2000
“Indeed, the Bureau's imputation method is similar in principle to other efforts used since 1800 to determine the number of missing persons, including asking heads of households, neighbors, landlords, postal workers, or other proxies about the number of inhabitants in a particular place.”

Science of the Controversy
Assessing the accuracy of a census requires supplemental information
The supplemental information is usually collected in the form of a coverage assessment survey
Many statistical methods are used to compare the original census to the post-enumeration survey
The assumptions inherent in these analyses are often difficult to test
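One widely used way to compare a census with a post-enumeration survey (PES) is dual-system (capture-recapture) estimation. The slides do not name a specific method, so the sketch below is only an illustration; the function name and all counts are hypothetical. It assumes that inclusion in the census and inclusion in the PES are independent, which is exactly the kind of assumption that is difficult to test.

```python
def dual_system_estimate(census_count, pes_count, matched_count):
    """Lincoln-Petersen style estimate of the true population size.

    census_count:  records enumerated by the census in the study area
    pes_count:     records found by the post-enumeration survey
    matched_count: records found by both (after record matching)
    """
    if matched_count <= 0:
        raise ValueError("at least one matched record is required")
    return census_count * pes_count / matched_count


# Hypothetical area: 900 census records, 100 PES records, 90 matched.
n_hat = dual_system_estimate(900, 100, 90)

# Estimated share of the population that the census actually covered.
coverage_rate = 900 / n_hat
```

If the independence assumption fails (for example, people who are hard to count in the census are also hard to find in the PES), the estimate understates the true population.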

Survey
“… systematic method for gathering information from (a sample of) entities for the purposes of constructing quantitative descriptors of the attributes of the larger population of which the entities are members” (Groves et al., 2004, p. 2)
Sampling is optional in general, but in this course we will use “sample survey” and “survey” interchangeably, reserving “census” for any enumeration activity
But, entities covered by the survey represent entities in the population

Censuses v. Surveys
Censuses:
– Decennial Census of Population and Housing (short form until 2010)
– The Economic Censuses (years ending in 2 and 7)
– Quarterly Census of Employment and Wages
Surveys:
– Long form (through 2000)
– American Community Survey (since 2005)
– Annual Survey of Manufactures, Monthly Survey of Construction
– Current Employment Statistics

Administrative Record
Any information concerning specific entities in a designated population that is collected by a governmental agency for the purposes of enforcing a specific law
The legal distinction between administrative record and statistical information systems was made in the Confidential Information Protection and Statistical Efficiency Act of 2002 (CIPSEA)

CIPSEA
For the Census Bureau, these protections are part of Title 13
Many other agencies (e.g., BLS) had no statutory statistical agency status until CIPSEA
Statistical use of administrative records under CIPSEA prohibits re-use of the statistically-enhanced records for the legal enforcement activities even when the statistical and administrative functions are performed inside the same agency
Sharing of some administrative records for statistical purposes is explicitly authorized among agencies previously not authorized to do so (Census, BEA, BLS, and IRS)
Other sharing may occur when it is not explicitly prohibited
Dense legal forest that we will hike later

Administrative v. Statistical Uses
Administrative uses:
– Tax information reporting to the tax collecting agency from a specific taxable entity (IRS)
– Determination of Unemployment Insurance benefits and eligibility from employer wage reports (state UI offices)
Statistical uses:
– Tables of household Adjusted Gross Income by geography and size (Statistics of Income Division)
– Quarterly job creations and destructions based on establishment reports (BLS-BED)
– Quarterly job creations and destructions based on job-level reports (Census-QWI)

Basic Definitions
Universe (target population)
Out-of-scope population
Frame population
Geography
Population demographics
Business entity demographics
Government entities

Universe = Target Population
Theoretical construct specifying every entity that satisfies a set of explicit qualifying conditions
In probability models describing the statistical process of estimation (either finite-population or super-population methods), the universe is the event that occurs with probability 1

Target Population
The “target population” consists of every entity satisfying the set of conditions that specify the universe. “Universe” and “Target Population” are synonyms
Example 1: “Human population”: all people, male and female, child and adult, living in a given geographic area at a particular date
Example 2: “Establishment population”: every business, industrial, service or governmental unit at a single location that distributes goods or performs services on a particular date (or during a given period)

Out-of-scope Population
An entity that either is outside the geographic region under study or fails to meet another specific restriction imposed on the target population
Example 1: When the in-scope population is “persons age 16 or over living in households,” persons age 15 or younger and all persons living in group quarters are out of scope
Example 2: When the in-scope population is “employer establishments,” all establishments with zero employees are out of scope
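Example 1 above can be written as a scope predicate. This is a minimal sketch; the `Person` record and the coding of `living_quarters` are hypothetical and are not taken from any Census Bureau file layout.

```python
from dataclasses import dataclass


@dataclass
class Person:
    age: int
    living_quarters: str  # hypothetical coding: "household" or "group_quarters"


def in_scope(person):
    """In-scope population: persons age 16 or over living in households."""
    return person.age >= 16 and person.living_quarters == "household"


people = [
    Person(34, "household"),
    Person(15, "household"),       # out of scope: under 16
    Person(20, "group_quarters"),  # out of scope: group quarters
    Person(16, "household"),
]

target = [p for p in people if in_scope(p)]
out_of_scope = [p for p in people if not in_scope(p)]
```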

Frame Population
Set of target population, or universe, entities that can be selected into a sample or census
Also called a sampling frame
The frame population or sampling frame is the physical manifestation of the universe: if an entity is not on the frame (or one of the frames for multi-frame sampling), then it cannot be in the census or survey
Simple cases (single frame sampling): a list of all addresses to be sampled; list of all people to be sampled; list of all businesses to be sampled
Complex sample designs use multiple frame populations to get better coverage of the target population or universe
Complex frame example: CPS or SIPP

Complex Housing Frame Example
Survey of Income and Program Participation (SIPP)
– Similar complex frame used for CPS
Multi-stage sample
Primary Sampling Units (PSUs) are geographic areas
Within PSUs five distinct frames are used
SIPP Sample Design

SIPP Details I
5 Frame populations: Unit, Area, Group Quarters, Coverage Improvement, and New Construction.
Unit, area, and group quarters frames are based on census counts from the most recent decennial census of population. These account for 90% of the sample addresses
– In the unit and group quarters frames, the clusters contain only a single housing unit or housing unit equivalent
– Within each of these frames, clusters of housing units are selected
– In the area frame, the cluster contains four “expected” housing units
The coverage improvement frame includes housing units missing from the most recent decennial census but found in the post-enumeration surveys

SIPP Details II
New construction frame provides coverage of structures for which building permits have been issued since the last census
– Updated annually, and sampled with increasing percentage as the time since the last census increases
– In the new construction frame, half the clusters have four expected housing units and half have eight expected units
Statistical analysis is used to estimate the probabilities that households or individuals in the target population will be found in a particular frame (source of the design weight)
The unit frame, which is based on the most recent Census, covers about 80% of the target population in the SIPP by these estimates
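The link between selection probabilities and the design weight can be sketched as follows. This is a generic inverse-probability (Horvitz-Thompson style) illustration, not the SIPP weighting procedure; the frame labels, probabilities, and reported values are all hypothetical, and the overall inclusion probability of each sampled unit is taken as given.

```python
def design_weight(inclusion_probability):
    """Design weight = 1 / probability that the unit enters the sample."""
    if not 0 < inclusion_probability <= 1:
        raise ValueError("inclusion probability must be in (0, 1]")
    return 1.0 / inclusion_probability


# (frame, inclusion probability, reported number of household members)
sample = [
    ("unit", 0.01, 2),
    ("unit", 0.01, 3),
    ("area", 0.005, 4),
    ("new_construction", 0.02, 1),
]

# Each sampled household "stands in" for 1/p households in the population.
estimated_total_persons = sum(design_weight(p) * y for _, p, y in sample)
```

A unit with a small inclusion probability receives a large weight, so errors in the estimated frame probabilities feed directly into the estimates.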

Geography
The geography of a sampling frame assigns to every latitude and longitude a fundamental geographic area
Geographic entities can be assembled uniquely by aggregating geographic areas
The basic geographic entity for the U.S. Census is the “block”
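Aggregation from blocks to larger entities can be sketched using the 15-character census block identifier, whose prefixes identify the state (2 characters), county (5), tract (11), and block group (12). The block codes and counts below are hypothetical, and the helper name `aggregate` is illustrative only.

```python
from collections import defaultdict

# Number of leading GEOID characters that identify each level.
LEVEL_PREFIX = {"state": 2, "county": 5, "tract": 11, "block_group": 12, "block": 15}


def aggregate(block_counts, level):
    """Sum block-level counts up to the requested geographic level."""
    width = LEVEL_PREFIX[level]
    totals = defaultdict(int)
    for geoid, count in block_counts.items():
        totals[geoid[:width]] += count
    return dict(totals)


# Hypothetical block-level population counts.
blocks = {
    "361090001001000": 40,
    "361090001001001": 25,
    "361090002002000": 60,
    "360010001001000": 10,
}

by_tract = aggregate(blocks, "tract")
by_county = aggregate(blocks, "county")
```

Because every block belongs to exactly one tract, county, and state, each higher-level total is obtained without double counting.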

Standard Hierarchy of Census Geography Entities (figure)

Hierarchy of American Indian, Alaska Native, and Native Hawaiian Areas (figure)

Entities
In sampling frame development every geographic location (latitude and longitude) that contains a structure (natural or man-made) capable of originating economic activity is classified as a domicile, business, or both
Entities are placed in the frame by declaring the target population to be human beings (all domiciles including group quarters), economic (all businesses and service organizations), and government
Notice that both for-profit and not-for-profit business activity are covered in the business entity scope

Population Demographics
Human populations are usually categorized by current living quarters when designing demographic sampling frames
Distinguish between household living quarters and group living quarters
Frames based on landline, mobile telephone or Internet service provider are not domicile-based

Household Living Quarter Definitions
Housing unit: a house, an apartment, a mobile home or trailer, a group of rooms, or a single room occupied as separate living quarters, or if vacant, intended for occupancy as separate living quarters
– Separate living quarters are those in which the occupants live separately from any other individuals in the building and which have direct access from outside the building or through a common hall
– For vacant units, the criteria of separateness and direct access are applied to the intended occupants whenever possible
Household: A household includes all the people who occupy a housing unit as their usual place of residence

Group Living Quarters
Group quarters: all people not living in households
There are two types of group quarters:
– institutional (for example, correctional facilities, nursing homes, and mental hospitals)
– non-institutional (for example, college dormitories, military barracks, group homes, missions, and shelters)

Group Quarters Population Specifics
Includes all people not living in households
Includes those people residing in group quarters as of the date on which a particular survey was conducted
Two general categories
– the institutionalized population, which includes people under formally authorized supervised care or custody in institutions at the time of enumeration (such as correctional institutions, nursing homes, and juvenile institutions)
– the non-institutionalized population, which includes all people who live in group quarters other than institutions (such as college dormitories, military quarters, and group homes)


Business Entity Demographics
Business entities have “establishments” as their basic unit
Establishment: A business or industrial unit at a single location that distributes goods or performs services
Establishments are collected into companies
Business entity demographics separately track establishments (physical business locations) and companies (economic organizations owning establishments)

Business Entity Demographics
Company (or “enterprise”): all the establishments that operate under the ownership or control of a single organization.
A company may be a commercial business, service, or membership organization
A company may consist of one or several establishments
A company may operate at one or several locations
A company may operate in one or more economic activities
A company includes all subsidiary organizations, all establishments that are majority-owned by the company or any subsidiary, and all the establishments that can be directed or managed by the company or any subsidiary.

Single-unit Companies (SU)
Definition: Companies for which the location and the company are one and the same
A single-unit business, service agency, or membership organization is one for which all the economic activity of the owner or owners is conducted at a single location
Example: the “Shop Around The Corner” in the movie of the same name (or “You’ve Got Mail”)

Multi-unit Companies (MU)
Definition: companies that have more than one location
A multi-unit business, service organization, or governmental agency is one for which the owners conduct economic activity at more than one physical location
Example 1: all the manufacturing and management locations of the General Motors Corporation constitute a multi-unit company
Example 2: all the service delivery locations of the Salvation Army constitute a multi-unit company
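The SU/MU distinction can be sketched by counting establishments per company. The identifiers below are hypothetical, and the sketch approximates "more than one location" by "more than one establishment record", which assumes each establishment record corresponds to one physical location.

```python
from collections import Counter

# Hypothetical (establishment_id, company_id) pairs.
establishments = [
    ("E1", "C1"),
    ("E2", "C1"),
    ("E3", "C2"),
    ("E4", "C3"),
    ("E5", "C1"),
]


def classify_companies(establishment_links):
    """Label each company SU (one establishment) or MU (more than one)."""
    counts = Counter(company for _, company in establishment_links)
    return {company: ("MU" if n > 1 else "SU") for company, n in counts.items()}


company_type = classify_companies(establishments)
```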

Government Entities
Definition: public entity created by the U.S. Constitution, state constitutions or the statutes of a state
In the United States these are divided into:
– National government (U.S. Constitution)
– State government (state constitutions)
– Local government (statutory entities created by states)
    General purpose
    Public school systems
    Special districts

Business and Government Entity Activity Classifications
An entity that is engaged in economic or governmental activity may be classified in several ways
– Ownership: the legal form of organization (public/private; corporate, partnership, sole proprietorship)
– Activity: An industry is the most detailed category available in the North American Industry Classification System (NAICS) to describe business activities

Business and Government Entity Activity Classifications
NAICS provides hundreds of separate industry categories, unique categories that reflect different methods used to produce goods and services.
Industry categories are used to classify, collect, process, publish, and analyze business statistics.
NAICS documentation

Demographic Sampling Frames
Comprehensive mapping of location of the in-scope population with standardized geography
Comprehensive list of domiciles in the geographic area covered
Characteristics of inhabitants of the domiciles (stratifying variables)

Coverage
Domicile address lists are collected from multiple sources for households and group quarters
The U.S. Census Bureau collects domicile addresses into the MAF (Master Address File)
The U.S. Census Bureau collects physical locations into a set of files known as the TIGER (Topologically Integrated Geographic Encoding and Referencing system)
Census geography resources

Refreshing and Unduplication
Refreshing a demographic sampling frame consists of collecting information on new housing units (households or group quarters) and purging information on housing units that have been taken out of service
Unduplication is the process of expending resources to reduce the probability that an entity is present more than once in a sampling frame

More about Unduplication
When a demographic sampling frame is refreshed, addresses are added from multiple sources (tax records, new construction permits, etc.)
The same address may appear more than once
Some locations may appear to have no domiciles located on them but actually contain households or group quarters
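A very simple form of unduplication normalizes each address string and keeps one record per normalized key. This is only a sketch under strong assumptions: the abbreviation table is a small hypothetical sample, and production address matching relies on far more elaborate standardization and field verification than shown here.

```python
import re

# Hypothetical, deliberately tiny abbreviation table.
ABBREVIATIONS = {"street": "st", "avenue": "ave", "road": "rd", "apartment": "apt"}


def normalize(address):
    """Lowercase, strip punctuation, and abbreviate common address words."""
    tokens = re.sub(r"[^\w\s]", " ", address.lower()).split()
    return " ".join(ABBREVIATIONS.get(token, token) for token in tokens)


def unduplicate(addresses):
    """Keep the first record seen for each normalized address."""
    seen = {}
    for address in addresses:
        seen.setdefault(normalize(address), address)
    return list(seen.values())


# Addresses arriving from several hypothetical sources.
raw = [
    "123 Main Street, Apt. 2",
    "123 main st apt 2",
    "45 Oak Avenue",
    "45 Oak Ave.",
]

frame = unduplicate(raw)
```

String normalization only reduces the probability of duplicates; it cannot detect two genuinely different descriptions of the same domicile.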

Economic Sampling Frames
Comprehensive mapping of location of the in-scope activity with standardized geography
Comprehensive list of addresses of business and government establishments in the geographic area covered
Economic activity and size measures for the entity (stratifying variables)
Updating of organizational structure (by survey)

Coverage
Entity address lists are assembled from previous economic censuses, tax records and surveys of company organization
The U.S. Census Bureau collects business establishment information into the Employer Business Register and the Non-employer Business Register
A separate register is maintained for government establishments
The Bureau of Labor Statistics collects business establishment information into its Current Employment Statistics (CES) frame from information collected by state departments of employment security (ES-202)

Frame Development
Both business frames must collect information about the births and deaths of establishments
– Census Bureau: Report of Organization Survey between Economic Censuses, responses on periodic surveys
– BLS: Quarter 1 of the QCEW and state Labor Market Information offices
This is complicated by the reliance on administrative reports (tax and unemployment insurance reports) and sample surveys
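Establishment births and deaths can be sketched as set differences between the identifiers present in two consecutive reporting periods. The identifiers are hypothetical, and this is not the procedure used by either agency.

```python
def births_and_deaths(previous_ids, current_ids):
    """Compare establishment identifiers across two periods."""
    previous, current = set(previous_ids), set(current_ids)
    return {
        "births": current - previous,      # present now, absent before
        "deaths": previous - current,      # present before, absent now
        "continuers": previous & current,  # present in both periods
    }


# Hypothetical identifiers reported in two consecutive quarters.
quarter_1 = ["E1", "E2", "E3"]
quarter_2 = ["E2", "E3", "E4"]

changes = births_and_deaths(quarter_1, quarter_2)
```

A comparison based only on identifiers records a spurious death and a spurious birth whenever a continuing establishment changes its identifier, for example after a change of ownership or tax filing status.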

Basic Relations Connecting Frames
Geographic relations
Business relations

Geographic Relations
Economic and demographic frames are directly connected through the use of common geographic identifiers
A single location can be associated with household (demographic) activity, economic activity, both, or neither (undeveloped)
No U.S. statistical agency maintains a single geographically integrated frame for households and businesses

Business Relations
Demographic and economic sampling frames can be connected by economic relations between the households and businesses
Supplier-customer relations
Employer-employee relations
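An employer-employee relation can be sketched as a job record that carries both a person identifier and an establishment identifier, so that records from the demographic side can be joined to records from the economic side. All identifiers and the function name below are hypothetical.

```python
# Hypothetical job records: (person_id, establishment_id).
jobs = [("P1", "E1"), ("P2", "E1"), ("P3", "E2")]

# Hypothetical demographic frame lookup: person_id -> household_id.
household_of = {"P1": "H1", "P2": "H2", "P3": "H2"}


def households_by_establishment(job_records, person_to_household):
    """For each establishment, collect the households of its employees."""
    links = {}
    for person_id, establishment_id in job_records:
        household_id = person_to_household[person_id]
        links.setdefault(establishment_id, set()).add(household_id)
    return links


links = households_by_establishment(jobs, household_of)
```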

What is Sampling Frame Maintenance?
The sampling frame or frame population defines the actual entities from the target population with a positive probability of inclusion in the survey or census
Demographic and economic activity change the frame dynamically
The biggest investment that statistical agencies and private firms make is the creation and maintenance of frame populations

Housing Frame
Swedish registers
Census MAF/TIGER system (MTdb)
Housing address list updates
How do you combine the information?

Register-based Population Censuses
All personal addresses in Sweden are maintained in national registers
The registers are updated by the individual when he/she moves using the national PIN
All businesses and governmental activity use the registers to find the person
The registers can be used as the sampling frame to conduct a census or survey
– Swedish register-based census 2005
– German register-based census

Census Master Address File (Census 2010)
Started with 2000 Master Address File
– Based on 1990 Address Control File (ACF)
– Add U.S. Postal Service Delivery Sequence File (DSF)
– Un-duplication (one record per address)
Updated with Census 2000 address improvements
– 100% block canvass (independent of LUCA)
    Physical visit of address; verification of use
    Address confirmed or deleted
    Missing addresses added if residential
– Local Update of Census Addresses (LUCA)
– Group Quarters Master File
Spatial address database: Topologically Integrated Geographic Encoding and Referencing (TIGER) developed for 1990 Census

Census MAF (2)
Local Update of Census Addresses (LUCA; independent of 100% canvass)
– Voluntary program by county and local governments
– Adds and deletes by address
– Special statutory exception to Title 13 for address improvement
– States were also authorized in 2010 Census to cooperate
LUCA final assessment 2010 Census

Census MAF (3)
New construction
  – To include new construction since 100% canvass and LUCA
  – Updated DSF
  – Lists of new housing construction from local governmental units
Arbitration of units found by LUCA not in 100% canvass (field verification)
Unduplication of new construction list
Field staff visits to determine Census status

MAF/TIGER System (MTdb)
Contains all historical MAF data and metadata
MAF Units:
  – Housing unit (HU)
  – Group quarters (GQ)
  – Transitory location (TL)
  – Nonresidential
Ongoing update efforts
  – USPS Delivery Sequence Files (DSF)
  – ZIP Move Engineering File
  – Locatable Address Conversion System (LACS)
Overview from census.gov

Employer Business Frame Populations
Census Employer Business Register
BLS Establishment Register
Establishment births and deaths
The problem of false births and deaths
The problem of multiple activity codes

Census Employer Business Register
Frame population consists of businesses that file income tax returns
Employers are identified from the tax forms
Multiunit and single unit businesses are determined in the Economic Census
Updates to the MU/SU classification are determined by the annual Report of Organization Survey
Overview from census.gov

Business Register (2)
Frame maintenance is conducted using weekly strips from the IRS master business returns
Information is acquired with a lag that corresponds to the difference between filing deadlines and accounting periods
Many different business tax forms are used

Business Register (3)
To create a frame population from the Business Register (employer businesses)
  – Define the target population by activity and date
  – Select the establishments from the SU and MU business registers that meet the target population definition
  – Eliminate inactive establishments
  – Edit size measures (employment, payroll, sales)
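
The four steps above can be sketched in code. This is only an illustration under stated assumptions: the `Establishment` record, its field names, the prefix-based activity test, and the edit rule (a negative employment count is treated as missing) are hypothetical and do not describe the Business Register's actual layout or edit procedures.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Establishment:
    """Hypothetical register record; not the real Business Register schema."""
    estab_id: str
    activity_code: str
    first_active: date
    last_active: Optional[date]  # None means still active
    employment: Optional[int]


def build_frame(register, activity_prefix, reference_date):
    """Select in-scope, active establishments and edit their size measure."""
    frame = []
    for est in register:
        # Steps 1-2: target population defined by activity and date.
        in_target = est.activity_code.startswith(activity_prefix)
        started = est.first_active <= reference_date
        # Step 3: drop establishments that ceased before the reference date.
        not_ended = est.last_active is None or est.last_active >= reference_date
        if not (in_target and started and not_ended):
            continue
        # Step 4: illustrative edit rule, negative employment becomes missing.
        if est.employment is not None and est.employment < 0:
            est = replace(est, employment=None)
        frame.append(est)
    return frame


# Made-up register entries for illustration.
register = [
    Establishment("A", "4451", date(2005, 1, 1), None, 12),
    Establishment("B", "4451", date(2005, 1, 1), date(2010, 6, 30), 40),
    Establishment("C", "7221", date(2008, 3, 1), None, 7),
    Establishment("D", "4452", date(2011, 9, 1), None, -1),
]
frame = build_frame(register, "445", date(2012, 3, 12))
```

In this example B is dropped as inactive, C falls outside the target activity, and D is kept with its invalid employment value set to missing.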

Employment/Job Frame Populations
A job is a relation between an employer and an employee
Target population of jobs depends on definitions of employer and employee
Frame population must be constructed from legal employment definitions
No U.S. agency maintains a sampling frame for jobs
Closest is the LEHD Infrastructure File System Employment History File at the Census Bureau

Job Frame Populations
Dynamic frames
Defining an employer
Defining an employee
Person Frame: Person ID, Data
Employer Frame: Employer ID, Data
Job Frame: Person ID, Employer ID, Data

Job Frame Populations
The problem of integration on the employer side
  – The job frame does not define a complete employer population frame, even if it is universal, unless the activity definitions for the employer and the employee report match
The problem of integration on the employee side
  – The job frame does not define a complete individual population frame, even if it is universal, unless the individual activity definition only includes employment (no unemployment or non-participation)
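
The integration problem can be illustrated by treating the job frame as a set of (person ID, employer ID) pairs and projecting it onto each side. The sketch below uses made-up identifiers, and the function name `integration_gaps` is an assumption for the example; it shows only the set logic, not any agency's file structure.

```python
def integration_gaps(job_frame, person_frame, employer_frame):
    """Compare the job frame's projections with the person and employer frames."""
    persons_with_jobs = {person for person, _ in job_frame}
    employers_with_jobs = {employer for _, employer in job_frame}
    return {
        # Persons with no job record, e.g. unemployed or non-participants.
        "persons_not_covered": person_frame - persons_with_jobs,
        # Employers with no job record under the employee-side definitions.
        "employers_not_covered": employer_frame - employers_with_jobs,
    }


# Hypothetical frames keyed by person ID and employer ID.
person_frame = {"p1", "p2", "p3", "p4"}
employer_frame = {"e1", "e2", "e3"}
job_frame = {("p1", "e1"), ("p2", "e1"), ("p2", "e2")}

gaps = integration_gaps(job_frame, person_frame, employer_frame)
```

Both projections are proper subsets here even though the job frame covers every job, which is the sense in which a universal job frame still fails to define complete person or employer frames.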

Geographic Link Frame
Transportation network frames
Defining a workplace location
Defining a residence location
Frame maintenance for the workplace-residence pair

Transportation Network Frame Populations
Potential target populations:
  – Trips to work on a particular date
  – Trips to purchase a good or service on a particular date
  – Leisure trips on a particular date
Potential frame populations
  – Reported place of work on Decennial Census
  – Residential and employer address information in a job frame

Defining a Residence Location
Latitude and longitude of the trip origin
  – Given confidentiality protections, this origin definition is too fine to protect the identity of the traveler in a public use file
  – Frame population can be constructed from small geographic areas that include lat/long boundaries
      Census 2000/ACS use tracts and Traffic Analysis Zones (TAZ) [guidelines]
      OnTheMap uses blocks [definitions]

Defining a Workplace Location
Latitude and longitude identify the location of the business exactly
  – Same concerns for public use files as in residential address
  – Frame populations constructed by assigning lat./long. to a particular unique area
      Census 2000, ACS: TAZ, tracts
      OnTheMap: tracts

Transportation Frame Population Maintenance
The complete frame population is defined by all possible origin/destination pairs in the geography chosen
Census 2000 frame population constructed from the long form questions; not currently updated
ACS frame population constructed from the 5-year composite tables
OnTheMap frame population constructed from the job frame; updated annually for workplace-residence modeling
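
As a rough sketch of "all possible origin/destination pairs in the geography chosen", the complete frame is the Cartesian product of the area list with itself, and observed residence-workplace pairs from a job frame fill in counts for a subset of those cells. The area codes, job records, and function names below are hypothetical.

```python
from collections import Counter
from itertools import product


def od_frame(areas):
    """Complete frame: every (origin, destination) pair, including same-area pairs."""
    return list(product(areas, repeat=2))


def od_counts(jobs, areas):
    """Count jobs in each frame cell; cells with no jobs get 0.

    `jobs` is an iterable of (residence_area, workplace_area) pairs.
    Pairs that fall outside `areas` are not in the frame and are ignored.
    """
    observed = Counter((home, work) for home, work in jobs)
    return {pair: observed.get(pair, 0) for pair in od_frame(areas)}


# Made-up tract codes and job records.
areas = ["T1", "T2", "T3"]
jobs = [("T1", "T2"), ("T1", "T2"), ("T3", "T3")]
counts = od_counts(jobs, areas)
```

With n areas the frame has n² cells, most of them empty in practice, which is one reason the choice of geography (blocks, tracts, or TAZ) matters for both frame size and confidentiality.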

Summary
There are many important details associated with frame definitions and maintenance
These details directly affect the quality of internal statistical agency data files that are directly used for research
The connection between the operational procedures and the publication data will be examined in upcoming classes