New Data in the Federal Statistical Research Data Centers Melissa Ruby Banzhaf, PhD Administrator, ARDC Center for Economic Studies U.S. Census Bureau.

Presentation transcript:

New Data in the Federal Statistical Research Data Centers Melissa Ruby Banzhaf, PhD Administrator, ARDC Center for Economic Studies U.S. Census Bureau October 9, 2015

Overview
• Background on Federal Statistical RDCs
• Types of Data Available in the RDC (Emphasis on New Data)
• How to Obtain Access to this New Data (and other data) in the RDCs

What are Federal Statistical Research Data Centers (RDCs)?
• Secure computing labs where qualified researchers conduct approved statistical analysis on non-public data.
• These data are collected by various government agencies (Census Bureau, NCHS, AHRQ, SSA, and more to come).
• Established through an agreement between federal statistical agencies and a local research community.
• Managed by the Census Bureau.

Federal Statistical Research Data Center Locations

The Atlanta Research Data Center
• Located in the Federal Reserve Bank of Atlanta
  • Corner of 10th & Peachtree
• Consortium Members
  • Emory University
  • University of Georgia
  • Georgia State University
  • Clemson University
  • Federal Reserve Bank of Atlanta
  • University of Alabama at Birmingham
  • University of Tennessee – Knoxville
  • Florida State University
  • Georgia Institute of Technology

Types of Restricted Data Available
• Economic Data
  • Microdata on firms and establishments
  • Business Register data
• Demographic Data
  • Survey data on individuals and households
  • Administrative data on individuals
  • Linked survey and administrative datasets
• Employer-Employee Jobs Data (LEHD)
  • Data on employees linked with data on employers
• Health Data
  • National Center for Health Statistics
  • Agency for Healthcare Research & Quality

Advantages of Restricted Data
• Vast number of business datasets that are not publicly available at the micro level
• Census datasets can be linked together
• Census datasets can be linked to external data
• More detailed level of geographic identifiers
• Very little top- or bottom-coding
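To illustrate why limited top-coding matters, the sketch below caps a handful of made-up incomes at a made-up threshold, as a public-use file might, and compares the resulting mean with the uncapped mean. The values, the cap, and the function name `top_code` are all assumptions for illustration; they do not describe any actual Census file or disclosure rule.

```python
from statistics import mean

def top_code(values, cap):
    """Replace every value above `cap` with `cap` (simple top-coding)."""
    return [min(v, cap) for v in values]

# Synthetic incomes; not drawn from any real dataset.
incomes = [20_000, 35_000, 50_000, 80_000, 400_000]

# Hypothetical public-use version with incomes capped at 150,000.
public_use = top_code(incomes, cap=150_000)

true_mean = mean(incomes)          # mean of the uncapped values
top_coded_mean = mean(public_use)  # mean after capping; cannot exceed the uncapped mean

print("uncapped mean:", true_mean)
print("top-coded mean:", top_coded_mean)
```

With a single very high earner in the sample, the capped mean falls well below the uncapped one, which is the kind of distortion that restricted files with little top-coding avoid.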

Economic Datasets
• Annual Survey of Manufactures
• Census of Construction
• Census of Finance and Insurance
• Census of Manufactures
• Census of Mining
• Census of Real Estate
• Census of Retail
• Census of Services
• Census of Transportation
• Census of Wholesale
• Survey of Business Owners
• Commodity Flow Survey
• Import and Export Transactions
• Annual Capital Expenditures Survey
• Business Register (SSEL)
• Longitudinal Business Database
• Manufacturing Energy Consumption Survey
• Medical Expenditure Panel Survey, Insurance Component
• National Employer Survey
• Pollution Abatement Costs and Expenditures
• Quarterly Financial Reports
• Research and Development Survey
• Survey of Manufacturing Technology
• Annual Retail/Wholesale Trade Surveys
• Kauffman Firm Survey

New Data – Management and Organizational Practices Survey
• Supplement to the 2010 Annual Survey of Manufactures
• Goal: Collect information on establishments' use of structured management practices
• 36 questions:
  • 16 Management (monitoring, targets, and incentives)
  • 13 Organization (who makes decisions, data in decision-making)
  • 7 Background (number of managers/non-managers, union status)
• Permits analysis of relationship between management practices and key economic outcomes (e.g., productivity)

Demographic Datasets - Survey
• Decennial Surveys ( )
• American Community Survey
• Current Population Survey
• Survey of Income and Program Participation
• American Housing Survey
• National Survey of College Graduates
• National Crime Victimization Survey

New Data - Decennial
• 1950 – 1% PUMS sample
  • Geography: Census tract but lowest level is enumeration district (roughly 600 people)
• 1960 – 25% sample (densest ever)
  • Geography: Census tract and other sub-county geographies (Census place) but lowest level is enumeration district (roughly 600 people)
• Harmonized coding across 1950 and 1960

New Data – Current Population Survey
• CPS Basic Monthly Data ( )
• CPS Food Security Supplement ( )
• CPS Voting and Registration Supplement (2006, 2008, 2010, 2012)
• CPS Fertility Supplement (1998, 2000, 2002, 2004, 2006, 2008, 2010, 2012)

New Data – Current Population Survey
• Characteristics of Internal Files:
  • Geography: Census Tract
  • March CPS is the only file that has PIKs
  • Has CPS identification key so may be able to link across CPS surveys.
  • Some limitations on types of analysis permitted by BLS.

New Data – National Crime Victimization Survey
• National survey of households ( )
• Collects information on frequency, characteristics, and consequences of criminal victimization (sexual assault, robbery, burglary, motor vehicle theft, etc.)
• New: Public Police Contact Survey (2011) – Collects information on perceptions of police behavior and response during encounters.

New Data – National Survey of College Graduates
• Biennial survey collects information (such as occupation, work activities, salary, relationship between degree field and occupation) on college-educated individuals, with particular emphasis on those in science and engineering fields.
• 2010 currently available
• Geography at state level
• Currently no PIKs

Demographic Datasets - Administrative
• Census Numident File (SSA)
• Housing Datasets (HUD):
  • Public and Indian Housing Information Center Dataset
  • Tenant Rental Assistance Certification Systems dataset
  • Computerized Homes Underwriting Management System

Demographic - Administrative Continued
• Medicare/Medicaid Datasets (CMS):
  • Medicare Enrollment Database
  • Medicaid Statistical Information System

Administrative – Census Numident
• Data derived from applications for Social Security Numbers
• Contains data on:
  • Birthdate
  • Town or county of birth
  • Gender
  • Race
  • Citizenship
  • Date of death
  • PIKs

Administrative - Housing
• Public and Indian Housing Information Dataset
  • Contains information on all members of HH with a participant in a covered program:
    • Housing Choice Voucher
    • Public Housing
    • Indian Housing
  • Includes age, race, sex, rent, household income, PIK
  • Geography: block level

Administrative - Housing
• Tenant Rental Assistance Certification Systems (TRACS) dataset
  • Contains information on all members of HH with a participant in a covered program.
  • These programs provide rental assistance for participants living in privately owned, subsidized housing.
  • Includes age, race, sex, rent, household income, PIK
  • Geography: block level

Administrative - Housing
• Computerized Homes Underwriting Management System (CHUMS)
  • Contains records on approved mortgage applications insured by the Federal Housing Administration (FHA)
  • Contains information on borrowers and co-borrowers including income, housing value, mortgage, demographic characteristics, PIKs
  • Geography: block level

Administrative - CMS
• Medicare Enrollment Database ( )
  • Information on all Medicare beneficiaries
  • Limited to information on people, not claims: eligibility dates and statuses, residence change dates, basic demographic information, PIKs
  • Geography: block level

Administrative - CMS
• Medicaid Statistical Information System ( )
  • Information on all Medicaid and CHIP enrollees in each month
  • Limited to information on people, not claims: eligibility dates and statuses, basic demographic information, PIKs
  • Geography: zip code level

Demographic Datasets: Linked Survey-Administrative
• Current Population Survey – SSA Earnings Files
• Survey of Income and Program Participation – SSA Earnings Files
• National Longitudinal Mortality Study

Linked: SSA Files with CPS and SIPP
• CPS and SIPP Survey Data matched to SSA earnings files by PIK
• SSA records include:
  • Detailed Earnings Record – earnings from FICA, non-FICA, and self-employment income (1978+) from Master File
  • Summary Earnings Record – all earnings for each year from 1951 to present
  • Master Beneficiary Record – contains information (entitlement and payment data) on Social Security Recipients (including Disability).
  • 831 Disability File – determines medical eligibility for Disability Insurance and SSI benefits.
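As a rough illustration of what matching survey data to earnings files by PIK means, the sketch below performs a one-to-many left join on a shared person key. All records, field names, and the helper `link_by_pik` are synthetic and hypothetical; they do not reflect the layout of any CPS, SIPP, or SSA file, and actual linkage inside the RDC is done on the agencies' own files and software.

```python
# Synthetic survey respondents keyed by a made-up PIK value.
survey = [
    {"pik": "A001", "age": 34, "state": "GA"},
    {"pik": "A002", "age": 51, "state": "AL"},
    {"pik": "A003", "age": 27, "state": "TN"},
]

# Synthetic annual earnings records; one person can have several years.
earnings = [
    {"pik": "A001", "year": 2010, "earnings": 41_000},
    {"pik": "A001", "year": 2011, "earnings": 43_500},
    {"pik": "A003", "year": 2011, "earnings": 28_000},
]

def link_by_pik(survey_rows, earnings_rows):
    """Left-join earnings onto survey respondents using the PIK as the key.

    Respondents with no earnings record are kept, with year and earnings
    set to None, so unmatched cases stay visible in the linked file.
    """
    by_pik = {}
    for row in earnings_rows:
        by_pik.setdefault(row["pik"], []).append(row)

    linked = []
    for person in survey_rows:
        matches = by_pik.get(person["pik"], [])
        if not matches:
            linked.append({**person, "year": None, "earnings": None})
        for match in matches:
            linked.append(
                {**person, "year": match["year"], "earnings": match["earnings"]}
            )
    return linked

linked = link_by_pik(survey, earnings)
for row in linked:
    print(row)
```

Keeping unmatched respondents in the output is a design choice in this sketch: it makes the match rate easy to inspect before any analysis is run on the linked rows.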

Linked: National Longitudinal Mortality Study
• Purpose of database: to study the effects of demographic and socio-economic characteristics on mortality
• Survey data: March CPS, 1980 Decennial Census (sample)
• Administrative data: Death Certificate information from National Death Index (through 2011)
• Geography: county level

LEHD
• “Tracks” a person based on their place of employment; essentially links employees with employers
• Based on unemployment insurance administrative records
• Available on a state-by-state basis
• Quarterly data starting in 1990 – currently through 2011
• Can link employer to employer data in other Census datasets
• Can link employee to data on individuals in other Census datasets
• New Variables: Firm age and size, Firm ID that matches Business Register
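The employee-employer structure described above can be pictured as quarterly job records that carry both a person identifier and a firm identifier. The sketch below counts distinct workers per firm and quarter, then attaches a firm characteristic through the firm ID. The identifiers, the `firm_age` values, and the function name are invented for illustration and are not the LEHD file layout.

```python
from collections import defaultdict

# Synthetic quarterly job records: (person_id, firm_id, quarter).
jobs = [
    ("P1", "F10", "2011Q1"),
    ("P2", "F10", "2011Q1"),
    ("P1", "F10", "2011Q2"),
    ("P2", "F20", "2011Q2"),
    ("P3", "F20", "2011Q2"),
]

# Synthetic firm characteristics keyed by the same hypothetical firm ID.
firms = {"F10": {"firm_age": 12}, "F20": {"firm_age": 3}}

def employment_by_firm_quarter(job_rows):
    """Count distinct workers for each (firm_id, quarter) pair."""
    workers = defaultdict(set)
    for person_id, firm_id, quarter in job_rows:
        workers[(firm_id, quarter)].add(person_id)
    return {key: len(people) for key, people in workers.items()}

counts = employment_by_firm_quarter(jobs)

# Attach firm age to each firm-quarter employment count via the firm ID.
report = [
    {
        "firm_id": firm_id,
        "quarter": quarter,
        "employment": n,
        "firm_age": firms[firm_id]["firm_age"],
    }
    for (firm_id, quarter), n in sorted(counts.items())
]

for row in report:
    print(row)
```

In this toy data, worker P2 appears at firm F10 in one quarter and at F20 in the next, which is the kind of job-to-job movement that linked employer-employee records make observable.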

New Data – Innovation Measurement Initiative
• Goal: Improve measurement of innovation resulting from research grants, a small but important sector of the economy.
• How: Integrate university data on federally funded research grants with Census Bureau data on people and businesses.
• Specifically link:
  • Employee, vendor, sub-award transactions to the Census Business Register and LEHD (employee-employer database).
• Innovation outcomes: Job placements, start-up activity and business dynamics, vendor characteristics

New Data – Innovation Measurement Initiative
• Partnership between Census and Institute on Research in Innovation and Science (IRIS) at the University of Michigan
• Member institutions of IRIS provide data to Census and in turn receive:
  • Individual and collective reports
  • Underlying tables and graphics for institution’s use
  • Access to aggregate data for researchers
  • Input on new product design

New Data – IMI Opportunity
• Census is asking for nominations of teams of 2-5 researchers (at least one member with SSS) to assist in enhancing and documenting data for the IMI project.
• What is in it for you?
  • Opportunity to do research on new data.
  • $25K in funding support for 1 graduate student.
• Initial deadline for nominations: October 16

Health Data in the ARDC
• These data are collected by:
  • National Center for Health Statistics (NCHS)
  • Agency for Healthcare Research and Quality (AHRQ)

What types of NCHS data?
National Health Status Surveys
• National Health and Nutrition Examination Survey (NHANES) I, II, and III
• National Health Interview Survey (NHIS)
• Longitudinal Study on Aging I and II (LSOA)
• National Survey of Family Growth
• National Survey of Children's Health
• National Survey of Early Childhood Health
• National Survey of Children with Special Health Care Needs
• National Asthma Survey
National Health Care Surveys
• National Ambulatory Medical Care Survey
• National Hospital Ambulatory Medical Care Survey
• National Survey of Ambulatory Surgery
• National Hospital Discharge Survey
• National Nursing Home Survey (NNHS)
• National Home and Hospice Care Survey
• National Employer Health Insurance Survey
• National Health Provider Inventory
• National Immunization Survey
Vital Statistics
• Mortality and Multiple Mortality
• Birth
• Fetal Death
• National Death Index
• Marriage and Divorce

What types of NCHS data? Linked Data Sets
• Linked mortality data: NHIS, NHANES, LSOA II, NNHS
• Linked Medicare Enrollment and Claims data: NHIS, NHANES, LSOA II
• Linked Social Security Administration Data: NHIS, NHANES, LSOA II, NNHS
• Linked EPA data

What types of AHRQ Data?
• Medical Expenditure Panel Survey (MEPS) files include:
  • Household Component
  • Provider Component
  • Insurance/Employer Component
  • Nursing Home Component (1996 only)
  • Area Resource File
  • Two-year two panel file
  • MEPS-NHIS linked data
• Only Household Component and portions of Provider Component are publicly available

How to Access the RDC
• Develop proposal
  • Guidelines for Census data differ from those for NCHS/AHRQ data
• Submit proposal for agency review
  • Census (and agency sponsors)
  • NCHS/AHRQ
• Obtain Special Sworn Status (SSS)
• Pay one-time fee for NCHS/AHRQ data

Timeframe – “Patience is a Virtue”
• Census Data
  • Plan on 6 to 9 months before working in lab
  • Census approval / Other Agency Approval
• NCHS/AHRQ Data
  • Timeframe dependent on agency approval process
  • Census approval NOT required
• Special Sworn Status
  • 3 to 4 months for your security clearance

Working in the ARDC lab
• All analysis conducted in the ARDC lab
• Data located on server in Maryland
• Access data via thin client terminals
• No internet access or personal computers allowed in lab
• Statistical software available: SAS, Stata, R, Matlab, GIS, Sudaan, etc.
• Agency reviews output before releasing
• Penalty for disclosure is $250,000 and/or 5 yrs in prison (inadvertent or otherwise)

Upcoming RDC-Related Events
• Cornell University Course – INFO 7470 – Understanding Social and Economic Data
  • Can be connected via distance learning (and get course credit)
  • Intended for Ph.D. students and faculty who use large-scale restricted-access data from government suppliers
  • Emphasis on data accessible through the RDC network
• Interested? Contact us for more information.

Contact Information
• People:
  • Melissa Ruby Banzhaf, ARDC Administrator
  • Julie L. Hotchkiss, ARDC Executive Director
• Resources:
  • ARDC website: atlantardc.org
  • Quarterly ARDC Newsletter (contact us to get on list)