Improving researcher access to USDA’s Agricultural Resource Management Survey Charles Towe and Mitch Morehart Economic Research Service, USDA.



What is the Agricultural Resource Management Survey?
ARMS is USDA’s primary survey for the annual collection of data from farm operators about their:
- Farm: ownership, governance, management, and performance
- Choice of practices, inputs, and expenditures to produce crop and livestock commodities
- Household: demographic attributes, economic and financial activities

Program Activities Supported by ARMS
1. Responding to mandates: income for farms, costs for commodities, status of family farms
2. Support for U.S. National Economic Accounts (GDP, Personal Income)
3. Providing data to respond to USDA policies & programs
4. Enabling research to inform decision makers on a variety of issues

Data delivery (pre 2004)
- ARMS is a complex survey that has existed, in one form or another, for approximately 20 years.
- Since 1996 the data collection methodology has been standardized.

Data delivery

Project goals
- Allow users to customize table requests
- Allow two-way tables
- Add state-level analysis
- Support graphical representation of data
- Provide advanced users access to a suite of regression-type methods
- Provide this to users in an environment that protects survey participants’ confidentiality to ensure future participation

Primary and complementary cell suppression algorithm
Primary
- looking at individual cells
- class disclosure
Secondary
- solving from totals or known formulae
- combining data from different tables and sources
- using non-suppressed information to infer things
- much more difficult to check

Primary disclosure
1) Threshold rule: no cells with fewer than 3 units (enterprises)
2) Dominance rule: sum of the sample minus the two largest observations (C) cannot exceed 60% of the largest value, or C > 3/5 * U
[Diagram: observations labeled U, V, W, X, Y, Z, with the two largest observations and the remainder C marked]
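The two primary rules can be sketched in a few lines of Python. This is only an illustration, not the ERS implementation. The slide's wording of the dominance rule is ambiguous, so the sketch assumes that U is the largest observation in the cell and that a cell passes when C > 3/5 * U; the function name and parameters are hypothetical.

```python
def primary_disclosure_check(values, min_units=3, ratio=0.6):
    """Return the names of the primary rules a cell violates.

    values: non-negative contributions of the units in one table cell.
    Assumption (not confirmed by the slides): U is the largest value and
    the cell passes the dominance rule only when C > ratio * U, where C is
    the cell total minus the two largest observations.
    """
    violations = []

    # Threshold rule: no cells with fewer than 3 units.
    if len(values) < min_units:
        violations.append("threshold")

    # Dominance rule: compare the remainder C with the largest value U.
    ordered = sorted(values, reverse=True)
    if ordered:
        largest = ordered[0]
        remainder = sum(ordered[2:])
        if not remainder > ratio * largest:
            violations.append("dominance")

    return violations
```

For example, a cell with contributions 100, 90, 5 and 5 has enough units but a remainder of only 10, so under this reading it would be flagged for the dominance rule and suppressed.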

Secondary disclosure
1) Algorithm (equation check) determines additional cells for obfuscation in order to keep primary disclosure protection intact
2) Factored in solving from totals and across cells using the relationship of data in a single table
3) Could not prevent cross-table searches
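To see why complementary cells are needed, consider a single table whose row and column totals are published: if a row or column contains exactly one suppressed cell, that cell equals the total minus the visible cells. The Python sketch below illustrates this idea with a simple greedy heuristic. It is an assumption for illustration only, not the equation check used by ERS; the function names are hypothetical, and like the approach described above it looks at one table at a time.

```python
def exposed_lines(suppressed, n_rows, n_cols):
    """Rows/columns with exactly one suppressed cell. With totals published,
    that cell is recoverable as total minus the visible cells."""
    lines = []
    for r in range(n_rows):
        if sum((r, c) in suppressed for c in range(n_cols)) == 1:
            lines.append(("row", r))
    for c in range(n_cols):
        if sum((r, c) in suppressed for r in range(n_rows)) == 1:
            lines.append(("col", c))
    return lines


def add_complementary(values, primary):
    """Greedy heuristic (an assumption, not the ERS method): while any row or
    column holds a lone suppressed cell, also suppress the smallest visible
    cell in that row or column."""
    n_rows, n_cols = len(values), len(values[0])
    suppressed = set(primary)
    while True:
        exposed = exposed_lines(suppressed, n_rows, n_cols)
        if not exposed:
            return suppressed
        kind, idx = exposed[0]
        if kind == "row":
            line = [(idx, c) for c in range(n_cols)]
        else:
            line = [(r, idx) for r in range(n_rows)]
        visible = [cell for cell in line if cell not in suppressed]
        if not visible:
            # A one-cell line: only withholding its total would protect it.
            raise ValueError("line has a single cell; its total must be withheld")
        suppressed.add(min(visible, key=lambda rc: values[rc[0]][rc[1]]))


table = [[10, 20, 30],
         [40, 50, 60],
         [70, 80, 90]]
result = add_complementary(table, {(0, 0)})
```

In this 3 x 3 example, one primary suppression grows into a 2 x 2 block of suppressed cells, which shows how quickly complementary suppression can reduce the amount of publishable data.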

Primary and complementary cell suppression algorithm
[Flowchart labels: Data request made; Key variables; Primary Disclosure Rules on Data (plus statistical reliability); Collect list of violating variables; Candidate List; Identify the method for selecting complementary cells; Equation Check; Final List; Build table for display; Done!]

Final implementation
- Prototype built in 2003 and presented at a peer review panel
- Highlighted the need to disseminate data further and illustrated the associated risks
- Resulted in approval of a weighting scheme for all data which, theoretically, eliminates the need for secondary suppression
- Pre-generated each data point
- Faster response time, which allowed greater graphic capabilities

2004 Project Timeline and Milestones
Tracks: Design; Application functionality; Data preparation; Testing & evaluation; eGov integration
- 4/21/03 Kickoff: Team Charter; Identify Goals, Project Plan, & Resources
- 11/25/03 First prototype presented
- 3/26/04 V-1 Release: Intranet, by IP address
- 4/29/04 V-2 Release: ltd, secure Extranet by IP address
- 5/10/04 Peer Review
- 6/21/04 Noise implemented
- 6/30/04 V-3 Release: ltd, secure Extranet open outside of ERS
- 8/5/04 Security Evaluation
- 8/21/04 Audit logging
- 9/24/04 Extranet Tool Released
- 11/9/04 Tailored Reporting Tool Public Release; Public website overhauled

Enclave Basics
Mission
- To Promote Access to sensitive micro data
- To Protect Confidentiality
- To Archive, Index and Curate Micro-data
Background
- Started by NIST/ATP
- Went live July 2007
- Current participants/data producers: NIST/ATP, USDA/ERS (pilot), Kauffman Foundation
Innovations
- Secure remote access
- Collaboratory: a collaborative environment for researchers to work, share code, ideas & work with online discovery tools
- Standardized metadata documentation techniques (IHSN’s microdata management toolkit; DDI compliance)

NORC Data Enclave: Mechanics of Portfolio Approach to Protection
Provision of access:
a) Technical protection (IT and operational)
b) Agency-specific data protection requirements (Legal)
c) Statistical protection (Statistical)
d) Researcher training (Educational)