Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, 20-23 September 2011 Overview of Archiving of Microdata Session 4 United Nations.

Presentation transcript:

Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations Statistics Division Demographic Statistics Section

Overview of Presentation
- What are microdata?
- Why disseminate microdata?
- Data files for archiving
- Preparing the data sets
- Data security
- Tools for archiving of microdata
- Risks of disseminating microdata

What are microdata?
Microdata:
- are electronic data files containing information about each unit of enumeration, such as a person, household or housing unit
- are organized data files in which each line (or record) contains information about one unit of observation
- contain information in the form of coded values
- contain different types of variables (numeric, alphanumeric, discrete or continuous), obtained from direct responses or derived by imputation or calculation

Why disseminate microdata?
- The main reason is to support research by offering flexibility:
  - to define variables and modify categories to meet the needs of researchers
  - to generate more interest, which facilitates wider use of census data
- A closer relationship between data providers and users can improve the reliability and relevance of data

Versions of data files for archiving
Data producers often create multiple versions of microdata files. These files:
- are created during different stages of the census operation
- differ in quality, content and number of records
- range from raw microdata files to cleaned and edited files for public use


What is sensitive in microdata?
- To ensure data confidentiality, census data sets usually do not contain variables that are direct identifiers
- Census data sets do include variables that are indirect identifiers:
  - Detailed geographic information
  - Detailed information on professional status
- Some variables in microdata sets can be sensitive because of the nature of the information they contain:
  - Information on income, ethnicity, religion, etc.
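One way to see why indirect identifiers matter is to count how many records share each combination of them. The sketch below is only an illustration and is not a method prescribed in the presentation: the function name `rare_combinations`, the field names and the threshold `k` are all assumptions chosen for the example.

```python
from collections import Counter


def rare_combinations(records, indirect_identifiers, k=3):
    """Return records whose combination of indirect identifiers
    is shared by fewer than k records (hypothetical threshold)."""
    keys = [tuple(rec[name] for name in indirect_identifiers) for rec in records]
    counts = Counter(keys)
    return [rec for rec, key in zip(records, keys) if counts[key] < k]


# Made-up sample: three farmers in district A, one judge in district B.
sample = [
    {"district": "A", "occupation": "farmer"},
    {"district": "A", "occupation": "farmer"},
    {"district": "A", "occupation": "farmer"},
    {"district": "B", "occupation": "judge"},
]

# The single judge in district B is the only record flagged.
flagged = rare_combinations(sample, ["district", "occupation"], k=3)
```

Records flagged this way are candidates for coarser geographic or occupational coding before release. The appropriate threshold is a policy decision for the data producer.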

Preparing the data set: Acquisition
- Microdata can be generated from various data sources: censuses, surveys and administrative registers
- A clear acquisition policy that describes the scope, source and mandate for the acquisition of microdata sets is necessary
- The NSO can play an important role by expanding the scope of the data archive to official sources such as line ministries

Preparing the data set: Data file
- Hierarchical/relational files are easier to analyze and more efficient for data storage
- The identification variables in every data file should provide a unique identifier for each record
- Unique identifiers used to merge data files should be composed of numeric variables, for more efficient sorting and filtering of records
- A unique household identifier should not be a concatenation of geographic codes, since these codes are highly identifying
- All unnecessary or temporary variables should be removed from the data files
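The identifier guidance above can be sketched as follows: households get a plain numeric `hh_id` that is not built from geographic codes, and the same identifier is attached to person records so the files can still be merged. All names here (`build_household_lookup`, `attach_household_id`, `hh_id`, the geographic field names) are hypothetical and chosen only for illustration.

```python
def build_household_lookup(households, key_fields):
    """Map each household's original key to a sequential numeric id."""
    lookup = {}
    for rec in households:
        key = tuple(rec[f] for f in key_fields)
        if key in lookup:
            # The original key must itself be unique per household.
            raise ValueError(f"duplicate household key: {key}")
        lookup[key] = len(lookup) + 1
    return lookup


def attach_household_id(records, lookup, key_fields):
    """Return copies of the records with the numeric hh_id added."""
    return [
        {**rec, "hh_id": lookup[tuple(rec[f] for f in key_fields)]}
        for rec in records
    ]


# Assumed layout: the original key is a set of geographic codes plus a serial.
GEO_KEY = ["region", "district", "ea", "hh_serial"]
households = [
    {"region": 1, "district": 4, "ea": 12, "hh_serial": 1, "rooms": 3},
    {"region": 1, "district": 4, "ea": 12, "hh_serial": 2, "rooms": 2},
]
persons = [
    {"region": 1, "district": 4, "ea": 12, "hh_serial": 2, "age": 34},
    {"region": 1, "district": 4, "ea": 12, "hh_serial": 1, "age": 7},
]

lookup = build_household_lookup(households, GEO_KEY)
households_with_id = attach_household_id(households, lookup, GEO_KEY)
persons_with_id = attach_household_id(persons, lookup, GEO_KEY)
```

In this sketch the numbers follow file order. If the household file is sorted by geography, an archive may prefer to shuffle the records before numbering, so that the identifier itself reveals nothing about location.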

Preparing the data set: Variables and codes
- All variables should be labeled (variable labels), and the codes for all categorical variables should be labeled (value labels)
- "Missing" codes should be standardized across all variables
- The "Not applicable" code should be distinct from other missing codes
- If "errors" or "missing data" are imputed, this should be indicated in the data set
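A minimal sketch of these coding rules is shown below. The standard codes (`-9` for missing, `-8` for not applicable), the function names and the `_imputed` flag suffix are assumptions made for the example, not conventions taken from the presentation.

```python
MISSING = -9          # hypothetical standard code for "missing"
NOT_APPLICABLE = -8   # hypothetical, kept distinct from MISSING


def standardize_codes(records, variable, missing_codes, not_applicable_codes):
    """Recode a variable's ad hoc missing / not-applicable codes
    to the standard ones. Returns new records."""
    cleaned = []
    for rec in records:
        new = dict(rec)
        if rec[variable] in missing_codes:
            new[variable] = MISSING
        elif rec[variable] in not_applicable_codes:
            new[variable] = NOT_APPLICABLE
        cleaned.append(new)
    return cleaned


def impute_with_flag(records, variable, fill_value):
    """Replace MISSING with fill_value and record that fact in a flag variable."""
    result = []
    for rec in records:
        new = dict(rec)
        was_missing = rec[variable] == MISSING
        if was_missing:
            new[variable] = fill_value
        new[variable + "_imputed"] = 1 if was_missing else 0
        result.append(new)
    return result


# Made-up raw coding: 99 = not stated, 0 = not applicable (e.g. children).
raw = [{"occupation": 12}, {"occupation": 99}, {"occupation": 0}]
coded = standardize_codes(raw, "occupation", missing_codes={99}, not_applicable_codes={0})
final = impute_with_flag(coded, "occupation", fill_value=50)
```

Keeping the flag as a separate variable lets users of the archived file exclude or reweight imputed values if they wish.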

Preparing the data set: Verification operation
- If a data set is hierarchical, every record in the individual-level files should have a corresponding household in the household-level file
- The number of records in each file should be verified
- Data from all sections of the questionnaire should be included in the data set
This calls for setting up verification rules to check data sets.
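Verification rules of this kind can be expressed as a small check that returns a list of problems. The sketch below assumes both files carry a hypothetical `hh_id` variable and that the expected record counts are known from field control sheets; neither detail comes from the presentation.

```python
def verify_hierarchy(households, persons, expected_households=None, expected_persons=None):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []

    hh_ids = [h["hh_id"] for h in households]
    if len(hh_ids) != len(set(hh_ids)):
        problems.append("household file contains duplicate hh_id values")

    # Every person record must point at a household that exists.
    known = set(hh_ids)
    orphans = sum(1 for p in persons if p["hh_id"] not in known)
    if orphans:
        problems.append(f"{orphans} person record(s) have no matching household")

    # Optional record-count checks against externally known totals.
    if expected_households is not None and len(households) != expected_households:
        problems.append(
            f"household file has {len(households)} records, expected {expected_households}"
        )
    if expected_persons is not None and len(persons) != expected_persons:
        problems.append(
            f"person file has {len(persons)} records, expected {expected_persons}"
        )
    return problems


# Made-up example: person 3 refers to a household that is not in the file.
hh_file = [{"hh_id": 1}, {"hh_id": 2}]
person_file = [{"hh_id": 1}, {"hh_id": 2}, {"hh_id": 7}]
report = verify_hierarchy(hh_file, person_file, expected_households=2, expected_persons=4)
```

Returning all problems at once, rather than stopping at the first, makes it easier to document the state of a file before it is archived.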

Data security
- Physical security
  - Controlling access to rooms where data are held
  - Logging the removal of, and access to, media or hard-copy material in store rooms
- Network security
  - Not storing confidential data on servers or computers connected to an external network
  - Firewall protection and security-related upgrades to avoid viruses and malicious code

Data security
- Security of computer systems and files
  - Locking computer systems with passwords and installing a firewall
  - Implementing password protection of, and controlled access to, data files
  - Protecting servers against power surges with line-interactive uninterruptible power supply (UPS) systems
  - Imposing non-disclosure agreements on managers and users of confidential data

Data security
- Security of personal data
  - Anonymising or aggregating data
  - Separating data content according to security needs
  - Removing personal information from data files and storing it separately
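Removing personal information and storing it separately can be sketched as splitting each record into two parts linked only by a record identifier. The function name `split_identifiers`, the field names and the choice of direct identifiers below are assumptions for illustration. How the restricted part is then secured is outside the scope of this sketch.

```python
def split_identifiers(records, id_field, direct_identifiers):
    """Split records into an analysis file (no direct identifiers)
    and a restricted file (identifiers only), linked by id_field."""
    analysis, restricted = [], []
    for rec in records:
        restricted.append(
            {id_field: rec[id_field], **{f: rec[f] for f in direct_identifiers}}
        )
        analysis.append(
            {k: v for k, v in rec.items() if k not in direct_identifiers}
        )
    return analysis, restricted


# Made-up person records containing direct identifiers.
people = [
    {"person_id": 1, "name": "A. Example", "address": "Plot 5", "age": 41, "sex": 2},
    {"person_id": 2, "name": "B. Example", "address": "Plot 9", "age": 12, "sex": 1},
]
analysis_file, restricted_file = split_identifiers(
    people, id_field="person_id", direct_identifiers=["name", "address"]
)
```

The analysis file can then move on to further anonymisation, while the restricted file is kept under the tighter physical and network controls described in the data security slides.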

Tools for archiving microdata
- International Household Survey Network (IHSN)
  - A network of international agencies coordinated by the World Bank and PARIS21
  - Develops tools, guidelines and training materials
  - Advocates compliance with good practices and international standards

Tools for archiving microdata
- Redatam-based IMIS
  - Originally developed at CELADE to promote access to census microdata
  - A database management tool that handles large volumes of census data
  - Aims to promote access to, and analysis of, census and other data for informed decision making on sectoral and local development policies and programmes

Risks of disseminating microdata
- Maintaining respondents' trust: confidentiality protection is the key element of trust
- Potential misuse and misunderstanding of data by users: there should be procedures to prevent misuse of microdata, and good documentation and technical support to prevent misunderstanding
- Exposure to criticism and contradiction: data quality may not be good enough for further dissemination, and research results based on microdata may be inconsistent with published aggregated data

Risks of disseminating microdata
- Legal issues: it is crucial for data producers to ensure there is a sound legal and ethical basis (as well as the technical and methodological tools) for protecting confidentiality
- Costs: these include not only the costs of creating and documenting microdata files, but also the costs of creating access tools and safeguards, of authorizing and supporting enquiries from the research community, and of training and supporting new users of microdata files
- Technical capacity: the files need to be well documented and preserved, and reviewed to identify the risk of disclosure of individual information so that the risk can be reduced using various techniques

Microdata are archived:
- "to allow future users to retrieve, access, decipher, view, interpret, understand and experience documents, data and records in meaningful and valid ways" (Jeff Rothenberg)
- "to create institutional memory for long-term research"

THANK YOU