Microdata Management Toolkit Tools to facilitate archive and dissemination of surveys A PDF for Data? Metadata Editor / Nesstar Publisher 3.5 CD builder.

Slides:



Advertisements
Similar presentations
International Household Survey Network Metadata Toolkit Trevor Croft MICS3 Data Archiving, Dissemination and Further Analysis Workshop Geneva - November.
Advertisements

Archiving Trevor Croft MICS3 Data Archiving, Dissemination and Further Analysis Workshop Geneva - November 6th, 2006.
International Household Survey Network (IHSN) Microdata Management Toolkit Trevor Croft MICS3 Data Archiving, Dissemination and Further.
Multiple Indicator Cluster Surveys Survey Design Workshop Data Archiving MICS4 Data Processing Workshop.
MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Archiving.
DDI for the Uninitiated ACCOLEDS /DLI Training: December 2003 Ernie Boyko Statistics Canada Chuck Humphrey University of Alberta.
DLI Training Nesstar Workshop
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
Metadata Management at GESIS-ZA Reiner Mauer GESIS – Data Archive and Data Analysis CESSDA-Expert Seminar Odense, September 11th 2008.
MICS Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Data Archiving.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Metadata at ICPSR Sanda Ionescu, ICPSR.
Systematic Review Data Repository (SRDR™) The Systematic Review Data Repository (SRDR™) was developed by the Tufts Evidence-based Practice Center (EPC),
Using language services to enrich the LOs' descriptions Dr. Vassilis Protonotarios University of Alcala, Spain 10 th Strategic Seminar / Conference 6-7.
Microdata Management Toolkit Tools to facilitate archive and dissemination of surveys Pascal Heus International Household Survey.
NESSTAR Limitedw w w. n e s s t a r. c o m DDI-Publishing Made Easy- the Nesstar Way Jostein Ryssevik Nesstar Ltd.
The Metadata Toolbox: A User’s Perspective on DDI J.M. Eisenhauer Smith, Data Analyst/Archivist Center for Demography of Health and Aging University of.
Ørnulf Risnes IASSIST 09 Tampere Finland, 26 May 2009 Norwegian Social Science Data Services Nesstar 4.0.
The International Household Survey Network IHSN IHSN Secretariat PARIS21 Steering Committee, 14 November 2007.
Multiple Indicator Cluster Surveys Data Interpretation, Further Analysis and Dissemination Workshop Data Archiving.
Using a Content Management System Website for the Dissemination of Official Statistics By Edwin St Catherine, Director of Statistics, SAINT LUCIA UN Regional.
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
Open Syllabus : A Prototype Tool to Create Structured Syllabi in Sakai Jacques Raynauld – Olivier Gerbé – Emmanuel Vigne HEC Montréal July
Data Documentation Initiative (DDI): Goals and Benefits Mary Vardigan Director, DDI Alliance.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
Research data workflow Practice in Slovenian Social Science Data Archives SERSCIDA WP4 – WORKSHOP Ljubljana September 2013.
FCM Quality of Life Reporting System Metadata By: Acacia Consulting and Research June 2002.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Changing the culture: Ethiopia’s commitment to dissemination and the multi-media approach By Yakob Mudesir Seid
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
OFC304 Excel 2003 Overview: XML Support Joseph Chirilov Program Manager.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
Data archive in developing countries: preservation and dissemination of microdata as an instrument for better development results Olivier Dupriez Senior.
DLI Training April 2004 Kingston Ontario. DDI What, Why, How?
IHSN International Household Survey Network Strategy for the Development of Data: Improve the Availability, Accessibility, and Quality of Survey Data Mahesh.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
SERPent Project Secure Epidemiology Research Platform January – October 2010 Virtual Research Environment Rapid Innovation Project Funded.
Statistical Export and Tabulation System (SETS) Overview and SetX Debut Ann Aikin and Bob Sloss 2004 Data Users Conference Session #16 U.S. Department.
Innovations in Data Dissemination Thomas L. Mesenbourg, Jr. Acting Director U.S. Census Bureau United Nations Seminar on Innovations in Official Statistics.
Documenting and disseminating census and survey data sets Ilpo Survo, United Nations ESCAP, Bangkok, for UNECE.
International Initiatives International Household Survey Network and Accelerated Data Program Olivier Dupriez World Bank / IHSN.
Copyright 2010, The World Bank Group. All Rights Reserved. ICT - a core management issue Part 1 Managing ICT resources Produced in Collaboration between.
Increasing the Effectiveness of Household Surveys Eric Swanson, Program Manager for Global Monitoring Development Data Group World Bank.
Colectica: A Platform for DDI 3 based Metadata Management Design. Collect. Share.
Multiple Indicator Cluster Surveys Data Interpretation, Further Analysis and Dissemination Workshop Data Archiving.
Artezio LLC Address: 3G Gubkina Str., suite 504, Moscow, Russia, Phone: +7 (495) Fax: +7 (495)
Secure Epidemiology Research Platform (SERPent) Kick Start Meeting - April 15 th, 2010 Pascal Heus
OVERVIEW OF ARCHIVING OF MICRODATA SILAS M. MULWA Kenya National Bureau of Statistics United Nations Regional Seminar on Census Data Archiving for Africa.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Archiving microdata Standards and good practices United Nations Statistics Commission New York, February 26, 2009 Olivier Dupriez World Bank, Development.
SDMX IT Tools Introduction
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
FORSbase SEEDS meeting May 5 th, 2015, Lausanne Bojana Tasic.
UNSD/ NSCB Regional workshop on data dissemination & communication Manila, Philippines, June 2012 Promoting (survey) microdata dissemination policies:
International Household Survey Network
Olivier Dupriez World Bank / IHSN
An Overview of Data-PASS Shared Catalog
What’s New in Colectica 5.3 Part 1
The Re3gistry software and the INSPIRE Registry
DDI for the Uninitiated
Enhancing ICPSR metadata with DDI-Lifecycle
الجهاز المركزي للإحصاء الفلسطيني
5 November, 2018 Nuku’alofa, Tonga
Noumea, New Caledonia, 3 to 7 December, 2018
Albania 2021 Population and Housing Census - Plans
The role of metadata in census data dissemination
Palestinian Central Bureau of Statistics
Presentation transcript:

Microdata Management Toolkit Tools to facilitate archive and dissemination of surveys A PDF for Data? Metadata Editor / Nesstar Publisher 3.5 CD builder Guidelines for Archiving & Dissemination Session E2 - Thursday, 26 May Tools for Preservation: Integration and Assessment Preserving and improving the access to large and complex household surveys Mark Diggory Olivier Dupriez Pascal Heus Jostein Ryssevik Harvard / MIT Data Center World Bank World bank Nesstar Ltd.

Background Sponsored by World Bank / International Household Survey NetworkSponsored by World Bank / International Household Survey Network –Presented earlier this week –Created in September 2004 –International organizations actively sponsoring household surveys –Marrakech Action Plan for Statistics – Survey often under-used: limited access for users which leads to poor return on investment limited impact on the ground, difficulties in policy makingSurvey often under-used: limited access for users which leads to poor return on investment limited impact on the ground, difficulties in policy making Common obstacles: quality, technical capacity, legal/political issuesCommon obstacles: quality, technical capacity, legal/political issues Common problems:Common problems: –Accessibility, Timeliness, Coherence –Lack of metadata / documentation / data –Poorly organized archives To address technical issues: Need for new tools and guidelines  Microdata Management ToolkitTo address technical issues: Need for new tools and guidelines  Microdata Management Toolkit

Toolkit Requirements User friendly software suite and guidelines to archive and disseminate microdataUser friendly software suite and guidelines to archive and disseminate microdata Facilitate metadata exchange: compliant with common XML specifications (DDI, Dublin Core)Facilitate metadata exchange: compliant with common XML specifications (DDI, Dublin Core) Facilitate archiving: put together metadata and data, address common quality control issuesFacilitate archiving: put together metadata and data, address common quality control issues Facilitate dissemination: simple to redistribute on cd/dvd and the web, answer producer/depositor needs (subset, anonymization, quality control)Facilitate dissemination: simple to redistribute on cd/dvd and the web, answer producer/depositor needs (subset, anonymization, quality control) Works with common data formats (spss, sas, stata, statistica, cspro/imps/issa)Works with common data formats (spss, sas, stata, statistica, cspro/imps/issa) Multilingual supportMultilingual support Free or InexpensiveFree or Inexpensive Availability of technical support and trainingAvailability of technical support and training Accompanied with guidelines and training programAccompanied with guidelines and training program Supported by national, international and research communitiesSupported by national, international and research communities

Core file format - A PDF for Data? How can we carry around the information?How can we carry around the information? Looking at documents  PDFLooking at documents  PDF Can we do the same for data?Can we do the same for data? –Yes, a Nesstar file holds data + metadata! Partner with Nesstar Ltd to develop new toolsPartner with Nesstar Ltd to develop new tools –Why: strong tool for metadata management, available today, community acceptance, technical support, past experience Development agreementDevelopment agreement –Enhance existing publisher software and make available as a stand alone product –Open binary file format (not a black box) and availability of API –Free data reader (like pdf) that allows user to access at the data and metadata and convert to their favorite format –Special licensing agreement for developing countries

Toolkit Components Archiving: Metadata Editor (World Bank / Nesstar Ltd.)Archiving: Metadata Editor (World Bank / Nesstar Ltd.) –To compile survey data, documentation and metadata in a standard format (Nesstar/DDI). Free data reader for users. –Built on Nesstar Publisher Dissemination: CD Builder (World Bank / Mark Diggory)Dissemination: CD Builder (World Bank / Mark Diggory) –To facilitate the publication of survey data, documentation and metadata on CD-ROM and on the web (transforms DDI into HTML based navigation) –Based on Eclipse Platform, open source Guidelines: Handbook (World Bank / ICPSR)Guidelines: Handbook (World Bank / ICPSR) –To provide data producer with information on policies and legal aspect of data dissemination, guidelines to document datasets and recommendations in setting up a data archive

Generate HTML based CD-ROM Import metadata and prepare CD-ROM Import data and compile metadata The Toolkit Process 1 2 3

What is the Nesstar Publisher? Advanced data management programAdvanced data management program DDI /DC Metadata authoring toolDDI /DC Metadata authoring tool Import/Export to common data formatsImport/Export to common data formats Standalone or w/Nesstar serverStandalone or w/Nesstar server Easy editing/creation of DDI documented datasets. No need to know XML. Full DDI import and export for single file/language studies. Templates which lets your organization standardize the use of the DDI. Default texts in templates. Local controlled vocabularies. Possible to share the documentation work between different persons. A Category Repository which lets you share categories within a dataset and between datasets. Variable groups. Easy setting of weights. Frequency and summary statistics output, with options for each variable. Import and export to the most common statistical formats.

What is the Metadata Editor? Nesstar Publisher 3.0:Nesstar Publisher 3.0: –A tool to prepare and publish surveys to a Nesstar Server –Sold as a component of the Nesstar Software Suite –Multiple components (editor, hierarchy, cube, resources)  New Model for Version 3.5:  New Model for Version 3.5: –All components integrated under one interface –A study is stored in a single Nesstar file –Enhanced and new functionalities Quality control, computed variables, recodes, anonymize, subsetQuality control, computed variables, recodes, anonymize, subset –Availability of a free Nesstar Data Reader –Produce DDI / Dublin Core (DC) XML documents –Available as a stand-alone software package

Editor key features (1) All components integrated under a single interface Import/Export support common data formats All surveys stored as projects in a single tree hierarchy Template driven metadata editor allows for users to decide which DDI/DC elements to use.

Editor key features (2) Easy to use interface for document, survey, file and variable metadata editing

Editor key features (3) Data import preserves existing dictionary and generates summary statistics Manage variable groups DDI and Dublin Core Metadata import/export DDI and Dublin Core Metadata import/export

Editor key features (4) Support for survey documentation as Dublin Core resources Description of a dataset primary keys and hierarchy …and validation of dataset relationships Automatic randomization of primary key variables AND MORE…

Data Reader Free softwareFree software PDF philosophyPDF philosophy Access to survey metadataAccess to survey metadata Access to data (no need for specialized software)Access to data (no need for specialized software) Export to common formatsExport to common formats Single file holds data and metadataSingle file holds data and metadata

What is the CD Builder? Purpose is to publish survey metadata, documents and data on a CD-Rom (or web site)Purpose is to publish survey metadata, documents and data on a CD-Rom (or web site) Transforms DDI into an HTML based interfaceTransforms DDI into an HTML based interface User can customize the layout (branding) and content of the CD (single or multi- surveys)User can customize the layout (branding) and content of the CD (single or multi- surveys) Open source applicationOpen source application Build on the Eclipse FrameworkBuild on the Eclipse Framework Based on DDI / Dublin CoreBased on DDI / Dublin Core Integrates with Metadata EditorIntegrates with Metadata Editor Easy to useEasy to use

CD Builder Process Create new CD-ROM Project Add a survey to the project and select its type and branding 1 2 Selecting a survey consist in opening the DDI-XML or Nesstar file The survey “branding” determines the overall look and feel of the CD The survey “type” determines the default metadata content Selecting a survey consist in opening the DDI-XML or Nesstar file The survey “branding” determines the overall look and feel of the CD The survey “type” determines the default metadata content Click the “Save” button to generate the HTML interface 3 After a few minutes, your CD Project is ready for publishing! 4

Key Features Content of CD pages is fully customizable A CD-ROM project can hold several surveys Branding customization Can be published to web Multilingual support Automatic updates …and more… Branding customization Can be published to web Multilingual support Automatic updates …and more…

Sample output

Handbook Handbook on the Documentation, Dissemination, and Preservation of MicrodataHandbook on the Documentation, Dissemination, and Preservation of Microdata –Part I: Policy, legal and ethical issues and recommendations. Benefits and costs of microdata dissemination –Part II: Technical guidelines: documenting, disseminating and preserving a dataset –Part III: Setting-up a central data archive

Benefits and Users (1) What will the toolkit improve?What will the toolkit improve? –Documentation (based on standards, guidelines and validation) –Preservation: data and metadata stay together, CD archiving –Cataloguing: facilitate metadata exchange –Dissemination: CD, DVD, Web –Quality: validation procedures, use of common language, adoption of best practices

Benefits and Users (2) Potential users?Potential users? –Survey producers at national level: preservation, dissemination, harmonize framework –International survey sponsors –Data archives Who will benefit?Who will benefit? –Data producers –National & International survey sponsors –Survey data repositories –Data analysts –Policy makers and population –DDI Community

Status & Availability Publisher 3.5Publisher 3.5 –Beta version available –Nesstar commercial release during the summer CD BuilderCD Builder –Beta version available –Public release expected in September (Open Source) GuidelinesGuidelines –Draft completed –Review over the summer

Next? Distribution, training and adoption of the toolkitDistribution, training and adoption of the toolkit User acceptance tests and pilot sitesUser acceptance tests and pilot sites Release of open source components (Sourceforge, DDI)Release of open source components (Sourceforge, DDI) Future developments:Future developments: –Translations in other languages –Plug-ins for Publisher and/or Reader (open source) –Availability of API library –Basic analytical functionalities (tabulation, graphs, etc.) –Evaluation of disclosure risks / anonymization procedures –Embed document in archive file (?) –Plan for DDI 3.0 support –Bug fixes / enhancements / new features (based on user feedback) –And more based on feedback from users, DDI & open source community Integration of other tools:Integration of other tools: –Argus [confidentiality] –CSPro [production] –Virtual Data Center (VDC) [web based dissemination] Strong collaboration and participation of the communityStrong collaboration and participation of the community

Thank you! Mark Diggory Olivier Dupriez Pascal Heus Jostein Ryssevik Harvard / MIT Data Center World Bank World bank Nesstar Ltd. QUESTION / ANSWER