Managing the Metadata Lifecycle The Future of DDI at GESIS and ICPSR Peter Granda, ICPSR Meinhard Moschner, GESIS Mary Vardigan, ICPSR Joachim Wackerow, GESIS Wolfgang Zenk-Möltgen, GESIS

Research Data Life Cycle
Stages: Concept, Collection, Processing, Distribution, Discovery, Analysis, Archiving, Repurposing

Current Uses of DDI
• DDI 2 is used for many different purposes by many different archival institutions, e.g., metadata records for data catalogs, export to Web-based information systems such as Nesstar, long-term preservation, and PDF codebooks
• GESIS and ICPSR are developing procedures and systems to extend the use of DDI in their institutions

DDI 3 Expands in Scope
• To date, use has been mainly limited to the Distribution and Archiving stages of the data life cycle
• DDI 3 enables use of new elements and structures to extend markup to other stages of the life cycle, both earlier and later
• Emphasis is on projects and tasks already in process at each institution

DDI 3 Use at GESIS
• Structured Comments – Processing
• Translation of EVS Questionnaire – Collection
• Supporting Enhanced Publications – Analysis
• Continuity Guides: Trends by Concepts – Concept, Discovery, Repurposing

Extracting structured information in current workflow
• Example: building derived variables with SPSS
• SPSS setups contain commands and comments
• Necessary steps for using SPSS setups as an information source for DDI
  – Improving comments for automated extraction: formalize layout, add keywords from a list
  – Extraction of structured comments and related commands by a custom tool
  – Transformation of this information into DDI 3 fragments

Extracting structured information in current workflow

***v* Variables/DerivedVariables
* DESCRIPTION
* This section is on derived variables;
***.
***v* DerivedVariables/w101_new
* NAME
* w101_new
* DESCRIPTION
* w101_new is a derived variable from w101;
* It has the original value from w101
* when w102 is equal 1
* otherwise it has the value 5;
* USED VARIABLES
* w101, w102
* SOURCE
**.
compute w101_new = 5.
if ( w102 = 1 ) w101_new = w101.
**
* VERSION
*
* AUTHOR
* Achim Wackerow
*
***.

[Diagram labels: SPSS, Result, Extractor, Report (HTML), DDI 3 fragments, GenerationInstruction (Description, Command)]
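
The extraction step for the structured comment above can be sketched in Python. This is a minimal sketch and not the custom tool used at GESIS: the function names `extract_blocks` and `to_fragment`, the keyword list, and the simplified XML layout are assumptions made for illustration. Only the comment layout and the element names GenerationInstruction, Description, and Command are taken from the slide, and the output is not schema-valid DDI 3.

```python
import re
import xml.etree.ElementTree as ET

# Structured SPSS comment block with the layout shown on the slide.
SETUP = """\
***v* DerivedVariables/w101_new
* NAME
* w101_new
* DESCRIPTION
* w101_new is a derived variable from w101;
* It has the original value from w101
* when w102 is equal 1
* otherwise it has the value 5;
* USED VARIABLES
* w101, w102
* SOURCE
**.
compute w101_new = 5.
if ( w102 = 1 ) w101_new = w101.
**
* VERSION
*
* AUTHOR
* Achim Wackerow
*
***.
"""

# Keywords recognised inside a structured comment (assumed list).
KEYWORDS = {"NAME", "DESCRIPTION", "USED VARIABLES", "SOURCE", "VERSION", "AUTHOR"}
HEADER = re.compile(r"^\*\*\*(\w)\*\s+(\S+)$")


def extract_blocks(text):
    """Return one dict per structured comment: id, keyword fields, SPSS commands."""
    blocks, current, field, in_source = [], None, None, False
    for raw in text.splitlines():
        line = raw.strip()
        match = HEADER.match(line)
        if match:                          # "***v* Module/name" opens a block
            current = {"id": match.group(2), "fields": {}, "commands": []}
            field, in_source = None, False
        elif current is None:
            continue                       # text outside any structured block
        elif line == "***.":               # closes the block
            blocks.append(current)
            current = None
        elif in_source:
            if line == "**":               # comment resumes after the commands
                in_source, field = False, None
            else:
                current["commands"].append(line)
        elif line == "**.":                # comment ends, SPSS commands follow
            in_source = True
        else:
            content = line.lstrip("*").strip()
            if content in KEYWORDS:
                field = content
                current["fields"].setdefault(field, [])
            elif content and field:
                current["fields"][field].append(content)
    return blocks


def to_fragment(block):
    """Build a simplified, non-schema-valid GenerationInstruction fragment."""
    instruction = ET.Element("GenerationInstruction", {"id": block["id"]})
    description = " ".join(block["fields"].get("DESCRIPTION", []))
    ET.SubElement(instruction, "Description").text = description
    ET.SubElement(instruction, "Command").text = "\n".join(block["commands"])
    return ET.tostring(instruction, encoding="unicode")


blocks = extract_blocks(SETUP)
print(to_fragment(blocks[0]))
```

Keeping the parser keyed on a fixed keyword list mirrors the slide's point that comments must first be formalized before automated extraction is reliable.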

Translation of EVS Questionnaire DSDM

Supporting Enhanced Publications
Publications with References to Data: DDI 3.1 URN
• URN contains: Agency, Object, Version
• Publication with References (URNs)
• Resolution (DDI Alliance):
  – find agency (gesis.de.ddi), return resolver address
  – find object, return URL
  – request document, return document
• Result: URL of Documentation and/or Data
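
The lookup sequence on this slide (find agency, find object, return URL) can be illustrated with a small Python sketch. Everything concrete here is an assumption: the colon-separated URN layout is a simplified stand-in rather than the DDI 3.1 URN syntax, the registries are in-memory dictionaries, the example.org addresses and the object id `Q1` are hypothetical, and using `gesis.de.ddi` as the agency key is only a guess at how the slide's label is used. The final "request document" step is left out so the sketch needs no network access.

```python
from typing import NamedTuple


class DdiUrn(NamedTuple):
    agency: str
    object_id: str
    version: str


def parse_urn(urn: str) -> DdiUrn:
    """Split a simplified URN of the form urn:ddi:<agency>:<object>:<version>."""
    prefix, scheme, agency, object_id, version = urn.split(":")
    if (prefix.lower(), scheme.lower()) != ("urn", "ddi"):
        raise ValueError(f"not a DDI URN: {urn}")
    return DdiUrn(agency, object_id, version)


# Step 1 registry (hypothetical): agency -> address of that agency's resolver.
AGENCY_REGISTRY = {
    "gesis.de.ddi": "https://resolver.example.org/",
}

# Step 2 registry (hypothetical): per resolver, (object, version) -> URL.
OBJECT_REGISTRY = {
    "https://resolver.example.org/": {
        ("Q1", "1.0.0"): "https://data.example.org/docs/Q1/1.0.0",
    },
}


def resolve(urn: str) -> str:
    """Return the URL of the documentation or data that the URN identifies."""
    parsed = parse_urn(urn)
    resolver = AGENCY_REGISTRY[parsed.agency]          # find agency
    objects = OBJECT_REGISTRY[resolver]
    return objects[(parsed.object_id, parsed.version)]  # find object


print(resolve("urn:ddi:gesis.de.ddi:Q1:1.0.0"))
```

Splitting the lookup into an agency registry and a per-agency object registry keeps each archive in control of where its own objects live, which is the point of the two-step resolution shown on the slide.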

DSDM DDI 3 EPE Simple Export Wizard 1.2.0

Grouping Trends
• Continuity guides in different contexts
  – Synoptical question / variable lists
  – Documentation of changes in question wording / answer scales
• Systematic organization by conceptual categories
  – CodebookExplorer tool (relational DB)
  – Publication as HTML links on variable level in ZACAT
• Taking advantage of DDI 3 in the future
  – Defining the standard and comparison
  – Qualifying relations (e.g. q-text modified, scale modified, …)

Continuity guides
• Literal question text over time
• Conceptual categories
• Deviations in answer categories

Trends by concepts
• Conceptual categories
• Trend variables by study
• Country 1, Country 2

[Diagram: comparing study units against an ex-post standard in DDI 3]
• STUDY UNIT 1 … n: DataCollection (question text "Have you …?"), LogicalProduct (category "often", Cat1, value 4)
• GROUP STUDY UNIT 8-14: DataCollection, LogicalProduct
• GROUP STUDY UNIT 15-x: DataCollection, LogicalProduct
• DDI 3 RESOURCE "Ex-post Standard": Universe, Concept, Data Collection (question text "Do you …?", CODS1), Logical Product (category "often", CATS1, Cat1, value 1)
• Comparison map: Equivalency, Relationship, Description
• Qualified relations: Question text «modified»; Values «different», «generation instruction», «scale reversed»; Label «identical»
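
The idea of qualifying relations between a study variable and an ex-post standard can be sketched as a small Python data structure. This is an illustrative sketch only: the `Variable` class, the `qualify` function, and the sample categories are assumptions, and the result is a plain dictionary rather than a DDI 3 comparison map. The relation names (identical, modified, scale reversed, different) follow the labels in the diagram.

```python
from dataclasses import dataclass


@dataclass
class Variable:
    name: str
    question_text: str
    label: str
    categories: dict  # code value -> category label


def is_reversed(a: dict, b: dict) -> bool:
    """True if both scales use the same codes but the labels run in opposite order."""
    if sorted(a) != sorted(b):
        return False
    codes = sorted(a)
    return [a[c] for c in codes] == [b[c] for c in reversed(codes)]


def qualify(standard: Variable, other: Variable) -> dict:
    """Describe how a study variable relates to the ex-post standard."""
    if standard.categories == other.categories:
        values = "identical"
    elif is_reversed(standard.categories, other.categories):
        values = "scale reversed"
    else:
        values = "different"
    return {
        "source": other.name,
        "target": standard.name,
        "question_text": "identical"
        if standard.question_text == other.question_text
        else "modified",
        "values": values,
        "label": "identical" if standard.label == other.label else "modified",
    }


# Hypothetical example data: same label, reworded question, reversed scale.
standard = Variable("std_v1", "Do you ...?", "Frequency",
                    {1: "often", 2: "sometimes", 3: "never"})
study = Variable("s1_v7", "Have you ...?", "Frequency",
                 {1: "never", 2: "sometimes", 3: "often"})

print(qualify(standard, study))
```

Recording the relation per aspect (question text, values, label) rather than as a single flag lets a continuity guide state exactly what changed between waves.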

DDI 3 Use at ICPSR
• Information collected from data producers in pre-collection phase – Concept
• Metadata output from CAI applications – Data Collection
• Processor's dashboard – Data Processing
• Metadata mining: new faceted search tool to facilitate discovery through more precise searching – Data Discovery
• Relational database for comparison and harmonization across studies – Repurposing

SMDS Metadata Modules

DDI as backbone for structured metadata
[Diagram labels: Concept, Collection, Processing, Distribution, Discovery, Analysis, Repurposing; SIP, AIP, DIP; OAIS Archive; CAI tools (MQDS etc.); information extracted from SPSS etc.; custom tools (e.g. forms-based); statistical packages; online analysis; search engines; distribution packages; Web information system; data / documents outside of DDI]
• A combination of this information forms a traditional SIP.
• Information from each life cycle stage, sent to the archive, can be understood as a dynamic SIP.
• Self-archiving by web forms can be offered for the different stages.
• The structured metadata combined with data forms the core of the archive.
• It would be organised in a way where metadata can be reused and information can be ingested and distributed in a dynamic way.
• An AIP must be specially built, because the metadata can include just references to other reused metadata.
• An AIP should include everything of one study; DDI can also be the main structure of the AIP. Data can be inline in DDI. An AIP would exist beside the core structure in the archive.
• An easy roundtrip should be possible between the core structure and the AIP.
• The purpose of the AIP is comparable to PDF/A, where all fonts are included.
• The core structure is geared toward efficient processing and reuse of metadata.
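
The relationship between the reference-based core structure and a self-contained AIP can be sketched in Python. This is a minimal sketch under stated assumptions: the core is modelled as a dictionary of items that point to each other through a hypothetical `refs` list, and `build_aip` and `ingest_aip` are illustrative names, not part of DDI or OAIS. The sketch shows why an AIP has to be specially built (references must be followed and the targets included) and what a roundtrip back into a core structure could look like.

```python
import copy

# Hypothetical core structure: items are stored once and reused by reference.
CORE = {
    "study1": {"type": "StudyUnit", "refs": ["q_std", "v1"]},
    "study2": {"type": "StudyUnit", "refs": ["q_std", "v2"]},
    "q_std": {"type": "Question", "refs": ["concept1"]},
    "concept1": {"type": "Concept", "refs": []},
    "v1": {"type": "Variable", "refs": ["q_std"]},
    "v2": {"type": "Variable", "refs": ["q_std"]},
}


def build_aip(core: dict, study_id: str) -> dict:
    """Copy a study and everything it references, directly or indirectly."""
    aip = {}
    pending = [study_id]
    while pending:
        item_id = pending.pop()
        if item_id in aip:
            continue  # already included, also guards against reference cycles
        aip[item_id] = copy.deepcopy(core[item_id])
        pending.extend(core[item_id]["refs"])
    return aip


def ingest_aip(core: dict, aip: dict) -> None:
    """Merge an AIP back into a core structure, keeping items already present."""
    for item_id, item in aip.items():
        core.setdefault(item_id, copy.deepcopy(item))


aip = build_aip(CORE, "study1")
print(sorted(aip))

# Roundtrip: ingest the AIP into an empty core and rebuild the same AIP.
restored_core = {}
ingest_aip(restored_core, aip)
print(build_aip(restored_core, "study1") == aip)
```

Like a PDF/A file that embeds its fonts, the AIP carries copies of shared items such as the standard question, while the core keeps a single copy that several studies reference.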

DDI-based archive as collection of reusable components
• Metadata in DDI is structured in small items which can be identified and maintained by one or more institutions
• These parts can be
  – the basis for comparison and metadata mining (discovery of new relationships)
  – a candidate for reuse in other studies or new studies (like standard questions or variables)
[Diagram: existing studies (labeled Study 1), each with study-specific information and items for reuse; a new study; a repository of reusable components]
• Repository of reusable components: standard concepts, standard questions, standard variables, harmonized information, controlled vocabularies

Issues for Discussion
• Advantages and disadvantages of seeking to capture additional metadata throughout the data life cycle
• How much information to make available to funding agencies, data producers, and secondary users?
• Rules for structured documentation and delivery of items to archives for preservation
• An overall DDI tool to capture and curate all metadata and data – the Holy Grail?