3rd International Digital Curation Conference Washington, DC, Dec 2007 Paper Presentations: Interoperability, Metadata & Standards Data Documentation Initiative:

Presentation transcript:

3rd International Digital Curation Conference, Washington, DC, Dec 2007
Paper Presentations: Interoperability, Metadata & Standards
Data Documentation Initiative: Toward a Standard for the Social Sciences
Mary Vardigan, Pascal Heus, Wendy Thomas
ICPSR/University of Michigan / Open Data Foundation / Minnesota Population Center

DDI Alliance – What is Metadata?
Common definition: Data about Data
Unlabeled stuff / Labeled stuff
The bean example is taken from: A Manager's Introduction to Adobe eXtensible Metadata Platform

DDI Alliance – Managing data and metadata is challenging!
"We are in charge of the data. We support our users but also need to protect our respondents!"
"We want easy access to high quality and well documented data!"
"We need to collect the information from the producers, preserve it, and provide access to our users!"
Producers, Librarians, Users
General Public, Policy Makers, Sponsors, Media/Press, Academic, Business, Government
We have an information management problem

DDI Alliance – Metadata issues
Without producer / archive metadata
  – Researchers can't discover data or perform efficient analysis
Without researcher metadata
  – The research process is not documented and cannot be reproduced (Gary King replication standard!)
  – Other researchers are not aware of what has been done (duplication / lack of visibility)
  – Producers don't know about data usage and quality issues
Without standards
  – Such information can't be properly managed and exchanged between actors or with the public
Without tools
  – We can't capture, preserve or share knowledge

DDI Alliance – XML to the rescue!
XML stands for eXtensible Markup Language
Technology that is driving today's web service oriented architecture of the Internet and intranets
Using XML, we can capture, structure, transform, discover, exchange, query, edit and secure metadata and data
XML is platform and language independent and can be used by everyone
XML is both machine and human readable
XML is non-proprietary and public domain, and many open tools exist
Domain specific standards are available!
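
To make the "capture, structure, exchange, query" claim concrete, here is a minimal sketch using Python's standard `xml.etree.ElementTree` module. The element and attribute names (`codebook`, `variable`, `label`, `study`, `name`) and the sample study are hypothetical simplifications for illustration only; they are not taken from the DDI schema or from the slides.

```python
import xml.etree.ElementTree as ET

# Capture and structure: build a tiny metadata document.
# Element names here are hypothetical, NOT the actual DDI schema.
codebook = ET.Element("codebook", {"study": "Example Household Survey"})
for name, label in [("hhsize", "Household size"),
                    ("region", "Region of residence")]:
    var = ET.SubElement(codebook, "variable", {"name": name})
    ET.SubElement(var, "label").text = label

# Exchange: serialize to text that both people and programs can read.
xml_text = ET.tostring(codebook, encoding="unicode")

# Query: parse the text back and look up a variable label by name.
parsed = ET.fromstring(xml_text)
region_label = parsed.find("./variable[@name='region']/label").text
variable_names = [v.get("name") for v in parsed.iter("variable")]

print(xml_text)
print(region_label, variable_names)
```

The same text could be handed to any other XML-aware tool on any platform, which is the platform and language independence the slide refers to.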

DDI Alliance – Suggested XML metadata specifications for socio-economic data
Statistical Data and Metadata Exchange (SDMX)
  – Macrodata, time series, indicators, registries
Data Documentation Initiative (DDI)
  – Microdata (surveys, studies)
ISO
  – Semantic modeling, concepts, registries
ISO
  – Geography
Dublin Core
  – Resources (documentation, images, multimedia)

DDI Alliance – The Data Documentation Initiative (DDI)
International XML based specification for the documentation of social and behavioral data
  – Started in 1995, now driven by the DDI Alliance (30+ members)
  – Became an XML specification in 2000 (v1.0)
  – Current version is 2.1, with a focus on archiving (survey/codebook)
New Version 3.0 (2008)
  – Focuses on the entire survey life cycle
  – Provides comprehensive metadata on the entire survey process and usage
  – Aligned with other metadata standards (DC, MARC, ISO 11179, SDMX, …)
  – Includes machine actionable elements to facilitate processing, discovery and analysis
DDI is being adopted by producers/archives but needs to extend to the researchers (who are using the data!)

DDI Alliance – DDI 3.0 and the Survey Life Cycle
A survey is not a static process: it evolves dynamically over time and involves many agencies/individuals
DDI 2.x is about archiving; DDI 3.0 covers the entire life cycle
3.0 focuses on metadata reuse (minimizes redundancies/discrepancies, supports comparison)
Also supports multilingual content, grouping, geography, and other features
3.0 is extensible

DDI Alliance – Metadata Components
Producer metadata
  – Codebook, questionnaires, reports, methodologies, processing, scripts, quality, admin, etc.
Research metadata
  – Recodes, analysis, tables, scripts, papers, logs, data quality, usage
  – Citations, references
  – Activities, discussions, knowledge base
Outputs
  – Papers, presentations, tables, reports

DDI Alliance – When to capture metadata?
Metadata must be captured at the time the event occurs! (not after the fact)
Documenting after the fact leads to considerable loss of information
This is true for producers and researchers
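
One way to picture "capture at the time the event occurs" is to have the operation itself write its own documentation. The sketch below is an illustration under assumptions: the `recode` helper, the in-memory `recode_log`, and the variable names are all hypothetical and are not part of DDI or of any tool named in the slides.

```python
from datetime import datetime, timezone

# Hypothetical in-memory log; a real project would persist this somewhere.
recode_log = []

def recode(values, mapping, source_var, target_var, note):
    """Apply a recode and record what was done at the moment it happens."""
    # Values without an entry in the mapping are passed through unchanged.
    recoded = [mapping.get(v, v) for v in values]
    recode_log.append({
        "when": datetime.now(timezone.utc).isoformat(),
        "source": source_var,
        "target": target_var,
        "mapping": dict(mapping),
        "note": note,
    })
    return recoded

age_group = recode(
    [1, 2, 3, 2],
    {1: "young", 2: "adult", 3: "senior"},
    source_var="agecat",
    target_var="age_group",
    note="Label age categories for tables",
)
print(age_group)
print(recode_log[0]["source"], "->", recode_log[0]["target"])
```

Because the log entry is written inside the function, the recode cannot happen without its description being recorded, so nothing has to be reconstructed later.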

DDI Alliance – Solutions?
Simple solutions: use good practices
  – File and variable naming conventions (metadata in names!), sound statistical methods
  – Comment source code
  – Document your work
Adopt DDI and other standards-based metadata solutions
  – DDI tools, citation database, source code level metadata capture, variable recodes, table disclosure, data quality feedback, comparability
Take advantage of web-based collaborative tools
  – Wikis, blogs, discussion groups, lists
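
The "metadata in names" practice can be sketched as follows. The naming convention shown (`<survey>_<country>_<year>_v<version>.<ext>`), the `parse_name` helper, and the sample file names are assumptions made up for this example; the slides do not prescribe a particular convention.

```python
import re

# Hypothetical convention: <survey>_<3-letter country>_<year>_v<version>.<ext>
PATTERN = re.compile(
    r"^(?P<survey>[a-z]+)_(?P<country>[A-Z]{3})_(?P<year>\d{4})"
    r"_v(?P<version>\d+)\.(?P<ext>\w+)$"
)

def parse_name(filename):
    """Return the metadata encoded in a file name, or None if it does not follow the convention."""
    match = PATTERN.match(filename)
    if match is None:
        return None
    info = match.groupdict()
    info["year"] = int(info["year"])
    info["version"] = int(info["version"])
    return info

print(parse_name("lsms_UGA_2005_v2.csv"))
print(parse_name("final data.csv"))
```

A name that follows the convention yields structured metadata without opening the file, while a name such as `final data.csv` carries none and is rejected.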

DDI Alliance – Benefits
Comprehensive data documentation
  – Through good metadata practices, comprehensive documentation captured by producers, librarians and users is available to ALL researchers
Preservation, integration and sharing of knowledge
  – The research process is captured and preserved in standard formats
  – Research knowledge becomes an integral part of the survey and is available to all
  – Reduces duplication of effort and facilitates reuse
  – Producers get feedback from the data users (usage, quality issues), which leads to better and more relevant data
Research outputs and dissemination
  – Facilitates production of research outputs
  – Facilitates dissemination and fosters broader visibility of research results

DDI Alliance – Conclusions
Metadata is a crucial component of social and behavioral science
The Data Documentation Initiative (DDI) is a globally accepted specification for capturing microdata documentation and knowledge
The latest version, 3.0, extends into the entire survey life cycle
Producers and data archives are rapidly adopting metadata standards. This adoption process should extend into the research community
Best practices in data and metadata management benefit all users and have the potential to change the way we conduct research