DDI for the Uninitiated

Slides:



Advertisements
Similar presentations
DDI 101 Presented to the : Ontario DLI training session Queens Kingston, Ontario Presented to the : Ontario DLI training session Queens Kingston, Ontario.
Advertisements

DDI for the Uninitiated ACCOLEDS /DLI Training: December 2003 Ernie Boyko Statistics Canada Chuck Humphrey University of Alberta.
DLI Training Nesstar Workshop
Data Documentation Initiative (DDI) Workshop Carol Perry Ernie Boyko April 2005 Kingston Ontario.
Metadata Management at GESIS-ZA Reiner Mauer GESIS – Data Archive and Data Analysis CESSDA-Expert Seminar Odense, September 11th 2008.
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Metadata at ICPSR Sanda Ionescu, ICPSR.
Making the Case for Metadata at SRS-NSF National Science Foundation Division of Science Resources Statistics Jeri Mulrow, Geetha Srinivasarao, and John.
Implementation of the DDI at the Roper Center A Pilot Project on Resource Integration Marc Maynard and Hui Wang The Roper Center.
INTER-UNIVERSITY CONSORTIUM FOR POLITICAL AND SOCIAL RESEARCH Social Science Data and Resources for Researchers Converting Legacy Documentation to DDI:
NESSTAR - the data archive perspective by Margaret Ward UK Data Archive.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
NESSTAR Limitedw w w. n e s s t a r. c o m DDI-Publishing Made Easy- the Nesstar Way Jostein Ryssevik Nesstar Ltd.
Data Processing A simple model and current UKDA practice Alasdair Crockett, Data Standards Manager, UKDA.
Linux Operations and Administration
Introduction to Geospatial Metadata – FGDC CSDGM National Coastal Data Development Center A division of the National Oceanographic Data Center Please .
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
ISO as the metadata standard for Statistics South Africa
Data Documentation Initiative (DDI): Goals and Benefits Mary Vardigan Director, DDI Alliance.
Copyright © 2006, SAS Institute Inc. All rights reserved. Enterprise Guide 4.2 : A Primer SHRUG : Spring 2010 Presented by: Josée Ranger-Lacroix SAS Institute.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
Chuck Humphrey Data Library Co-ordinator University of Alberta May 16, Capitalising on Metadata Tool development plans IASSIST 2007.
DLI Training April 2004 Kingston Ontario. DDI What, Why, How?
Introduction to Metadata, the DDI and the Metadata Editor Presentation to the SERPent project team by Margaret Ward 3 March 2010.
Data Management Console Synonym Editor
United Nations Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September, 2011 Documentation and Cataloguing in Data.
Colectica: A Platform for DDI 3 based Metadata Management Design. Collect. Share.
Introduction to HTML Simple facts yet crucial to beginning of study in fundamentals of web page design!
FORSbase SEEDS meeting May 5 th, 2015, Lausanne Bojana Tasic.
Lesson 10—Networking BASICS1 Networking BASICS The Internet and Its Tools Unit 3 Lesson 10.
Chapter 29. Copyright 2003, Paradigm Publishing Inc. CHAPTER 29 BACKNEXTEND 29-2 LINKS TO OBJECTIVES Attach an XML Schema Attach an XML Schema Load XML.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
Beyond HTML: Extensible Markup Language (XML)
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Section 2.1 Section 2.2 Identify hardware
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
Chapter 1 Introduction to HTML
DLI Website.
XML QUESTIONS AND ANSWERS
Leveraging BI in SharePoint with PowerPivot and Power View
Markup of Educational Content
Core LIMS Training: Advanced Administration
Distributed web based systems
Product Retrieval Statistics Canada / Statistique Canada Title page
Michigan Questionnaire Documentation System (MQDS)
Using Access and the Web
Microsoft Office Illustrated
Exploring Microsoft Office 2013 Word Comprehensive
CFS Community Day Core Flight System Command and Data Dictionary Utility December 4, 2017 NASA JSC/Kevin McCluney December 4, 2017.
A Brief Introduction to the Internet
What’s New in Colectica 5.3 Part 1
ICPSR: Resources for Instructors Finding and Analyzing Data 9/26/2012
ICPSR Tools for the Metadata Portal
What’s New in Colectica 5.3 Part 2
الجهاز المركزي للإحصاء الفلسطيني
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Introduction to HTML Simple facts yet crucial to beginning of study in fundamentals of web page design!
SharePoint 2010 – SharePoint 101
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Tutorial 7 – Integrating Access With the Web and With Other Programs
CSE591: Data Mining by H. Liu
Implementing DDI in a Survey Organisation
Capitalising on Metadata
Intro Project Introduction to HTML.
Data Liberation Initiative (DLI)
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Exploring the DLI Product line
Palestinian Central Bureau of Statistics
Presentation transcript:

DDI for the Uninitiated Ernie Boyko Statistics Canada Chuck Humphrey University of Alberta Data Management Issues Greetings: Good afternoon everyone, glad to be with you for this discussion. ACCOLEDS /DLI Training: December 2003

Cataloguing Experiences How many have catalogued using MARC Dublin Core

Cataloguing Experiences Objectives of cataloguing Inventory control Location tool Access Distribution

Enter DDI Documentation in a standardized mark-up language Data Documentation Initiative (DDI) http://www.icpsr.umich.edu/DDI/ The Data Documentation Initiative is a standard coming out of the IASSIST community and housed with the ICPSR. The DDI committee has produced what is known as a Document Type Definition (DTD) for "markup" of data documentation, which have been called in the past as codebooks by some. The DTD employs the eXtensible Markup Language (XML), which is a dialect of a more general markup language, SGML.

The DDI project is housed at the ICPSR, which contains the detailed description for the DDI on its web site.

The DDI DTD is composed of five sections, each having its own set of tags and tag attributes. The five sections are the document description, study description, data files description, variable description and other related materials.

An Example American Public Opinion and U.S. Foreign Policy, 1994 http://www.icpsr.umich.edu/DDI/samples/index.html http://www.icpsr.umich.edu:8080/DDI/SAMPLES/06561.xml http://datalib.library.ualberta.ca/accoleds/workshops/index.html

XML-DDI Benefits The display of data documentation through a variety of style sheets; Input for further processing, such as creating statistical package command files, conducting advanced searches, comparing variables across data files, driving data extraction engines, etc. The benefits of a standard like XML DDI is its use to structure the content of data documentation for both the purposes of display but also as input for a variety of processing, including creating statistical package command files, conducting advanced searching, comparing variables across data files, and driving data extraction engines. The possibilities become quite large when the structured data documentation can be subsequently processed on the basis of the tags of the DTD. I was just thinking about the possibility of visualizing skip patterns in the original questionnaire based on the tag structure in a DDI compliant document.

Data Documentation There is a need for comprehensive data documentation that allows easily Finding variables By subject groupings By keywords, phrases or terms By response categories (value labels) Through linkages from the questionnaire Secondly, the better and more comprehensive the data documentation, the easier it is to discover the variables needed by researchers both for their analysis and for extracting case subsets. We need to simplified ways both to find variables and to trace variables to their origin. It has been interesting watching the reactions of users to the utility distributed with the Survey of Labour and Income Dynamics called SLIDRET. This small dbase application allows finding and extracting variables from the huge number that exists. Simply having the variables organized by subject groupings has been helpful. While this utility has been helpful with SLID, I see another approach as providing a more versatile platform for organizing data documentation, which I’ll mention shortly.

Data Documentation There is a need for comprehensive data documentation that allows easily Tracing variables back to their origins To a question To a response category for a multiple response item To the variables from which it was computed for a derived variable. Discovery of variables can flow from the list of variables back to the survey instrument just as easily as flowing from the questionnaire to the variable list. Knowing the context from which a variable originates is important information. Does a variable come from a question that occurs within a skip pattern? Is the variable part of a multiple response question? If the variable was derived, from which variables was it computed?

Data Documentation There is a need for comprehensive data documentation that allows easily Understanding the corrections that must be made because of the sampling methodology Also, clear instructions about the steps that must be taken to correct for the sampling procedure is critical.

What’s next? Let’s assume we have <ddi> compliant files … so what’s next? What are the choices? Also, clear instructions about the steps that must be taken to correct for the sampling procedure is critical.

General Choices Feed your own system (input from a structured file) Look at systems using <ddi> files directly Wait for SAS, SPSS, etc. to become XML enabled Wait and see Also, clear instructions about the steps that must be taken to correct for the sampling procedure is critical.

Projects Using DDI NESSTAR Health Canada -- DAIS SDA, Berkeley ICPSR’s metadata University of Minnesota US Census Bureau Harvard Virtual Data Center Also, clear instructions about the steps that must be taken to correct for the sampling procedure is critical.

Global Access, Local Support Data users NESSTAR Central Server Data Producers

Data Observatory Workbench Text Journal articles User guides Methodology instructions Tools Finding and sorting Browsing Analysing Publishing Hyperlinks Data Survey Indicators Administrative Geographical People Email Conferences Experts Discussion lists

Data Sharing - The NESSTAR Way (in 3 Steps) Prepare your data using the Nesstar Publisher Microdata in SPSS, SAS, Stata, Statistica, ascii or other formats Table- or aggregated data in Excel, Ascii or other formats Documentation/metadata in various text-formats, including XML Data or metadata sitting in relational databases Import Import data and metadata from a variety of formats Cut and paste additional metadata from external sources Use templates to enforce structure and local ”best practice” Organize your variables in groups and sub-groups Add local controlled vocabularies or thesauri Validate your data/metadata against the DDI and your local ”best practice” Output DDI-instances and/or publish to a Nesstar server

Data Sharing - The NESSTAR Way (in 3 Steps) – (cont’d) Publish your data to a Nesstar server Publish over the Web or a local area network (LAN) Organize your data in folders and sub-folders Define the access conditions of your data Customize the user-interface to your data Publish Data Store

3. Share and explore your data through a variety of interfaces Data Sharing - The NESSTAR Way (in 3 Steps) – (cont’d) 3. Share and explore your data through a variety of interfaces Nesstar Explorer – a feature rich data browser (Java application) Nesstar light – the standard web-browser interface to Nesstar resources and services Choose between a variety of customized interfaces Develop your own customized interface or integrate Nesstar services in an existing web-application Access Data Store

Demo URL: http://nesstar1-4.essex.ac.uk/nesstarlight/

Where do we go from here? Need to start producing <ddi> files Need to create incentives for survey managers to create <ddi> files Need to work cooperatively to convert legacy files

What’s ACCOLEDS’ role?