Trials and Tribulations of Creating DDI Codebooks at the University of Guelph. A. Michelle Edwards and Carol Perry, Data Resource Centre, University of Guelph.

Presentation transcript:

Trials and Tribulations of Creating DDI Codebooks at the University of Guelph. A. Michelle Edwards and Carol Perry, Data Resource Centre, University of Guelph, Guelph, Ontario

Current Search Function

Search Results

Current Documentation

Identifying Variables

Rationale for Change 522 datasets to date. No comprehensive metadata search function. No current variable search within dataset. Limits researcher’s autonomy.

XML tags Started with approx. 30 or so tags… As of June 5, 59 tags are filled. Information contained inside tags

Codebook Templates Used Maddie to develop initial template. Edited the template to add tags as required. Filled in fields common to all codebooks.

Codebook Templates Statistics Canada data ICPSR data B2020 data format

Statistics Canada Codebook

Differences between Codebook Templates Authoring entity. Distributor (DLI vs. ICPSR). Licenses. Other material – ICPSR abstract link. B2020: no direct link to database; no variables.

How do we move our information from an HTML readme file to an XML file???

Readme to XML Document Description Study Description Data Files Description

Readme to XML Currently – copy and paste information from the Readme (HTML) file into the XML Codebook. Script – extracts metadata from the HTML and places it into the XML. Same amount of time.
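
The scripted route from Readme to XML can be sketched roughly as follows. This is a minimal illustration, not the Data Resource Centre's actual script: it assumes a hypothetical readme layout in which each field is a `<dt>`/`<dd>` pair, and it fills only two DDI Codebook elements (`titl` and `abstract` under `stdyDscr`). The field labels "Title" and "Abstract" and the sample text are invented for the example.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class ReadmeFields(HTMLParser):
    """Collect <dt>label</dt><dd>value</dd> pairs (hypothetical readme layout)."""

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._tag = None
        self._label = None

    def handle_starttag(self, tag, attrs):
        if tag in ("dt", "dd"):
            self._tag = tag

    def handle_endtag(self, tag):
        if tag in ("dt", "dd"):
            self._tag = None

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self._tag == "dt":
            # Normalise "Title:" to the key "title".
            self._label = text.rstrip(":").lower()
        elif self._tag == "dd" and self._label:
            previous = self.fields.get(self._label, "")
            self.fields[self._label] = (previous + " " + text).strip()


def readme_to_codebook(html_text):
    """Build a skeleton DDI codebook from the fields found in the readme."""
    parser = ReadmeFields()
    parser.feed(html_text)
    fields = parser.fields

    root = ET.Element("codeBook")
    stdy = ET.SubElement(root, "stdyDscr")
    citation = ET.SubElement(stdy, "citation")
    titl_stmt = ET.SubElement(citation, "titlStmt")
    ET.SubElement(titl_stmt, "titl").text = fields.get("title", "")
    info = ET.SubElement(stdy, "stdyInfo")
    ET.SubElement(info, "abstract").text = fields.get("abstract", "")
    return root


# Invented sample readme, for illustration only.
SAMPLE = """<html><body><dl>
<dt>Title:</dt><dd>Sun Exposure Survey</dd>
<dt>Abstract:</dt><dd>Example abstract text.</dd>
</dl></body></html>"""

codebook = readme_to_codebook(SAMPLE)
print(ET.tostring(codebook, encoding="unicode"))
```

A script like this only saves time when the readme files share one layout; any readme that departs from it still needs manual copy and paste.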

Variable Information

Sources of Variable information Variable names, labels, and position from the SAS program. Frequencies for each variable value from SAS output.

Variable Information Sources of Variable information Literal questions from questionnaires if available.

Variable Information Script: Looks into the SAS program – pulls out the variable names, labels and positions. Looks into a SAS output file for frequencies and variable value labels.

Variable Information Script: If questionnaire is available – seeks out questions and matches with variables.

Variable Information Problems with Script: SAS programs must be consistent in their format. SAS output and questionnaires – matching variables.
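
To make the consistency problem concrete, here is a hedged sketch of the kind of extraction described above. It assumes one particular SAS layout (column-style `input` with `start-end` positions and a `label` statement with double-quoted labels); the sample program, dataset name and variables are invented. A program written in any other style would not match these patterns, which is exactly why the SAS programs must be consistent.

```python
import re
import xml.etree.ElementTree as ET

# Invented sample program in the one layout this sketch understands.
SAS_PROGRAM = """
data ses;
  infile 'ses96.dat';
  input AGE 1-2 SEX 3 PROV $ 4-5;
  label AGE = "Age of respondent"
        SEX = "Sex of respondent"
        PROV = "Province of residence";
run;
"""


def statement_body(program, keyword):
    """Return the text between `keyword` and the next semicolon."""
    match = re.search(r"\b" + keyword + r"\b(.*?);", program,
                      re.IGNORECASE | re.DOTALL)
    return match.group(1) if match else ""


def parse_variables(program):
    """Pull variable names, column positions and labels out of the program."""
    labels = dict(re.findall(r'([A-Za-z_]\w*)\s*=\s*"([^"]*)"',
                             statement_body(program, "label")))
    pattern = r"([A-Za-z_]\w*)\s+(?:\$\s*)?(\d+)(?:\s*-\s*(\d+))?"
    variables = []
    for name, start, end in re.findall(pattern, statement_body(program, "input")):
        variables.append({
            "name": name,
            "start": int(start),
            "end": int(end or start),  # single-column variables have no end
            "label": labels.get(name, ""),
        })
    return variables


def to_ddi(variables):
    """Write the variables as DDI Codebook <var> elements under <dataDscr>."""
    data_dscr = ET.Element("dataDscr")
    for v in variables:
        var = ET.SubElement(data_dscr, "var", {"name": v["name"]})
        ET.SubElement(var, "location",
                      {"StartPos": str(v["start"]), "EndPos": str(v["end"])})
        ET.SubElement(var, "labl").text = v["label"]
    return data_dscr


variables = parse_variables(SAS_PROGRAM)
data_dscr = to_ddi(variables)
print(ET.tostring(data_dscr, encoding="unicode"))
```

Frequencies and literal questions would need their own parsers and a matching step keyed on the variable name, which is where the matching problems noted above arise.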

SAS to XML SAS XML engine and ODS XML. Can create XML SAS output. Variable names, labels, value labels, and frequencies. Variable positions with the input statement and Proc Print to XML.

SAS to XML Frequency Output

SAS to XML Proc print output

SAS to XML

Advantages: SAS programs do not need to be consistent. Use one program from start to finish – SAS. Still in development.

XML to Viewable Document Saxon – to render our XML documents to HTML using XSL Stylesheets. XSL – pull out info from XML document and display with HTML tags.
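
The rendering step (pull information out of the XML document and wrap it in HTML tags) can be illustrated without an XSLT processor. The sketch below is a stand-in written with Python's standard library; it is not XSLT and does not use Saxon. It only shows the same kind of mapping an XSL stylesheet would express as templates. The codebook fragment is invented.

```python
import xml.etree.ElementTree as ET
from html import escape

# Invented codebook fragment using DDI Codebook element names.
CODEBOOK = """<codeBook>
  <stdyDscr><citation><titlStmt>
    <titl>Sun Exposure Survey</titl>
  </titlStmt></citation></stdyDscr>
  <dataDscr>
    <var name="AGE"><labl>Age of respondent</labl></var>
    <var name="SEX"><labl>Sex of respondent</labl></var>
  </dataDscr>
</codeBook>"""


def render_variables(codebook_xml):
    """Return an HTML fragment: study title plus a name/label table."""
    root = ET.fromstring(codebook_xml)
    title = root.findtext("stdyDscr/citation/titlStmt/titl", default="").strip()
    rows = []
    for var in root.iter("var"):
        rows.append("<tr><td>%s</td><td>%s</td></tr>" % (
            escape(var.get("name", "")),
            escape(var.findtext("labl", default="")),
        ))
    return "<h1>%s</h1>\n<table>\n%s\n</table>" % (escape(title), "\n".join(rows))


html_page = render_variables(CODEBOOK)
print(html_page)
```

In an XSL stylesheet, the loop over `var` elements would correspond to a template matching `var`, and one stylesheet per codebook section would produce one HTML component each.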

XSL Templates Set for each: Statistics Canada, ICPSR, B2020. Initial templates from University of Virginia samples.

XSL Templates Abstract Study Info Methodology & File Dimensions Questions Variables & Frequencies Other Documents

XSL Stylesheets

Search Uses SAS IntrNet to call and run the UNIX SGREP search. Creates an XML file with results. Calls Saxon to render the file with the Variable XSL Stylesheet.
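
The search flow (find matching variables, write the hits to an XML file, hand that file to a stylesheet) can be sketched as below. This is a simplified stand-in in Python, not SGREP or SAS/IntrNet: it does a case-insensitive substring match on variable names and labels. The `results` and `hit` element names, the dataset identifiers and the codebook fragments are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Invented codebooks keyed by a hypothetical dataset identifier.
CODEBOOKS = {
    "ses96": """<codeBook><dataDscr>
        <var name="SUNBURN"><labl>Number of sunburns last summer</labl></var>
        <var name="AGE"><labl>Age of respondent</labl></var>
    </dataDscr></codeBook>""",
    "other": """<codeBook><dataDscr>
        <var name="INCOME"><labl>Household income</labl></var>
    </dataDscr></codeBook>""",
}


def search_variables(codebooks, term):
    """Return a <results> element with one <hit> per matching variable."""
    needle = term.lower()
    results = ET.Element("results", {"query": term})
    for dataset_id, xml_text in codebooks.items():
        root = ET.fromstring(xml_text)
        for var in root.iter("var"):
            name = var.get("name", "")
            label = var.findtext("labl", default="")
            if needle in name.lower() or needle in label.lower():
                hit = ET.SubElement(results, "hit",
                                    {"dataset": dataset_id, "name": name})
                hit.text = label
    return results


results = search_variables(CODEBOOKS, "sunburn")
print(ET.tostring(results, encoding="unicode"))
```

The resulting XML would then be passed to the rendering step, in the same way the slides describe calling Saxon with the Variable XSL Stylesheet.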

“Final Product” Frames to put it all together. Links to each component (abstract, etc.). Returns the rendered HTML on the fly.

“Final Product”

Sun Exposure Survey S/XML/SES2/ses96cbk.html

“Finished Product” 522 datasets to date. 35 Completed DDI-compliant codebooks. Fall completion ???

“Final Product”