StatCat: Building a Statistical Data Finder
ssrs.yale.edu/statcat
Steven Citron-Pousty, Ann Green, Julie Linden
Yale University


StatCat: Building a Statistical Data Finder. IASSIST 2002, 13 May 2002. Please do not cite or copy without permission.

Themes
- Collaboration
- Domain-specific, not media- or location-specific
- Cross-media data finder
- Portal to Internet resources
- Numeric and spatial social science data

Social Science Data Archive at Yale
- Digital collection since 1972
- Partnership between Social Science Library and Social Science Research Services
- Shared responsibility for the SSDA catalog

History of the SSDA Catalog
- Contained: records for SSDA holdings (data from ICPSR, the Roper Center, federal agencies, IGOs/NGOs, commercial vendors)
- Designed as: SPIRES database on the mainframe, later migrated to the Web
- Maintained by: data librarian and Statlab

The new catalog: StatCat
Created a new structure to improve both the front-end interface and back-end production and maintenance.
- WAIS searching inadequate
- Maintenance too difficult

Goals for StatCat: domain
- Not a media-specific catalog but a domain-specific (social sciences) catalog.
- Includes datasets on Yale's Statlab server, CDs in the Library collections, and data available at other web sites.

Evolution of StatCat
[Figure: successive stages of the catalog's coverage. Tapes, CDs, and files on the server come first; Internet resources are added next; then a link to an external catalog; and finally cross-database search.]

Goals for StatCat: functionality
- Search the fielded full text of records.
- Provide full location information to retrieve the actual data.

Goals for StatCat: adhere to standards
- Base records upon a DDI subset (so that every field in StatCat maps to a DDI field).
- Potential output to multiple systems or metadata formats: MARC, DC, OAI, DDI, FGDC.
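
The idea that every catalog field maps to a DDI field, and from there to other formats, can be sketched as a small crosswalk table. This is only an illustration: the catalog field names on the left are hypothetical (the slides do not list StatCat's actual fields), and the DDI paths and Dublin Core elements are one plausible pairing, not the mapping the authors used.

```python
# Hypothetical crosswalk: catalog field -> (DDI codebook element path, Dublin Core element).
# The catalog field names are invented for illustration; they are not StatCat's columns.
CROSSWALK = {
    "title":       ("stdyDscr/citation/titlStmt/titl", "title"),
    "author":      ("stdyDscr/citation/rspStmt/AuthEnty", "creator"),
    "abstract":    ("stdyDscr/stdyInfo/abstract", "description"),
    "keyword":     ("stdyDscr/stdyInfo/subject/keyword", "subject"),
    "geography":   ("stdyDscr/stdyInfo/sumDscr/geogCover", "coverage"),
    "time_period": ("stdyDscr/stdyInfo/sumDscr/timePrd", "coverage"),
}


def ddi_path_for(field):
    """Return the DDI element path a catalog field maps to, or None if unmapped."""
    entry = CROSSWALK.get(field)
    return entry[0] if entry else None


def to_dublin_core(record):
    """Regroup a catalog record (a dict) under Dublin Core element names."""
    dc = {}
    for field, value in record.items():
        if field not in CROSSWALK:
            continue  # fields with no mapping are left out of the output
        dc_element = CROSSWALK[field][1]
        values = value if isinstance(value, list) else [value]
        dc.setdefault(dc_element, []).extend(values)
    return dc


record = {
    "title": "Example Survey",
    "keyword": ["health", "income"],
    "geography": "United States",
    "time_period": "1990",
    "internal_note": "not exported",
}
print(to_dublin_core(record))
```

Keeping the mapping in one table means a new output format needs only one more column, not a change to the records themselves.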

Related Standards
- Bibliographic: MARC, Dublin Core, OAI, GILS
- Statistical domain: DDI, ISO
- Related domains: FGDC, EAD, TEI

Data Documentation Initiative
Consists of these parts:
- Document description
- Study description
- File description
- Data description
- Related material

DDI Study Description section
- Citation: bibliographic information for the data collection
- Scope: information about the study's subject and its geographic and temporal coverage (including abstracts and keywords)
- Methodology and process: information about how the data were collected (e.g. sample design)
- Data access: access conditions and terms of use for the data collection
- Other study description materials

XML vs. database
Strengths of XML:
1) Good at describing hierarchical data
2) Great for presenting multiple views into the same data source
3) Exchanging data between independent sites in a highly structured manner
4) Transport format: ASCII, fully tagged
5) DDI and ICPSR are using it: we will receive records in some version of DDI XML

XML vs. database
Decided to go with a database and not XML at this time.
- The database met immediate requirements: improved searching and ease of maintenance. Well-known technology.
- XML tools still under development.
- Drawback: records are no longer in "webspace".
- Eventually the database will generate XML records.
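
A minimal sketch of the "database generates XML records" idea follows, assuming a single hypothetical `study` table with `title` and `abstract` columns. SQLite stands in for the production database only so the example is self-contained. The element names follow the DDI codebook's study description, but the output is a fragment for illustration and is not claimed to validate against the DDI DTD.

```python
import sqlite3
import xml.etree.ElementTree as ET


def study_to_ddi_xml(row):
    """Build a small DDI-style <codeBook> fragment from one database row."""
    code_book = ET.Element("codeBook")
    stdy_dscr = ET.SubElement(code_book, "stdyDscr")

    # Citation: bibliographic information for the data collection.
    citation = ET.SubElement(stdy_dscr, "citation")
    titl_stmt = ET.SubElement(citation, "titlStmt")
    ET.SubElement(titl_stmt, "titl").text = row["title"]

    # Scope: here only the abstract is carried over.
    stdy_info = ET.SubElement(stdy_dscr, "stdyInfo")
    ET.SubElement(stdy_info, "abstract").text = row["abstract"]

    return ET.tostring(code_book, encoding="unicode")


# Hypothetical table and sample row, created in memory for the demonstration.
conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row  # allows access to columns by name
conn.execute("CREATE TABLE study (id INTEGER PRIMARY KEY, title TEXT, abstract TEXT)")
conn.execute(
    "INSERT INTO study (title, abstract) VALUES (?, ?)",
    ("Example Survey", "A made-up study used only for illustration."),
)

row = conn.execute("SELECT title, abstract FROM study WHERE id = 1").fetchone()
xml_record = study_to_ddi_xml(row)
print(xml_record)
```

With this arrangement the database remains the maintained source of record and the XML is a derived export, which matches the order of priorities on the slide.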

Designing the database
1. Determined what fields we needed:
- Examined ICPSR's "slightly modified" version of the DDI codebook DTD and compared it to the current version of DDI.
- Mapped our catalog fields to DDI.
- Mapped our catalog fields to Dublin Core; looked at OAI.


Designing the database
1. Determined the type of queries we were going to ask of the data.
2. Determined relations between tables.
3. Determined which fields in which tables.
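
The design steps above (queries first, then relations, then fields per table) can be illustrated with a toy relational layout. All table and column names here are assumptions made for the example and are not StatCat's actual schema; SQLite is used only so the sketch runs anywhere.

```python
import sqlite3

# Hypothetical schema: one row per study, keywords in a many-to-many relation,
# and one or more locations (CD, server file, Internet) per study.
SCHEMA = """
CREATE TABLE study (
    study_id INTEGER PRIMARY KEY,
    title    TEXT NOT NULL,
    abstract TEXT
);
CREATE TABLE keyword (
    keyword_id INTEGER PRIMARY KEY,
    term       TEXT UNIQUE NOT NULL
);
CREATE TABLE study_keyword (
    study_id   INTEGER REFERENCES study(study_id),
    keyword_id INTEGER REFERENCES keyword(keyword_id),
    PRIMARY KEY (study_id, keyword_id)
);
CREATE TABLE location (
    location_id INTEGER PRIMARY KEY,
    study_id    INTEGER REFERENCES study(study_id),
    media_type  TEXT,
    address     TEXT
);
"""


def build_demo_db():
    """Create an in-memory database with two made-up records."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO study VALUES (1, 'Example Health Survey', 'Illustrative record.')")
    conn.execute("INSERT INTO study VALUES (2, 'Example Census Extract', 'Illustrative record.')")
    conn.execute("INSERT INTO keyword VALUES (1, 'health')")
    conn.execute("INSERT INTO study_keyword VALUES (1, 1)")
    conn.execute("INSERT INTO location VALUES (1, 1, 'CD', 'Library shelf (placeholder)')")
    conn.execute("INSERT INTO location VALUES (2, 2, 'Internet', 'http://example.org/data')")
    return conn


def search(conn, title_word=None, keyword=None, media_type=None):
    """Fielded search: each argument restricts one field; omitted ones are ignored."""
    sql = """
        SELECT DISTINCT s.title, l.media_type, l.address
        FROM study s
        LEFT JOIN study_keyword sk ON sk.study_id = s.study_id
        LEFT JOIN keyword k ON k.keyword_id = sk.keyword_id
        LEFT JOIN location l ON l.study_id = s.study_id
        WHERE 1 = 1
    """
    params = []
    if title_word:
        sql += " AND s.title LIKE ?"
        params.append(f"%{title_word}%")
    if keyword:
        sql += " AND k.term = ?"
        params.append(keyword)
    if media_type:
        sql += " AND l.media_type = ?"
        params.append(media_type)
    sql += " ORDER BY s.title"
    return conn.execute(sql, params).fetchall()


conn = build_demo_db()
print(search(conn, keyword="health"))
print(search(conn, media_type="Internet"))
```

Because each result row carries a media type and an address, the same query that finds a study also returns the location information needed to retrieve the data.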

[Figure: StatCat database design (with DDI element numbers)]

Designing the database
4. Decided how to parse our records into the database fields.

Side effects of the conversion process
- Scrutinize and clean up existing records
- Leads to questions: what are we cataloging, and why? What are we collecting, and why?
- Implications for archiving policies

StatCat v.2
- PHP migrated to a Java server-side application: more modular and extensible
- MySQL DBMS migrated to PostgreSQL
- New avenues this opens:
  - Spatial searches
  - Pre-analysis of data before downloading from our archive
  - Give the client metadata and data in the same download
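
One simple form a spatial search could take is a bounding-box overlap test between a query area and each record's geographic extent. The sketch below is an assumption about what such a search might look like, not a description of StatCat's implementation; the records and coordinates are invented, and boxes that cross the antimeridian are not handled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """Geographic extent in decimal degrees (west/east longitude, south/north latitude)."""
    west: float
    south: float
    east: float
    north: float

    def intersects(self, other):
        """True if the two boxes overlap or touch."""
        return (
            self.west <= other.east
            and other.west <= self.east
            and self.south <= other.north
            and other.south <= self.north
        )


def spatial_search(records, query_box):
    """Return titles of records whose extent overlaps the query box."""
    return [title for title, box in records if box.intersects(query_box)]


# Made-up records with made-up extents.
records = [
    ("Dataset A", BoundingBox(0.0, 0.0, 10.0, 10.0)),
    ("Dataset B", BoundingBox(20.0, 20.0, 30.0, 30.0)),
]

print(spatial_search(records, BoundingBox(5.0, 5.0, 15.0, 15.0)))
```

In a production system a test like this would normally be pushed into the database itself instead of looping in application code.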


Near-term next steps
- Add records for geospatial data
- Ability to sort or separate results to distinguish GIS and non-spatial data
- Limit search by media type
- Continue to catalog data on the Internet
- Interoperability with other catalogs

Long-term next steps
- Link study description to live data sets, including documentation and software setups.
- Spatial queries
- Search variables and question text.
- Develop StatCat as a portal to social science numeric data services.