OAI & NSDL Research at Grainger Briefing to UIUC Library Faculty 15 April 2003 Timothy W. Cole (t-cole3@uiuc.edu) William H. Mischo (w-mischo@uiuc.edu)

Slides:



Advertisements
Similar presentations
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
Advertisements

NSF – DLF – JISC/UKOLN Digital Library Service Registry Workshop National Science Foundation, Arlington, VA March 2006 The University of Illinois.
Building Reliable Distributed Information Spaces Carl Lagoze CS /22/2002.
National Science Digital Library (NSDL) Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
NSDL 2 nd Generation Mathematics Digital Library ASEE Annual Meeting June 13, 2005 Portland, OR William H. Mischo
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
1 An introduction to the NSDL William Y. Arms Cornell University.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
Enriching Metadata for XML Journal Articles Through Extraction of MathML and Function Names Timothy W. Cole William.
Introduction to the OAI Metadata Harvesting Protocol Hussein Suleman, Digital Library Research Laboratory Virginia Tech.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
IMLS NLG Collection Registry & Item-Level Metadata Repository at the University of Illinois Timothy W. Cole Mathematics Librarian &
University of Illinois at Urbana-Champaign OAI Alpha Experiences Timothy W. Cole Thomas G. Habing Grainger Engineering.
IESR Interfaces: Current Services and Future Plans Ann Apps MIMAS, The University of Manchester, UK.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Alexandria Digital Earth ProtoType DIGITAL LIBRARIES AND ENVIRONMENTAL INFORMATION Terence R. Smith Alexandria Digital Library Project.
1 A Very Large Digital Library Technology Demonstration William Y. Arms Cornell University.
National Science Foundation The National SMET Education Digital Library (NSDL) Program: Context and Vision August 10, 2000 US-Korea Joint Workshop on Digital.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
1 The NSDL Program Stephen Griffin National Science Foundation.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
“A Library outranks any other one thing a community can do to benefit its people.” --Andrew Carnegie.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
Metadata and OAI DLESE OAI Workshop April 29-30, 2002 Katy Ginger Presentation available at:
Metadata and OAI DLESE OAI Workshop June 29 to July 2, 2002 Katy Ginger Presentation available at:
IMLS DCC Project Briefing ( ) Jenny Benevento ( ) Timothy W. Cole.
NSDL & Access Management David Millman Columbia University Jan ‘02.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Open Archive Forum Rachel Heery UKOLN, University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
Award Number IUG 2004 Boston, MA Integrating Digital Libraries and Traditional Libraries Sue Cody Arlene Hanerfeld Dan Pfohl University of North.
Distributed Service Registry Workshop, Warwick, U.K. 1 Distributed Functionality in the UIUC OAI Registry
Search Interoperability, OAI, and Metadata An Introduction to the OAI Protocol for Metadata Harvesting Sarah Shreeves University of Illinois at Urbana-Champaign.
Designing Protocols in Support of Digital Library Componentization Hussein Suleman and Edward A. Fox Digital Library Research Laboratory Virginia Tech.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
DLF Fall Forum The Distributed Library: OAI for Digital Library Aggregation UIUC’s Role: Registry of OAI Data Providers
Digital libraries research IG Cataloging and metadata IG Web services and metadata switch February 2003 Web services and metadata switch February 2003.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
OAI and ODL Building Digital Libraries from Components Hussein Suleman Virginia Tech DLRL 12 September 2002.
1 CS 430: Information Discovery Lecture 13 Case Study: the NSDL.
TRIG: Truckee River Info Gateway Dave Waetjen Graduate Student in Geography Information Center for the Environement (ICE) University of California, Davis.
Utility of an OAI Service Provider Search Portal
Vision... “… a network of learning environments and resources for Science, Mathematics, Engineering and Technology education, will ultimately meet the.
University of Illinois at Urbana-Champaign OAI Alpha Experiences
? What is Institutional Repository for Rutgers University
NSDL: OAI and a large-scale digital library
Introduction to Metadata
OAI and Metadata Harvesting
eCulture Science Gateway – reloaded
Digitometric Services for Open Archives Environments
NSDL Data Repository (NDR)
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
Open Archive Initiative
Digital Library Issues and Trends
Digital Libraries and the Future of Academic Libraries
JISC Information Environment Service Registry (IESR)
IVOA Interoperability Meeting - Boston
Presentation transcript:

OAI & NSDL Research at Grainger Briefing to UIUC Library Faculty 15 April 2003 Timothy W. Cole (t-cole3@uiuc.edu) William H. Mischo (w-mischo@uiuc.edu) http://dli.grainger.uiuc.edu/Publications/TWCole/LibFac2003/

Projects Open Archives Initiative: National Science Digital Library: Illinois OAI Metadata Harvesting (Mellon) IMLS Digital Collections & Content (IMLS) Grainger OAI Resources in Science & Engineering National Science Digital Library: 2nd Generation Math Resources (NSF / DUE)

OAI Protocol for Metadata Harvesting Harvesting approach to interoperability at metadata level Divides world into Metadata Providers & Service Providers Builds on HTTP, XML, & Dublin Core http://www.openarchives.org/

OAI is a tool All about moving metadata (not data) around A building block, useable by many communities – supports new models of scholarly communication Can facilitate, in some cases enable, advanced digital library services & functions Assumes widely distributed content, but centralized indexing(!) – requires critical mass Providers build once, share many times Purpose of OAI is to foster interoperability

Harvesting vs. Federation Competing approaches to interoperability Federation is when services are run remotely on remote data (e.g. Broadcast Searching) Harvesting is when data/metadata is transferred from the remote source to the destination where the services are located (e.g. Union Catalogs) Federation requires more effort at each remote source but is easier for the central system and vice versa for harvesting OAI focuses on harvesting

Reliance on HTTP, XML, DC OAI is a REpresentational State Transfer (REST) protocol – i.e., URL-based Z39.50, Web services, SOAP are RPC-based OAI requests are sent via the HTTP protocol using GET or POST OAI responses are valid XML documents XML allows validation, increases reliability of what’s harvested (in terms of structure) DC is OAI’s Lowest Common Denominator Communities encouraged to use additional schemas

How OAI Works OAI “VERBS” Identify ListMetadataFormats ListSets ListIdentifiers ListRecords GetRecord Service Provider Metadata Provider H A R V E S T E R R E P O S I T O R Y OAI HTTP Request OAI (OAI Verb) HTTP Response (Valid XML)

As Compared to Z39.50 Z39.50 OAI Content (Objects) Distributed World View Bibliographic Object Presentation Data provider Searching is Centralized Search done by Service provider Metadata searched is Up to date Stale Semantic Mapping When searching Metadata delivery

Mellon-OAI Project Create a web portal to scholarly information resources in cultural heritage harvested via OAI Primary objectives: Develop & make available OAI harvesting tools Build harvesting and search services Investigate viability and utility of searching OAI harvested resources Explore issues of advanced search/indexing/display Explore user needs & metadata usage patterns Identify critical issues and best practices for using OAI with cultural heritage material

Mellon-OAI Achievements Developed harvesting tools (Open Source) Refined data provider tools (Open Source) Investigated logistics of harvesting activities Investigated metadata provider usage of DC, EAD Created XSL stylesheets for metadata transformations (MARC to DC; EAD to DC) Experimented w/ configurations to address scalability & performance issues Usability testing with students in College of Education

Metadata aggregation 39 providers (OAI-compliant and surrogates) Metadata describing resources of 580 institutions (CIMI, CDP) 1.1 million original records 2.6 million including item-level records derived from EAD finding aids

IMLS Digital Collections & Content Build registry of all National Leadership Grant collections with digital content. Assist & guide NLG projects in making item-level metadata sharable using OAI. Build repository, search & discovery tools for integrated access to content of NLG collections Research best practices for sharing metadata about diverse digital content & supporting interests of diverse user communities. Collaboration between UIUC Library, GSLIS, & IMLS

Project Sites UIUC OAI Cultural Heritage Repository Mellon-OAI Project Site IMLS DCC Project Site

National Science Foundation NSDL Program National Science, Technology, Engineering, Mathematics Digital Library. http://www.nsdl.org/ Coverage: K to Grey. National system for distributed science education; characterized by a set of exemplary resource collections and services. Highly competitive grants: 3 years, 339 proposals, 105 funded; three main categories: collections, services and targeted research.

2nd Generation Math Resources Collaboration with UIUC Library, Wolfram Research Inc., & COE Dept of Theoretical and Applied Mechanics. Project Objectives: Adding interactive and graphical content to two feature-rich Wolfram sites. Generating and extracting OAI-compliant metadata, establishing OAI Provider site, adding mathematics controlled vocabulary terms. Developing courseware and problem libraries for TAM courses.

Providing Metadata to NSDL Exposing metadata via OAI Preferred method for bringing metadata into the NSDL repository (requires little manual intervention) Sending metadata via ftp Enabling metadata "scraping" Creating and editing directly to the NSDL metadata repository See also: NSDL Metadata Primer

Wolfram Functions Web Site Source HTML Page Derived Metadata <dc:identifier> <dc:description> <dc:date> <dc:rights>

Wolfram Functions Web Site Source HTML Head Extracted Metadata <dc:title> <dc:description> <dc:subject> <dc:format> <html> <head> <title>Square root: Primary…</title> <meta name='Description' content='Primary definition …' > <meta name='Keywords' content='Sqrt, square root, …' > <meta http-equiv='Content-Type' content='text/html; charset=iso-…'> </head> …

Sample Metadata File for a Wolfram Functions Web Page <oai_dc:dc … > <dc:title>Square root: Primary definition (formula …</dc:title> <dc:subject>Sqrt</dc:subject> <dc:subject>square root</dc:subject> … <dc:description>Primary definition (2 formulas)</dc:description> <dc:description><math … </math></dc:description> <dc:date>2001-10-29</dc:date> <dc:publisher>Wolfram Research, Inc.</dc:publisher> <dc:type>Text</dc:type> <dc:format>text/html; charset=iso-8859-1</dc:format> <dc:identifier>http://functions.wolfram.com…/Sqrt/02/0001/</dc:identifier> <dc:identifier>http://functions.wolfram…/01.01.02.0001.01</dc:identifier> <dc:language>en</dc:language> <dc:rights>© 2002 Wolfram Research, Inc.</dc:rights> </oai_dc:dc>

The NSDL metadata repository Core Integration Project – Cornell, Columbia, DLESE. The metadata repository is a resource for service providers. It holds information about every collection and item known to the NSDL. Services Users Metadata repository From “The NSDL Metadata Strategy,” A presentation by William Y. Arms and Diane I. Hillman. Available: http://nsdl.comm.nsdlib.org/allprojects01/metastrategy.ppt Collections

Working Assumptions The WWW is the primary medium (for now) Content is a mix of “born digital” and analog There is no lack of “great piles of ‘stuff’ ” There is a need for “piles of great ‘stuff’ ” The “unit” of content can and will shrink Users will increasingly be creators, and vice versa While much of the use will be “free”, there is a need to explore multiple models of sustainability Experimental nature of distributed digital library building - “one library, many portals” Increasingly, analog content is produced digitally (though still distributed in analog form). Relative mix of content will shift towards more “born digital”. Java applets are an example of content with small granularity, but potentially very high reusability. (Granularity inversely proportional to reusability.) Implication of decreasing granularity is that more users can become creators: reusing, repurposing content. Also providing commentary on content will also be a contribution. “Free” as in valued as a “public good” and hence supported publicly.

Related Links http://mathworld.wolfram.com/ http://functions.wolfram.com/ OAI Resources in Science & Engineering