Interoperability and Web Services

Slides:

Advertisements

Similar presentations

Possibility in Digital Collection Management Introduction to CONTENTdm TM Hitoshi Kamada University of Arizona Presentation for OCLC-CJK Users Group Annual.

Advertisements

Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey.

A centre of expertise in digital information management IMS Digital Repositories Interoperability Andy Powell UKOLN,

Electronic Theses: The Next Stage Metadata for ETDs Richard Milne The Robert Gordon University 27 th September 2004.

Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.

Regina 2014 GALLOP Portal Status Update, Future Plans Greg Salmers Saskatchewan Legislative Library A Library & Research Session.

The OCLC Metadata Switch Project Jean Godby, Thomas Hickey, Diane Vizine-Goetz OCLC Office of Research Digital Library Federation May 14, 2003.

The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.

Introducing Symposia : “ The digital repository that thinks like a librarian”

Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.

Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.

XML: The Strategic Opportunity Roy Tennant Challenges*  Only librarians like to search, everyone else likes to find  Our users want more information.

How to participate in the Union Catalogue Project Hussein Suleman Sivulile – Open Access South Africa Advanced Information Management.

8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.

Metadata Repositories for Interoperable/Shareable Metadata.

1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,

The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.

Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.

OCLC Research: an update Lorcan Dempsey

ALCME: OAI at OCLC Jeffrey A. Young OCLC Online Computer Library Center, Inc.

Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.

Organizing Internet Resources OCLC’s Internet Cataloging Project -- funded by the Department of Education -- from October 1, 1994 to March 31, 1996.

Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.

New approaches to the catalog T. Hickey Svensk Biblioteksförening 2005 October 28.

Linking electronic documents and standardisation of URL’s What can libraries do to enhance dynamic linking and bring related information within a distance.

07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.

Metadata in a distributed information environment: Interoperability as recombinant potential Lorcan Dempsey OCLC/SCURL pre-IFLA conference, 15/16 Aug 02.

1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.

1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.

Accessing a national digital library: an architecture for the UK DNER Andy Powell ELAG 2001, Prague 7 June 2001 UKOLN, University of Bath

1 The Future Of Union Catalogues Some BL Perspectives Neil Wilson Head of Bibliographic Development Scholarship & Collections Boston Spa 17 th March 2006.

Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi

Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.

Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.

Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.

How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.

Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.

Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,

ETDs and NDLTD Hussein Suleman University of Cape Town May 2004.

Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.

Digital libraries research IG Cataloging and metadata IG Web services and metadata switch February 2003 Web services and metadata switch February 2003.

1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.

Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.

Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.

A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.

Web Services Overview Thomas Hickey. 2 What are Web Services? Machine-to-machine communication Run over standard Web protocols –XML syntax, HTTP packaging.

CRAI Library Catalog of University of Barcelona

Resource Discovery Landscape

Getting a Leg Up on OAI for the NSDL

Information modeling and infrastructures for metadata

An Overview of Data-PASS Shared Catalog

CRAI Library Catalog of University of Barcelona

Repository Software - Standards

WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000

The Hosted Model Charl Roberts Good morning again,

Accessing a national digital library: an architecture for the UK DNER

A step-by-step guide to DOI registration

Metadata for Special Collections

OAI and Metadata Harvesting

Implementing an Institutional Repository: Part II

Cataloging the Internet

Open Archives Initiative

Digitometric Services for Open Archives Environments

Tech introduction.

IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.

Open Archive Initiative

JISC Information Environment Service Registry (IESR)

Attributes and Values Describing Entities.

Implementing an Institutional Repository: Part II

How to Implement an Institutional Repository: Part II

Presentation transcript:

Interoperability and Web Services T. Hickey Thank you. With Access 2002

Interoperability of What? The problem Decentralized digitization projects Various standards and procedures Desired: Uniform access Wide use

Two Major Option Federated search of distributed systems Centralized search of harvested metadata Looked at the CIDL

Federated Search Option Z39.50/SRU HTTP/ HTML Problem: Weakest link problem

Aggregation Option OAI-PMH HTTP/ HTML OAI-PMH/Z39.50/SRU Harvested Records

Interoperability Technologies Z39.50/SRU/SRW HTML & Web OAI-PMH OpenURL, Proprietary solutions Will be talking mostly about OAI

What are Web Services? Machine-to-machine communication Run over standard Web protocols XML syntax, HTTP packaging Several approaches REST Representational State Transfer SOAP Simple Object Access Protocol

Z39.50 → SRU/SRW Search and retrieve Designed for interactive sessions SRU does it with URL’s SRW does it with SOAP Designed for interactive sessions Widespread use Records can be written as well as read Many interoperability issues are now being dealt with http://oclc.org/SRW/WorldCat?query=title+exact+"Hunting%20Dogs"

Straight HTML Expose each record as an HTML page Create a ‘tree’ above them Search via

OAI Protocol Uses HTTP Fairly simple URLs to Identify the server List formats, record sets Get records (can specify date modified) Has flow control so that large sets can be managed http://www.openarchives.org/

OAI Protocol (continued) XML version of Dublin Core is required Other metadata formats possible (wrapped in XML) Typical uses: Publish metadata for a special collection Use to keep two catalogs synchronized

Synchronized Metadata Distribute creation Distribute responsibility Centralize searches Avoid federated searching Propagate changes Cataloguing Authorities Share data

State of OAI-PMH Version 2.0 as of 2002-06-14 Migration Document Implementation Guidelines Repository Implementers Harvester Implementers Aggregators, Caches, and Proxies

State of OAI (continued) Hundreds of repositories Dozens of aggregators Quickly moving to 2.0 Forming the basis for many services ePrints UK NDLTD Some interesting aggregation services

Metadata is Small Moving typical collections at T1 speeds 10,000 ~10 seconds 100,000 ~2 minutes 1,000,000 ~15 minutes 50,000,000 ~15 hours WorldCat plus authority files Equivalent to 3 DVD movies Have a copy on my PC

Distributing Creation Creation is a communal activity Doesn’t mean everyone has to be on a central system Need access to tools Authority files Analysis Transformations Needs some central coordination

Access to Tools Web Services Proper ‘granularity’ XML/SOAP Web infrastructure Proper ‘granularity’ How often accessed Whether interactive

Distribution of Responsibility OAI provides the mechanism Changes can be detected OAI is completely ‘pull’ Still need agreements Issues in how corrections are made

Centralized Searches Federated searches aren’t worth it Cheaper and easier to consolidate metadata Easier to ensure uniformity E.g. being able to find documents Technical point of view: replicate

An Example: ePrints UK Records harvested from archives Web services to enhance metadata http://www.rdn.ac.uk/projects/eprints-uk/

e-Prints UK Institutional e-print archives Non-institutional e-print Personal e-print archives OAI-PMH Subject Classification Service e-Prints UK SOAP Name Authority Service SOAP HTTP Z39.50 RDN Gateway/portal Service Citation Analysis Service RDN Gateway/portal Service RDN Gateway/portal Service

Example: XTCat Experimental Theses Catalogue Union catalogue of thesis and dissertation metadata http://alcme.oclc.org

XTCat (continued) Records extracted from WorldCat Available via OAI-PMH Separate ‘set’ for electronic theses We harvest from other repositories Integrate with existing records Allow re-harvesting (e.g. by VTLS)

XTCat (continued) Have ‘seed’ data Have an organization Problems WorldCat Theses and Dissertation records Have an organization NDLTD encouraging OAI Problems Reliable servers Reliable harvester Duplications Multiple formats

Lessons from Z39.50 Protocols aren’t magic Need to worry about: Formats Indexes Contents of fields Normalization Language

Title: Bill's first log cabin Contributor: Robert E. Gard Contents: Photograph [1], black and white, 14 x 9 cm and glass slide, 9 x 9 cm. From William W. Parker, Waskatenau, Alberta. "We Remember" Collection, [19--]. Date Created: [19--] Genre: Other Coverage Time: [19--] Language: English Source Collection: Alberta Folklore and Local History Collection Source Series: Photographs Accession No: 96-93-958 Rights Access: Open Creators or Authors: Parker, William W. Subject Headings: Log cabins - Alberta – Photographs Photographs A couple of examples of carefully done metadata This one looks like a Dublin Core derivative Alberta Folklore and Local History Collection, Bruce Peel Special Collections Library, U of Alberta Libraries

Title: Interior of observing station, view 1 Published: [1884] Physical Description:1 photograph : b&w ; 10 x 12 cm Scope and Content: Item is a photograph of the interior of the wooden cabin used as an observing station and living quarters, probably at Ashe Inlet. The photograph shows the stove and a wooden table with two shelves mounted on the wall above. The photograph may have been taken by W.A. Ashe, observer at Ashe Inlet during the Hudson Bay Expedition of 1884. Source of Title: Title based on contents of photograph. Inscriptions: "IIX" Location: Ms. Coll. 310, box 15, folder 2 ID Number: P10026 General Note: All the supplies needed to construct the station buildings were carried on the S.S. Neptune, along with fuel and other provisions sufficient to maintain the stations over the winter, until they could be re-supplied the following summer. Forms part of the file "Hudson Bay Expedition photo album". Subjects: Cabins Looks like MARC Barren Lands J.B. Tyrrell’s Expeditions for the Geological Survey of Canada 1892-1894, Thomas Fisher Rare Book Library, U of Toronto

Some Givens XML Z39.50 → SRU/SRW FTP → OAI Standardizes character set Standardizes record format Z39.50 → SRU/SRW FTP → OAI Interoperability is still hard Digital preservation is harder still

Questions?

Proprietary Solutions Many times cheapest Costs faced up front Possibly less flexibility Probably less open Not as interesting Attractive when problem is well understood Examples: ContentDM, MUSE, Fretwell-Downing, SiteSearch (‘open’ source)

OpenURL URL based Currently being extended Basic operation: Send a coded citation Magic happens Not interactive Aimed at ‘appropriate copy problem’ http://sfx.bath.ac.uk/sfxmenu?genre=book&isbn=1234-5678