METS Dissemination: Interfaces METS Opening Day 28 October, 2003 Leslie Myrick.

Slides:



Advertisements
Similar presentations
Permanent Hosting, Archiving and Indexing of Digital Resources and Assets Raman Ganguly Computer Center University of Vienna.
Advertisements

Introduction to METS (Metadata Encoding and Transmission Standard) Jerome McDonough New York University
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
1/7 ITApplications XML Module Session 8: Introduction to Programming with XML.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
MacKenzie Smith Associate Director for Technology MIT Libraries.
METS: An Introduction Towards a Digital Object Standard Rick Beaubien Library Systems Office U.C. Berkeley.
METS: An Introduction Structuring Digital Content.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
DSpace Devika P. Madalli DRTC, ISI Bangalore.
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
Implementation of One Stop Search by XSLT By Dave Low University of Hong Kong 9-Dec-2003.
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
EAD: A Technical Introduction Julie Hardesty, Metadata Analyst June 3, 2014.
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
University of Illinois at Urbana-Champaign OAI Alpha Experiences Timothy W. Cole Thomas G. Habing Grainger Engineering.
Scientific Markup Languages Birds of a Feather A 10-Minute Introduction to XML Timothy W. Cole Mathematics Librarian & Professor of.
Mark Sullivan University of Florida Libraries Digital Library of the Caribbean.
“Old Style” Libraries, Digital Libraries: Convergences, Divergences, And the Troubles in Between.
Another PillowTalk Presentation  2004 Dynamic Systems, Inc. Introduction to XML for SOA Lee H. Burstein,
From Creation to Dissemination A Case Study in the Library of Congress’s use Open Source Software DLF Spring Forum Corey Keith
The DigiTool to FDA Program Lydia Motyka Florida Center for Library Automation.
Intro. to XML & XML DB Bun Yue Professor, CS/CIS UHCL.
Session IV Chapter 9 – XML Schemas
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
Transforming Documents „a how-to of transforming xml documents“ Lecture on Walter Kriha.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
UVa's Digital Library CSG - September 2005 Slides courtesy of: Leslie Johnston Director, Digital Access Services, UVA Library Tim Sigmon University of.
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
METS Case Study: The NYU Digital Library Team METS Opening Day 27 October, 2003 Leslie Myrick.
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
1 “Universal Data-Speak”: The eXtensible Markup Language Zack Ives CSE 590DB, Winter 2000 University of Washington 3 January 2000.
EAD: An Introduction and Primer Christopher J. Prom, Ph.D. Assistant University Archivist University of Illinois Archives July 7, 2003.
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 1COMP9321, 15s2, Week.
Dom and XSLT Dom – document object model DOM – collection of nodes in a tree.
Challenges in the Nursery: Linking a Finding Aid with Online Content Elizabeth Johnson, Lilly Library Jenn Riley, Digital Library Program DL Brown Bag,
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Martin Kruliš by Martin Kruliš (v1.1)1.
Permanent Hosting, Archiving and Indexing of Digital Resources and Assets Markus Höckner Computer Center University of Vienna.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
XML Tools (Chapter 4 of XML Book). What tools are needed for a complete XML application? n Fundamental components n Web infrasructure n XML development.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Rendering Syndicated Library Content in an Institutional Portal: Integrating MyLibrary into uPortal John Fereira: Cornell University Eric Lease Morgan:
SEMI-STRUCTURED DATA (XML) 1. SEMI-STRUCTURED DATA ER, Relational, ODL data models are all based on schema Structure of data is rigid and known is advance.
Do Real Archivists Use OAI? Mid-Atlantic Regional Archives Conference Gettysburg, PA October 31, 2003 Chris Prom Assistant University Archivist University.
Apache Cocoon – XML Publishing Framework 데이터베이스 연구실 박사 1 학기 이 세영.
I Copyright © 2004, Oracle. All rights reserved. Introduction.
Java Aufgabe asp.net 2 input files: XML & XSLT using XALAN parser very simple to implement the output is an HTML file which is redirected to the.
1 Introduction to XML Babak Esfandiari. 2 What is XML? introduced by W3C in 98 Stands for eXtensible Markup Language it is more general than HTML, but.
and Transmission Standard overview – and case study
7th Annual Hong Kong Innovative Users Group Meeting
The Fedora Project March 10, 2003
XML Related Technologies
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Building Search Systems for Digital Library Collections
XML Data DTDs, IDs & IDREFs.
XML Problems and Solutions
ICOM 5016 – Introduction to Database Systems
CSE591: Data Mining by H. Liu
The Fedora Project April 28-29, 2003 CNI, Washington DC
Oracle and XML Mingzhu Wei /7/2019.
Introduction to METS (Metadata Encoding and Transmission Standard)
Presentation transcript:

METS Dissemination: Interfaces METS Opening Day 28 October, 2003 Leslie Myrick

NYU Collections using METS Interfaces EAD Finding Aids Tokyo Tribunal Proceedings Afghanistan Digital Library * CRL Web Archiving Project DRAM Hemispheric Institute REPO History Sign Project

Ingredients Tomcat Servlet Engine XSLServlet or SaxonServlet XT or Saxon Transformation Engine MySQL Database for generation Perl DBI and CGI for interface to DB

Why XSLT? Relatively simple Open-source, platform-neutral, standards- based Official Recommendation of W3C It is XML

Free XSLT Tools Abound Editors: emacs, NoteTab + Xalan.bat Servlet Containers – Tomcat, Resin Transformation Engines – Xalan, Saxon, XT Parsers – Xerces, Aelfred,XP/Sax, Crimson Parsing APIs: DOM, SAX

METS as a Functional Syntax METS designed not only for transfer and archival management, but for giving access to, navigating an object METS + XSLT can create dynamic interfaces with links to resources and their metadata METS can be dumped into Oracle, indexed and searched using context-aware queries.

How to Navigate a METS Document ID, IDREF, IDREFS – Each ID must have a matching IDREF and v.v. – To match an ID against more than one value use IDREFS (e.g. multiple ADMID values in METS:file Keys – More flexible; they make document into a database

ID, IDREF, IDREFS Provide navigable relationships between files and their metadata in complex Schema e.g. METS Must be defined in Schema or DTD Restrictive: Element can have only one ID; ID values must be unique (e.g. authorID and artistID can’t be same)

Keys; the key() function Creates an index Defined in the stylesheet and not in the DTD/Schema Flexible – many keys on one element: one for each attribute. Any number of elements can match a given value

Uses for METS From the humble Finding Aid … to …

METS and Finding Aids Beyond the href pointer Useful for managing complex image structure – e.g. multiple scans of multiple pages of letters Holistic way to present descriptive metadata along with inline image (all in one package) Also useful for presenting technical metadata that EAD does not yet accommodate

METS Pageturners Creates HTML page or frameset with links to resources Creates navigable relationships between resources in a METS file Creates complex time-based media synchronizationss

Sfquad.xml redux Question: could XSLT mimic java in rendering METS? The answer at the time: no – Dynamic frame reloading a special problem

N-YHS Edisto Album Album of 77 images from the Civil War period Logical structure: album – page - images Two to four images per page Presented with or without collapsible TOC

Tokyo Tribunal Simple nested structure: jpg page views of Decision taken by the Tokyo Tribunal Collapsible TOC to unpack logical structure of various parts

Afghanistan Digital Library 40 books from (400 eventually) Simple structure – no chapters for the most part METS Web viewer + PDF / CD version Page Images (TIFF at 600 dpi); service files at dpi

CRL Political Web Archive Collaboration between Stanford, Cornell, Texas, NYU, IA under aegis of CRL, Mellon Sub-Saharan Africa, South East Asia, Latin America, Western Europe Testbed: 400 URLs; websites from radical groups, NGOs Internet Archive.arc files

.arc file 100 MB aggregate of harvested files, along with HTTP headers and crawler- generated header for each file Fine as a simple SIP, but basically unmanageable as an AIP or DIP At present accessed using byte offsets to grab content from aggregate file Only searchable by URL (Wayback Machine)

Can METS save.arc? One solution: a METS file for each website contained in.arc At collection level, ur-METS file to manage the different versions of website on different dates in different.arcs Alternatively, a METS file for each arc, delineating content by byte offset? Naah.

It’s the Structure, Silly Ur-METs with to versions (cf. serials model) Failure of web-archiving access models to date due to indexing at page level only Netarkivet.dk – NWA Document format xml document for each page; indexed by FAST Results: thousands of hits and no context.