A Library Science Perspective on Digitization Bryan Heidorn University of Arizona.

Slides:



Advertisements
Similar presentations
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Advertisements

An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
MacKenzie Smith Associate Director for Technology MIT Libraries.
Behind the scenes Presented by: Doug Dunlop. Metadata 101 o Simple definition: Data about data. o What it does: Describes content, Represents and creates.
METS In order to reconstruct the archive, we will need to understand the METS files. METS is schema that provides a flexible mechanism for encoding descriptive,
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
1 Managing Legal Deposit for Online Publications in Germany Cornelia Diebel.
OAI-PMH Dawn Petherick, University Web Services Team Manager, Information Services, University of Birmingham MIDESS Dissemination.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
METS What is METS ? What is METS ? A schema that provides a flexible mechanism for encoding descriptive, administrative, and structural metadata for a.
Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in.
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
THE RUTGERS WORKFLOW MANAGEMENT SYSTEM Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries August 3, 2007.
OAI Standards for Sheet Music Meeting March 28-29, 2002 Basic OAI Principals How They Apply to Sheet Music Presenter: Curtis Fornadley, Senior Programmer/Analyst.
LSTA Digital Imaging Grants Presentation Projects Workshop September 13, 2002 Wendy Sistrunk Music Catalog Librarian University of Missouri—Kansas City.
The Repository Bridge project Sally Mcinnes, NLW.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
OCLC Online Computer Library Center OCLC’s Digital Archive – Disseminating with METS Jay Goodkin Software Engineer Digital Collection and Preservation.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
OCLC Online Computer Library Center Strategic Partnerships: An International View 30 October 2003.
Cataloging and Metadata at the University Library.
Connecting to Ensemble: AlgoViz. AlgoViz Community  Sharing educational resources Visualizations for data structure and algorithms  Sharing experience.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
OCLC Research: an update Lorcan Dempsey
© WRLC November 2005 Research Commons Supporting Scholarship in the 21st Century.
The Library Cataloging Tradition Marty Kurth CS 431 February 9, 2005 [slides stolen from Diane Hillmann]
LIS 654 BUILDING DIGITAL LIBRARIES FALL 2011 NOVEMBER 03, 2011 The OAI-PMH Harvester Plugin for The Omeka Content Management System JAMES R. GRIFFIN III.
OAI-PMH The Open Archives Initiative Protocol for Metadata Harvesting Presenter: Knud Möller Friday,
Developing Databases and Selecting an Appropriate Library System.
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
Roy Tennant Life After MARC A Metadata Infrastructure for the 21st Century.
Cataloging Compound Digital Objects: Using METS for Digitized Sanborn Maps Christopher Cronin Head of Digital Resources Cataloging University of Colorado.
AACR2 Pt. 1, Monographic Description LIS Session 2.
ETD2006 Preserving ETDs With D.A.I.T.S.S. FLORIDA CENTER FOR LIBRARY AUTOMATION FC LA PAPER AUTHORS: Chuck Thomas Priscilla.
Evidence from Metadata INST 734 Doug Oard Module 8.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
Improving Description through Collaboration: The Ethnomusicological Video for Instruction & Analysis Digital Archive Music Library Association, February.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
The Open Archives Initiative Marshall Breeding Director for Innovative Technologies and Research Vanderbilt University
DSpace - Digital Library Software
GPO’s Future Digital System (FDsys) November 2, 2006 LS&CM CENDI Presentation.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
NSDL & the Open Archives Initiative A Brief Introduction to OAI Timothy W. Cole Mathematics Librarian & Professor of Library Administration.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
A centre of expertise in digital information management UKOLN is supported by: Metadata – what, why and how Ann Chapman.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
AN ARCHETYPE FOR INFORMATION ORGANIZATION AND CLASSIFICATION OCLC WorldCat.
Lecture 12 Why metadata? CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Information modeling and infrastructures for metadata
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Georges Arnaout Chaitanya Krishna
VI-SEEM Data Repository
Metadata - Catalogues and Digitised works
Open Archive Initiative
Presentation transcript:

A Library Science Perspective on Digitization Bryan Heidorn University of Arizona

Library-Museum Parallels Intellectual Property Rights Physical/Digital Objects Sharing Descriptive Metadata Formats Preservation Metadata Transport Metadata Formats Communication Protocols (no so much) Similar Digitization Workflow OCR Challenges

Intellectual Property Rights Expanded to 75yrs in US from 25 Academic Publishing anomalies Attribution required (data no so much) Decoupling of Data from Text

Online Computer Library Center (OCLC) Collaborative Automation of libraries including copy cataloging Started 1967 Catalog 271 million items/year 72,000 libraries in 170 countries and territories use OCLC services to locate, acquire, catalog, lend and preserve library materials.

Descriptive Metadata Formats MARC(XML) 21 Standard METS Dublin Core (Interchange Format only)

Biodiversity Heritage Library Workflow Courtesy: Martin Kalfatovic Program Director, Biodiversity Heritage Library, Smithsonian Institution Libraries

MARC 21 Standard Formats: Bibliographic, Authority, Holdings, Classification, Community Bibliographic Material Types: – Books (BK) – Continuing resources (CR) – Computer files (CF) – Maps (MP) – Music (MU) – Visual materials (VM) – Mixed materials (MX)

MARC Fields 00X: Control Fields 01X-09X: Numbers and Code Fields Heading Fields - General Information 1XX: Main Entry Fields 20X-24X: Title and Title-Related Fields 25X-28X: Edition, Imprint, Etc. Fields 3XX: Physical Description, Etc. Fields 4XX: Series Statement Fields 5XX: Note Fields 6XX: Subject Access Fields 70X-75X: Added Entry Fields 76X-78X: Linking Entry Fields 80X-83X: Series Added Entry Fields X: Holdings, Location, Alternate Graphics, Etc. Fields

MARC Book Example eader/00-23*****nam##22*****#a# /00-01ta 008/ s1991####nyu###########001#0#eng## 020##$a :$c$29.95 (£19.50 U.K.) 020##$a (pbk.) 040##$a[organization code]$c[organization code] 05014$aPN S4$bT $a791.45/75/0973$ #$aTerrace, Vincent,$d $aFifty years of television :$ba guide to series and pilots, /$cVincent Terrace. 2461#$a50 years of television 260##$aNew York :$bCornwall Books,$cc ##$a864 p. ;$c24 cm. 500##$aIncludes index. 650#0$aTelevision pilot programs$zUnited States$vCatalogs. 650#0$aTelevision serials$zUnited States$vCatalogs.

Difference between Museum and Library Full Darwin code has parallels in MARC Many more commercial and custom products Larger installed base Library Entries somewhat more detailed There is a MARC(XML) and MARC Lite MARC differentiates among material types

Digital Content Transport METS – Metadata Encoding and Transmission Standard The METS schema is a standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library, expressed using the XML schema language.

Courtesy: Martin Kalfatovic Program Director, Biodiversity Heritage Library, Smithsonian Institution Libraries

METS Components METS Header Descriptive Metadata Administrative Metadata File Section - The file section lists all files containing content which comprise the electronic versions of the digital object. elements may be grouped within elements, to provide for subdividing the files by object version. Structural Map Structural Links Behavior

I/O Submission Information Package (SIP), which is sent from the information producer to the archive; the Archive Information Package (AIP), which is the information package actually stored by the archive; and the Dissemination Information Package (DIP), which is the information package transferred from the archive in response to a request by a consumer.

Courtesy: Martin Kalfatovic Program Director, Biodiversity Heritage Library, Smithsonian Institution Libraries

Open Archives Initiative Protocol for Metadata Harvesting The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a low- barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP.

OAI Verbs Get Identify ListIdentifiers ListMetadataFormats ListRecords ListSets

Get ier=oai:arXiv.org:cs/ &metadataPrefix =oai_dc

<OAI-PMH xmlns=" xmlns:xsi=" xsi:schemaLocation=" T08:55:46Z <request verb="GetRecord" identifier="oai:arXiv.org:cs/ " metadataPrefix="oai_dc"> oai:arXiv.org:cs/ cs math <oai_dc:dc xmlns:oai_dc=" xmlns:dc=" xmlns:xsi=" xsi:schemaLocation=" Using Structural Metadata to Localize Experience of Digital Content Dushay, Naomi Digital Libraries With the increasing technical sophistication of both information consumers and providers, there is increasing demand for more meaningful experiences of digital information. We present a framework that separates digital object experience, or rendering, from digital object storage and manipulation, so the rendering can be tailored to particular communities of users. Comment: 23 pages including 2 appendices, 8 figures

Metadata Collection and Workflow (Macaw)

Physical/Digital Objects Sharing Books both part of an Edition and Unique 20 th century books have standard front matter LMS contained Metadata Only Journals indexed by article Most digital content is commercially owned and born digital 2011 author-publishing exceeded commercial Born analog digitization (Google Books and BHL)

Governance Libraries pay for OCLC OCLC is Participatory Close Collaboration with Library of Congress on Standards School System exists to train librarians Libraries are being cut in academic, public and school sectors