Building the digital library community John Mark Ockerbloom Carnegie Mellon University February 8, 1999.

Slides:



Advertisements
Similar presentations
Cultural Heritage in REGional NETworks REGNET. October 2001Project presentation REGNET 2 T1.3. IDENTIFICATION OF STANDARDS TO BE USED 1. OBJECTIVES 2.
Advertisements

Near East Plant Protection Network for Regional Cooperation & Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview on.
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
Digital Content Solutions Digital content management technology has transformed the way to manage content and knowledge, in this knowledge era. Research.
Keeping Digital Documents Usable: Managing Data Formats John Mark Ockerbloom Carnegie Mellon University April 27, 1999.
An e-Learning Strategy to promote technology enabled learning i n UCC Teaching & Learning workshop 30 October, 2012.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
8.
Beyond the Digital Incunabular Period: Toward Web 2.0 Gideon Burton Asst. Prof. of English Assoc. Editor, BYU Studies Presentation to the Harold B. Lee.
Chapter 17: Client/Server Computing Business Data Communications, 4e.
TC2-Computer Literacy Mr. Sencer February 4, 2010.
Thee-Framework for Education & Research The e-Framework for Education & Research an Overview TEN Competence, Jan 2007 Bill Olivier,
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
The Subject Librarian's Role in Building Digital Collections: Where Information Management and Subject Expertise Meet Ruth Vondracek Oregon State University.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Introducing Symposia : “ The digital repository that thinks like a librarian”
John OckerbloomDec. 6, 2002 Supporting learning at the library Towards integrating LMS and digital library technology at Penn John Mark Ockerbloom CNI.
HAN Conference © History Data Service The History Data Service : Promoting Good Practice and Standards of Scholarship Cressida Chappell Head of.
Digital Repository Service ___________________________ Yale University Library Audrey Novak, Head IS&P 7 March 2007.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Corporation For National Research Initiatives NSF SMETE Library Building the SMETE Library: Getting Started William Y. Arms.
Knowledge Portals and Knowledge Management Tools
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
GyanSys Contact: Phone: Simplified Document Management Faster, Better, Stronger Search Integration - Site Mailbox.
Anthony Atkins Digital Library and Archives VirginiaTech ETD Technology for Implementers Presented March 22, 2001 at the 4th International.
1 CP586 © Peter Lo 2003 Multimedia Communication Introduction to Multimedia and Development Tools.
Digital Library Architecture and Technology
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
XML: The Strategic Opportunity Roy Tennant Challenges*  Only librarians like to search, everyone else likes to find  Our users want more information.
Internet Based Information Sources on Urbanism - Tutorial - Authors: D. Milovanovic, D. S. Furundzic, yubc.net.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Web Services Architecture1 - Deepti Agarwal. Web Services Architecture2 The Definition.. A Web service is a software system identified by a URI, whose.
An introduction to digital libraries Jens Vigen, 2 nd CERN-UNESCO School on Digital Libraries, Rabat, Morocco, Nov
Amos Kujenga ADLSN Training Coordinator Addis Ababa, Ethiopia 5 – 7 November 2014 Introduction To Digital Libraries and Repositories.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Choosing Delivery Software for a Digital Library Jody DeRidder Digital Library Center University of Tennessee.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Unit no. 5 Digital Library Adolf Knoll National Library of the Czech Republic © Adolf Knoll, National Library of the Czech Republic.
Government IT Professionals Online Survey Results FINAL REPORT September 2010.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
The Future of Libraries is Digital. Automated Reference Assistant Initial funding from the McCune Foundation Further funding being sought from the Buhl.
John Mark OckerbloomMay 10, 2004 The Typed Object Model Support for diverse formats John Mark Ockerbloom File Formats for Preservation Seminar May 10,
Chapter 17: Client/Server Computing Business Data Communications, 4e.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
By Laili Hashim Rafidah Abd. Aziz laili Hashim & Rafidah Aziz.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
WebFOCUS Magnify: Search Based Applications Dr. Rado Kotorov Technical Director of Strategic Product Management.
Building an Infrastructure for Digital Humanities: Issues and Considerations Peter Zhou 周欣平 University of California, Berkeley October 8, 2009.
CSE 102 Introduction to Computer Engineering What is Computer Engineering?
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
+ The Use of Databases in the Instructional Program Increasing Rigor and Inquiry Throughout the Curriculum Donna Dick, Jacob Gerding, and Michelle Phillips.
Kevin Gilbertson - Web Services Librarian Jean-Paul Bessou - Systems Librarian Z. Smith Reynolds Library Wake Forest University Growing Your Own Next-Gen.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
The Future of Scholarly Communication & the Role of Libraries Roy Tennant eScholarship, The California Digital Library.
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Transparent Format Migration of Preserved Web Content D. S. H. Rosenthal, T. Lipkis, T. S. Robertson, S. Morabito Lib Magazine, 11(1), 2005
DIGITAL INFORMATION SOURCES, RESOURCES AND E-LEARNING : SCOPE AND CHARACTERISTICS.
Databases vs the Internet
Databases vs the Internet
DIGITAL LIBRARY.
Metadata to fit your needs... How much is too much?
Chapter 17: Client/Server Computing
Presentation transcript:

Building the digital library community John Mark Ockerbloom Carnegie Mellon University February 8, 1999

The library community Centuries of people who have created the works we can read and research in the library The modern-day community of library maintainers and users More than just a collection of information and people –information is organized in a usable manner –people are trained professionals, supported by their universities, governments More than just one library

The digital revolution Population of the Internet (both people and data) is growing exponentially –much of the information library users want is now digital Particular appeals: –Near-instant, cheap access to information –Breadth (and sometimes depth) of content –Easy to store, serve large quantities of information –Just about anyone can easily publish content, as well as “consume” it Although… –it’s often hard to find the high-quality information you really need (obscure, or not put on-line) –little coordination of information –distractions abound (noise mixed in with signal)

A vision: Revolutions in learning, teaching, creating, collaborating Very large libraries (tens of millions of “volumes”) available to scholars, students, general public Distributed: enables distance learning, collaboration Accessible quickly, at low cost, through a variety of methods Easily searchable, citable, extendable, preservable New kinds of resources available (e.g. conceptual networks, intelligent tutors)

A nightmare? Thousands of uncoordinated, isolated systems? Prohibitive cost structures? Hard to use systems, lacking the affordances of previous media? Data and metadata that become unusable within years -- or less? –“404 Not found” –Vendor becomes unreachable… content becomes unreadable!

An opportunity Libraries can take the lead in forming communities for digital information –have expertise in managing information and organizing it for users –acquire much of the content University libraries can play special role –have large collections, variety of expertise –University DLs both serve the university and promote it –they need to be both conservative and innovative Challenges are both technical and social –library science, computer science, HCI, sociology/anthropology... –organization and support (politics, economics…)

Some design challenges in digital libraries Acquisition Cataloging/Searching –can be done at much finer grain –new ways of searching for things –new kinds of metadata may be important Access control Presentation / Interface Preservation and Maintenance

A key design principle: Sharing the work Useful even at small scale –Coordinated cataloging: The On-Line Books Page »8400 listings, 1M hits/month (60% nongraphical) –Sharing crucial metadata: Catalog of Copyright Entries –Coordinated acquisition: Catholic Encyclopedia –Inter-project dialogue: Book People mailing list Larger libraries, projects can enhance each other’s collections at larger scale

A specific problem: Data format mismatch Much of the information in a digital library is from outside sources, in variety of formats Most clients only understand a few formats They therefore cannot effectively use many materials –data may be in incomprehensible form –data may be in form not easily worked with Particularly problematic: –formats that have complex (but useful) structure –legacy data and programs (obsolete format assumptions) Most of the information in large libraries is “legacy”; long lifespan essential!

Standards are a partial solution “The wonderful thing about standards is that there are so many to choose from” –Data: SGML/XML, Word processor formats, HTML, PDF, Quark, specialized scientific formats, page image formats…. –Metadata: USMARC, Dublin Core, RDF... Standards allow common understandings... …But no one standard fits all –different sources may make different data choices –lowest common denonimator often not good enough –needs, applications, standards change (sometimes quickly)

TOM: A data model for mediating among diverse data formats Allows unfamiliar formats to be –operated on via outside services that understand the format –related to familiar formats –converted into usable formats »for the needs of a particular application or user »for migration from obsolete technology to new technology Works for data that is: –accessible as a (typed or typable) sequence of bytes, or –accessible through a well-defined, working protocol

TOM lets you get this...

…from this From: Sherry T Haddock To: Subject: CAETI Community Meeting Info Date: Thu, 15 Feb :12: (CST) Mime-Version: 1.0 Content-Type: MULTIPART/MIXED; BOUNDARY=" =:20798" Cc: Sherry T Haddock This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. Send mail to for more info =:20798 Content-Type: TEXT/PLAIN; charset=US-ASCII Here are maps detailing the March CAETI Community Meeting Location.... Thanks again, Sherry =:20798 Content-Type: TEXT/PLAIN; charset=US-ASCII; name="CaetiMap.hqx" Content-Transfer-Encoding: BASE64 Content-ID: Content-Description: KFRoaXMgZmlsZSBtdXN0IGJlIGNvbnZlcnRlZCB3aXRoIEJpbkhleCA0LjAp DQoNCjojODBLQ0E0VCklZUtGISI2NiUzYzgmIjgtYCMzIiQpIU4hLSlEMyVt ZC1tNGkrJ2EnWiUhTiIhbCEhLSFyW20NCg0KKiEhQiFOIVgiISohJCEzIzMj IiEhISFtIU4hLSIhKiEkcltxMyFgIzMjMnEzcnJxM1hJaHJOIS1BISohJCFg Iw0KDQozIWAzIU4hLSYhKiEkIkojMyFgRiFOIS0pISohJCMzIzMhYFMhTiEt LCEqISQkISMzIWBkIU4hLTEhKiEkcltxDQoNCjMhcmxyTiEtNCEqISQlSiMz IWEtIU4hLTghKiEkJjMjMyFhQiFOITJxcmohJHJbcTNycnEzVCEiNSEqIXE ( ed, MIME-attached, base64, binhexed, Powerpoint 3)

TOM: Key ideas Formats can be described by what information they contain how they represent it how they relate to other formats Object-oriented models capture these aspects I use an object-oriented metadata schema to describe data formats Much useful format info, services, distributed throughout the Net. Mediators give uniform access to diverse knowledge bases, services I use a network of mediators (type brokers) to assist with unfamiliar formats 1 2 So:

The architecture supporting TOM (simplified) Type Broker ServerClient Server Type Broker Clients get info on formats, request operations (e.g. conversions) Servers implement operations Brokers maintain info on formats, invoke servers for operations Brokers can trade info, consult other brokers Clients can also register new formats, operations, server information...

What’s good about this design? It’s simple (and therefore flexible): –Minimal, basic, well understood standards It’s accommodating: –Describes past, present and future data formats with good breadth and depth of expressiveness –It can be composed with a wide variety of programs and databases (including the Web, off-the-shelf programs) –Benefits start with very low investment, then increase It’s scalable (largely by taking advantage of distributed, interactive nature of Net): –Anyone can define new formats and services –Brokers coordinate contributions from Net community

A cooperative library network (simplified) Library Broker ServerClient Server Library Broker Clients get info on materials, request services (search, convert...) Servers provide materials, services Brokers maintain info on services, invoke servers for operations Brokers can trade info, consult other brokers Clients can also register new materials, metadata, services...

The importance of open content Sharing content and/or metadata gives each library boost from others Enables distributed indexing and cross- referencing –Alta Vista et al made possible by open content Enables replication, minimizing risk of information loss More flexibility in adapting and migrating information to new situations Users can improve, augment resources and feed them back to libraries

Summary The best large-scale digital libraries are built around community –technically: distributed, cooperative infrastructure (e.g. broker architecture of TOM) developed by experts in multiple domains –socially: cooperation between disciplines, organizations; designs that meet the needs of various constituencies University libraries like Penn’s can take a leading role in creating DL community –Have much of the collections, experts –Can provide testbeds, help their users, gain visibility Potential for revolutionary benefits, with the right designs

To find out more: TOM home page: –Conversion service, other demos, technical details, thesis document The On-Line Books Page: –Catalog, Book People archives, copyright entries, selected resources on on-line texts and libraries Personal home page: