Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.

Slides:



Advertisements
Similar presentations
CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
Advertisements

A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
Management Information Systems, Sixth Edition
Project 1 Introduction to HTML.
Metadata: An Introduction By Wendy Duff October 13, 2001 ECURE.
XML Prashant Karmarkar Brendan Nolan Alexander Roda.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
1 Introduction The Database Environment. 2 Web Links Google General Database Search Database News Access Forums Google Database Books O’Reilly Books Oracle.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
1 WMES3103 : INFORMATION RETRIEVAL WEEK 13 DIGITAL LIBRARIES.
1st Project Introduction to HTML.
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
HTML 1 Introduction to HTML. 2 Objectives Describe the Internet and its associated key terms Describe the World Wide Web and its associated key terms.
Chapter ONE Introduction to HTML.
Cluj Napoca, 28 August IEEE International Conference on Intelligent Computer Communication and Processing Digital Libraries Workshop Towards.
Digital Library Architecture and Technology
Chapter 1 Introduction to HTML, XHTML, and CSS
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
Dspace 1 Introduction to DSpace Mukesh Pund Scientist NISCAIR, New Delhi.
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Database System Concepts and Architecture
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
Architecture for a Database System
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
1 NDLTD Welcome and Introduction ETD 2011: 14 th Int. Symp. on ETDs Cape Town, South Africa Edward A. Fox Executive Director, NDLTD,
Introduction to Omeka. What is Omeka? - An Open Source web publishing platform - Used by libraries, archives, museums, and scholars through a set of commonly.
Shruthi(s) II M.Sc(CS) msccomputerscience.com. Introduction Digital Libraries have become the source of information sharing across the globe for education,
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Chapter 1 1 Lecture # 1 & 2 Chapter # 1 Databases and Database Users Muhammad Emran Database Systems.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
1 Chapter 1 Introduction to Databases Transparencies.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Digital Libraries Lillian N. Cassel Spring A digital library An informal definition of a digital library is a managed collection of information,
Benjamin Post Cole Kelleher.  Availability  Data must maintain a specified level of availability to the users  Performance  Database requests must.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
1 Video Message: Welcome ETD 2015: 18 th Int’l Symposium on ETDs New Delhi, India Edward A. Fox Executive Director, Chairman of the Board NDLTD,
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
DSpace - Digital Library Software
1 IBM Academic Initiative Introduction for Pamplin School of Business Virginia Tech – October 13, 2011 “IBM Academic Skills Cloud and Computing Education.
Overviews of the Library of Texas & ZLOT Project Dr. William E. Moen Principal Investigator.
HTML Concepts and Techniques Fifth Edition Chapter 1 Introduction to HTML.
Visual Semantic Modeling of Digital Libraries Qinwei Zhu, Marcos André Gonçalves, Rao Shen, Edward A. Fox – Virginia Tech,, Blacksburg, VA, USA Lillian.
Object storage and object interoperability
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
SCENARIO-BASED GENERATION OF DIGITAL LIBRARY SERVICES Rohit Kelapure, Marcos André Gonçalves, Edward A. Fox Virginia Tech, Blacksburg, VA, USA.
Chapter 1 Introduction to HTML, XHTML, and CSS HTML5 & CSS 7 th Edition.
Designing Protocols in Support of Digital Library Componentization Hussein Suleman and Edward A. Fox Digital Library Research Laboratory Virginia Tech.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
DISSERTATION COLLECTIONS DISSERTATION COLLECTIONS NETWORKED DIGITAL LIBRARY OF THESES AND DISSERTATIONS
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
HTML PROJECT #1 Project 1 Introduction to HTML. HTML Project 1: Introduction to HTML 2 Project Objectives 1.Describe the Internet and its associated key.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
Chapter 1 Introduction to HTML.
Introduction Multimedia initial focus
Project 1 Introduction to HTML.
NSDL Data Repository (NDR)
The Database Environment
Database Design Hacettepe University
Presentation transcript:

Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet and tomorrow’s universally accessible digital repositories of all human knowledge The President’s Information Technology Advisory Committee

Traditional Library

Digital Libraries have been positioned at the intersection of Library and Information Science Computer Science Networked System

Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet and tomorrow’s universally accessible digital repositories of all human knowledge The President’s Information Technology Advisory Committee

History of Digital Library Janus Digital Library, 1993, $105,000 Digital Library Phase I, , $24 millions, 6 major projects Digital Library Phase II, 1998 – now about 145 millions, about 30 projects each year

1993, $105000, “electronic preservation”

Digital Library Phase I , $24 millions, 6 major projects

1 1, Integrated Speed, Image and Language Understanding for Creating Digital Video Library Carnegie Mellon University. This is the only one focused on Video Medium.

2, Interoperation mechanisms among heterogeneous services Stanford University. This project is focused on providing a uniform way to access a variety of servers and information sources. --- InfoBus Protocol.

3, a prototype of a scalable, intelligent, distributed electronic library University of California at Berkeley. A prototype for environmental information.

4, Towards a Distributed Digital Library University of California at Santa Barbara. This project is about “Digital Earth”, a collection of information about the world.

5, Digital Library infrastructure for a University Engineering Community University of Illinois at Urbana_Champaign. It provides effective access to engineering and physics journal articles.

6, Intelligent agents for information location University of Michigan. Combines the traditional library and internet technologies to provide the best support for their users.

Digital Library Phase II Start from 1998, $145 million, about 30 projects each year,

The Theory of Digital Library In the earlier years, the theory of digital libraries was based on its structures and its behaviors. In 2001, Edward A. Fox from Virginia Polytechnic Institute and State University, propose the fundamental abstractions of Streams, Structures, Spaces, Scenarios and Societies (5S), the 5S theory.

Structure and Behavior Hypertext Information Storage (Database System) Information Retrieval Multimedia Services Human Computer Interaction Program Language Interoperation

Information Storage A digital library must be capable of storing a large amount of data in a variety of formats and be able to access this data as quickly as possible., Relational database,, Active Database,, Mobile Database,, Multiple Database, Object Oriented Database

Relational Database A relationship between the tables

Active Database An automatic reaction by event-condition- action rules.

Mobile Database Dynamic data and location, Currency Protocol

Multiple Database A Multiple Database System consists of a collection of autonomous and heterogeneous local databases.

Object-Oriented Database Object-oriented databases are designed to work well with object-oriented programming languages such Java, C#, and C++. This is because object-oriented databases used the same exact model as object-oriented programming languages.

Information Retrieval Metadata searching Full-text searching Union search platform

Metadata Searching Metadata– data about data, structured data, data about “Who, What, Where, When” Metadata tags: Title, Creator, Subject, Description, Publisher, Contributor, Date, Type, Format, Identifier, Source, Language, Relation, Coverage, Right. Metadata Attributes: Name, Identifier, Version, Registration, Authority, Language, Definition, Obligation, Data Type, Maximum Occurrence, Comment.

Full-Text Searching This is a example of searching for the string, “Visual basic, Oracle”. It will search throughout a document to find a match.

Union Search Platform Various providers produce the many types of database retrieval systems that exists today. End users want the ability to access different types of data using a universal interface. The solution to this problem is to create a new application that integrates multiple search requests into a union search platform.

Problems: 1, Why should each digital library start from scratch? 2, Interoperability across heterogeneous digital library systems.

A Fundamental Digital Library Theory 5S theory

Streams Definition: A stream is a sequence whose codomain is a nonempty set. A sequence of abstract items, used to describe both static and dynamic content. It can be text, video, audio, or a software program.

Structures Definition: A structure is a tuple (G, L, F), where G = (V, E) is a directed graph with vertex set V and edge set E, L is a set of label values, and F is a labeling function F: (V  E)  L. A labeled directed graphs which imposes organization. Collection, catalog, hypertext, document, metadata, organizational tool. How is the information organize?

Spaces Definition: A space is a measurable space, measure space, probability space, vector space or a topological space. Contains rules to operate on the abstract items. User interface, index, retrieval model. Different logic and presentational properties. The operation of digital library components.

Scenarios A sequences of events or actions in order to accomplish a functional requirement. Service, event, condition, action Communication between users and software developers. Definition: A scenario is a sequence of related transition events (e 1, e 2, …,e n) on state set S such that e k = (s k, s k+1 ), for 1  k  n.

Societies Definition: A society is a tuple (CR), where C = {c 1, c 2, …,c n } is a set of conceptual communities. R= {r 1, r 2, …,r n } is a set of relationships. A set of entities and activates, and the relationships between them. Community, managers, actors, classes, relationships, attributes, operations. Actors and managers act together to carry out the digital library behavior.

Digital Library is a collection of digital object. Definition: A digital library is a 4-tuple (R, DM, Serv, Soc), where R is a repository; DM is a metadata catalog, Serv is a set of services containing at least services for indexing, searching, and browsing; Soc is a society of users of the digital library.

5S Language is an XML realization of the 5S model.

Study Case

NDLTD In Virginia Tech. 177 universities and 27 institutions in worldwide. A student creates a ETD file (Electronic Theses and Dissertation) from his or her theses and dissertation. The ETD file is then checked for formatting errors and quantity requirements. The ETD file is then cataloged and placed on a electronic bookshelf.

, Stream Model:

, Model:, Structural Model: Electric Thesis and Dissertation – Metadata Structure

This is a part of the code.

, Spatial Model, Spatial Model

, Scenario Model:, Scenario Model: An example scenario of a searching service in an NDLTD DL.

, Societal Model, Societal Model

Digital Library Generation Process with 5SL.

5SGraph, A Domain-Specific Visual Modeling Tool

Digital Library In a Box Simplifies and enables the creation of a digital library Can be developed with little or no programming Built with an interoperable design Creates a minimal digital library in less than an hour

Open Digital Library The goal is universal access to digital libraries and information services.

Report to the President Digital Libraries: Universal Access to Human knowledge

Report to the President, Digital Libraries: Universal Access to Human knowledge