Prototyping Digital Libraries Handling Heterogeneous Data Sources – An ETANA-DL Case Study Unni Ravindranathan, Rao Shen, Marcos André Gonçalves, Weiguo Fan, Edward A. Fox, James W. Flanagan

Presentation transcript:

Prototyping Digital Libraries Handling Heterogeneous Data Sources – An ETANA-DL Case Study Unni Ravindranathan, Rao Shen, Marcos André Gonçalves, Weiguo Fan, Edward A. Fox, James W. Flanagan Virginia Tech, Blacksburg, VA, USA (and CWRU) ECDL 2004, Bath, England, September 2004

Acknowledgements (Selected)
  Sponsors: NSF grant ITR ; AOL, ASOR, CWRU, ETANA, Vanderbilt U., Virginia Tech
  Faculty/Staff: Lillian Cassel, Debra Dudley, Roger Ehrich, Manuel Perez, Naren Ramakrishnan
  VT (Former) Students: Aaron Krowne, Ming Luo, Fernando Das Neves, Ricardo Torres, Hussein Suleman

Acknowledgements (contd.)
  Karen Borstad, MPP
  Douglas Clark, Walla Walla College
  Joanne Eustis, CWRU
  Nick Fischio, CWRU
  Paul Gherman, Vanderbilt U.
  Andrew Graham, U. Toronto
  Tim Harrison, U. Toronto
  Larry Herr, Canadian University College
  Christopher Holland, LRP
  Paul Jacobs, Mississippi State U.
  Douglas Knight, Vanderbilt U.
  Stan LaBianca, Andrews U.
  David McCreery, Willamette U.
  Eric Meyers, Duke U.
  Adam Porter, Illinois College
  Jack Sasson, Vanderbilt U.
  Tom Schaub, Indiana U. of Penn.
  Randall Younker, Andrews U.

Outline
  Problems
  Background
  Approach
  ETANA-DL
  ETANA-DL Prototype System
  Modeling ETANA-DL
  ETANA-DL Services
  Analysis
  Conclusions
  Future Work

Problems
  Interoperability among heterogeneous archaeological systems
  Delay in publication of primary archaeological data
  Lack of sustainable solutions to long-term preservation of valuable information
  Lack of services useful to the archaeology community, including “traditional DL services”
  Difficulty in understanding complex archaeological information systems
  Difficulty in requirements elicitation for archaeological systems

Open Archives Initiative
  Promotes interoperability among DLs
  Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)
  Data Providers possess metadata and share it (internally / externally) via well-defined OAI protocols (e.g., database servers)
  Service Providers harvest data from Data Providers and provide higher-level services to users
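The harvesting side of OAI-PMH can be sketched in a few lines. This is a minimal illustration rather than ETANA-DL's code: the base URL and the sample response are made up, while the `ListRecords` verb, the `metadataPrefix` and `resumptionToken` arguments, and the XML namespace come from the OAI-PMH 2.0 specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespace defined by the OAI-PMH 2.0 specification.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL for a Data Provider."""
    if resumption_token is not None:
        # In OAI-PMH, resumptionToken is an exclusive argument:
        # it replaces metadataPrefix on follow-up requests.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (record identifiers, resumption token or None)."""
    root = ET.fromstring(xml_text)
    identifiers = [el.text for el in root.iter(OAI_NS + "identifier")]
    token_el = root.find(f"{OAI_NS}ListRecords/{OAI_NS}resumptionToken")
    token = token_el.text if token_el is not None and token_el.text else None
    return identifiers, token


# Hypothetical response from a hypothetical Data Provider.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:bone-1</identifier></header></record>
    <record><header><identifier>oai:example.org:bone-2</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

ids, token = parse_list_records(SAMPLE_RESPONSE)
next_url = list_records_url("http://example.org/oai", resumption_token=token)
```

A Service Provider would repeat the request with each returned token until the response carries none.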

Traditional Digital Libraries (diagram)
  Users
  Digital Library: monolithic and/or custom-built web-based application
  Digital Objects: Documents, Programs, Images, Video

Introduction to ODL (Open Digital Libraries)
  Framework for componentized Digital Libraries
  Design principles for components
  Protocols for inter-component communications
  Built upon OAI

Open Digital Libraries Approach (diagram)
  Labels shown: Users, ETANA-DL, Sites
  User interface
  Components: Search, Browse, Recent, Filter, Union
  Collections: Bone, Seed, Figurine, Pottery

Basic ODL Model: An application for Archaeology (diagram)
  Labels shown: Nimrin, OAI Data Provider, ETANA-DL Union Catalog, ETANA-DL Search Engine, ODL Service Provider Component, User Interface, WWW Interface
  Link labels: OAI-PMH, ODL Protocol
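The union catalog's role in this model can be illustrated with a small merge routine. It is a sketch under assumptions, not the ETANA-DL implementation: the `HarvestedRecord` type, the `merge_into_union` function, and the sample records are hypothetical, and "newest datestamp wins" is one plausible update policy among several.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class HarvestedRecord:
    """Hypothetical shape of one record harvested from a Data Provider."""
    identifier: str          # OAI identifier, unique across providers
    datestamp: str           # ISO date such as "2004-05-01"
    metadata: dict = field(default_factory=dict)


def merge_into_union(catalog, harvested):
    """Add harvested records to the catalog, keeping the newest version of each."""
    for rec in harvested:
        current = catalog.get(rec.identifier)
        # ISO dates in the same format compare correctly as plain strings.
        if current is None or rec.datestamp > current.datestamp:
            catalog[rec.identifier] = rec
    return catalog


union = {}
merge_into_union(union, [
    HarvestedRecord("oai:nimrin:bone-1", "2004-03-01", {"animal": "goat"}),
    HarvestedRecord("oai:nimrin:seed-7", "2004-03-02", {"species": "barley"}),
])
# A later harvest brings an updated bone record and an older, stale seed record.
merge_into_union(union, [
    HarvestedRecord("oai:nimrin:bone-1", "2004-04-15", {"animal": "sheep"}),
    HarvestedRecord("oai:nimrin:seed-7", "2004-01-01", {"species": "wheat"}),
])
```

Service components such as the search engine would then read from the merged catalog instead of contacting each site separately.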

Componentized services example (diagram)
  Components: User, User Interface, Search Handler Servlet, IRDB Search Engine, Index DB
  Flows: Query; Query in the IRDB query language; Results in XML; Parsed XML; Results
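The flow on this slide (handler forwards a query, the engine answers in XML, the handler parses it for the interface) can be mimicked with a toy example. Everything here is a stand-in: `TinySearchEngine` is not the IRDB engine, its whitespace-separated query format is not the IRDB query language, and the `<results>`/`<hit>` element names are invented for illustration.

```python
import xml.etree.ElementTree as ET


class TinySearchEngine:
    """Stand-in search component backed by an in-memory inverted index."""

    def __init__(self, docs):
        self.index = {}
        for doc_id, text in docs.items():
            for term in text.lower().split():
                self.index.setdefault(term, set()).add(doc_id)

    def search(self, query):
        """Return matches for all query terms as an XML string."""
        terms = query.lower().split()
        if terms:
            hits = set.intersection(*(self.index.get(t, set()) for t in terms))
        else:
            hits = set()
        root = ET.Element("results")
        for doc_id in sorted(hits):
            ET.SubElement(root, "hit", id=doc_id)
        return ET.tostring(root, encoding="unicode")


def handle_search(engine, query):
    """Play the handler role: send the query, parse the XML, return ids."""
    xml_results = engine.search(query)
    return [hit.get("id") for hit in ET.fromstring(xml_results).iter("hit")]


engine = TinySearchEngine({
    "n1": "goat bone Nimrin",
    "u1": "sheep bone Umayri",
    "l1": "figurine Lahav",
})
bone_hits = handle_search(engine, "bone")
```

Because the handler only depends on the XML exchanged, the engine behind it could be swapped without touching the user interface, which is the point of the componentized design.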

5S Model – Informally
  Digital libraries are complex information systems that:
    help satisfy info needs of users (societies)
    provide info services (scenarios)
    organize info in usable ways (structures)
    present info in usable ways (spaces)
    communicate info with users (streams)

Solution – our approach
  Applying and extending Digital Library (DL) techniques to solve the following problems: interoperability, making primary data available, data preservation
  Modeling archaeological information systems using 5S theory to better understand the domain and to design the system and the supported services
  Rapidly prototyping DLs that handle heterogeneous archaeological data using componentized frameworks: eliciting requirements, providing useful services

ETANA-DL
  Archaeological Digital Library
  Applies and extends the OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting)
  Design considerations
    Componentized
    Distributed architecture
    Extensible
    Portable

ETANA Digital Library Core Components - DigBase
  DigBase (DB)
    Central repository - stores metadata
    Union catalog - for the collections in ETANA-DL
    Various kinds of digital objects - excavation records, images, text collections, etc.
    General services - Search, Browse, Annotate, Recommend, etc.
    Archaeology-specific services - artifact analysis, visualizations, artifact interpretation, workflows, etc.

ETANA Digital Library Core Components - DigKit
  DigKit (DK)
    A suite of tools for collecting and recording archaeological data in the field, which can be used for a new dig
    Metadata will migrate to DigBase (DB)
    Real-time collaborative archaeology: metadata in DB will be rapidly available to others

Architecture (diagram)
  Labels shown: Archaeological Site, DigKit, Data Mapping Component, OAI Data Provider, OAI, XOAI, DigBase DB, Union Catalog, Index, Inverted Files, Search Component, Browse Engine, Browse DB, DB used by Services, Other ETANA-DL Services, Web Interface, ETANA-DL, Configure

Modeling ETANA-DL – An Archaeological DL Meta-model (diagram)
  Stream model: Text, Video, Audio, Drawing, Photo, 3D
  Structure model: Region, *Site, *Partition, *Sub-partition, *Locus, *Container, *Artifact; Metadata; Taxonomies (Temporal, Spatial, Artifact-specific)
  Space model: Geographic space, Metric space, User interface
  Society model: Archaeologist, General public, Service Manager
  Scenario model: Services (Information Satisfaction, Value added, Repository building, Domain specific)

Modeling ETANA-DL – The ETANA-DL model (diagram)
  Stream model: Field record, locus sheet; Figurine image (photo); Site/field plan (drawing); Preliminary/Final Report (application/pdf)
  Structure model:
    Jordan, Umayri: *Field, *Square, *Locus, *Pail, *Bone
    Jordan Valley, Nimrin: *Quadrant, *Square, *Locus, *Bag, *Bone, *Seed
    Southern Israel, Halif: *Field, *Area, *Locus, *Basket, *Figurine
    Taxonomies: Archaeological periods, Bone type, Seed species, Spatial
  Space model: Site-specific coordinate system, Web interface, Vector space
  Society model: Archaeologist, Generic public, ETANA-DL Service Manager
  Scenario model: Services (Searching, Browsing; Annotation, binding; Harvesting, Converting; Object comparison, marking item for analysis)

Modeling ETANA-DL – Mapping heterogeneous data to the structural model

  Site     Partition     Sub-partition            Locus         Container
  Lahav    Field I       Area A8                  Locus A8074   Basket 224
  Nimrin   Quadrant NW   Quadrant Value N25/W50   Locus 96      Bag 240
  Umayri   Field A       Square 7J59              Locus 001     Pail 12
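The mapping on this slide amounts to renaming each site's own spatial levels to the shared Site / Partition / Sub-partition / Locus / Container hierarchy. A minimal sketch follows, using the example values from the slide. The dictionary keys such as `quadrant_value` and the function `to_common` are hypothetical names; the actual field names in the source databases are not given in the slides.

```python
# Per-site mapping from a common level to that site's (hypothetical) local field name.
SITE_MAPPINGS = {
    "Lahav": {"partition": "field", "sub_partition": "area",
              "locus": "locus", "container": "basket"},
    "Nimrin": {"partition": "quadrant", "sub_partition": "quadrant_value",
               "locus": "locus", "container": "bag"},
    "Umayri": {"partition": "field", "sub_partition": "square",
               "locus": "locus", "container": "pail"},
}


def to_common(site, record):
    """Translate one site-specific record into the common structure."""
    common = {"site": site}
    for level, local_name in SITE_MAPPINGS[site].items():
        common[level] = record[local_name]
    return common


nimrin = to_common("Nimrin", {"quadrant": "NW", "quadrant_value": "N25/W50",
                              "locus": "96", "bag": "240"})
umayri = to_common("Umayri", {"field": "A", "square": "7J59",
                              "locus": "001", "pail": "12"})
```

After translation, records from different digs share one set of keys, so a single search or browse service can treat them uniformly.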

Data Mapping

ETANA-DL Schema Design (diagram)
  ETANA-DL Object: Owner, Subpartition, Partition, Locus, ID, Container, Collection, ……
  Bone: Count, Animal, ……
  Seed: Species, Name, ……
  Figurine: Description, Dimensions, ……
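One way to read this schema is as a shared base object plus one specialization per artifact type. The sketch below assumes that reading: the grouping of attributes under Bone, Seed, and Figurine is inferred from the order of labels on the slide, the field types are guesses, and the sample values are invented.

```python
from dataclasses import dataclass, asdict


@dataclass
class EtanaObject:
    """Attributes assumed to be common to every harvested object."""
    object_id: str
    owner: str
    collection: str
    partition: str
    subpartition: str
    locus: str
    container: str


@dataclass
class Bone(EtanaObject):
    animal: str = ""
    count: int = 0


@dataclass
class Seed(EtanaObject):
    species: str = ""
    name: str = ""


@dataclass
class Figurine(EtanaObject):
    description: str = ""
    dimensions: str = ""


# Invented example record; only the attribute names come from the slide.
bone = Bone("bone-1", "Nimrin", "Nimrin bones", "NW", "N25/W50", "96", "240",
            animal="goat", count=3)
common_fields = set(asdict(bone)) - {"animal", "count"}
```

Services that only need location or ownership can work against `EtanaObject`, while artifact-specific services use the subclass fields.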

ETANA-DL Services: Categories
  Information satisfaction: Searching, Browsing, Recommendation
  Archaeology (Domain) specific: Object comparison, Marking items
  Value-added: Annotation, Items of interest (Binding service), Recent searches/discussions
  User management

Searching: Search Interface

Searching: Search Results

Searching: Advanced Search

Searching: Advanced Search Results

Multi-Dimensional Browsing
  Site structure
  Temporal
  Object-specific
  User context
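Browsing along several dimensions at once behaves like faceted filtering: each chosen dimension narrows the current result set. The following sketch shows the idea only. The `browse` function, the record keys, and the sample values (including the period names) are hypothetical and not taken from ETANA-DL.

```python
def browse(objects, **facets):
    """Keep the objects whose fields match every selected facet value."""
    return [o for o in objects
            if all(o.get(key) == value for key, value in facets.items())]


# Invented sample catalog covering three browsing dimensions.
CATALOG = [
    {"id": "n1", "site": "Nimrin", "period": "Iron Age", "object_type": "bone"},
    {"id": "n2", "site": "Nimrin", "period": "Bronze Age", "object_type": "seed"},
    {"id": "u1", "site": "Umayri", "period": "Iron Age", "object_type": "bone"},
    {"id": "l1", "site": "Lahav", "period": "Iron Age", "object_type": "figurine"},
]

iron_age = browse(CATALOG, period="Iron Age")
# Narrowing within the current context keeps earlier choices and adds one more.
iron_age_bones = browse(iron_age, object_type="bone")
nimrin_iron_age_bones = browse(iron_age_bones, site="Nimrin")
```

Keeping the list of facets chosen so far is also enough to restore a browsing context later or to run a search inside it.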

Searching within a Context

Searching within a Context: Search Results

Restoring Browsing Contexts

Object Comparison: Selecting Objects for Comparison

Object Comparison: Editing Attributes

Object Comparison: Comparing Objects

Object Comparison: Comparison Results

Marking items

Viewing marked items

Remarking items

Discussion Board (Annotation): View Messages

Discussion Board (Annotation): Post Messages/Replies

Collections Description

Other services
  Items of Interest (Binding service)
  Recent searches/discussions
  Recommendation
  User management
    Account creation
    Login

Items of Interest: Binding Service

Recent Searches/Discussions

Recommendation

User Management: New User Account

User Management: Login

User Management: Navigations

Heterogeneous data handling
  Columns: Site; Artifact Type; Original data source; Number of attributes in original record; Number of attributes in harvested record; Number of records harvested
  Lahav; Figurine; Tab-delimited text file
  Nimrin; Bone field record; Table in Oracle DB
  Nimrin; Seed field record; Table in Oracle DB
  Umayri; Bone field record; 2 tables in Access DB
  Total 10537

Heterogeneous data handling
  Columns: Site; Data Analysis (in hours); Data Mapping (in hours); Data Provider Implementation (in hours); Service Provider Implementation (in hours)
  Lahav
  Nimrin 48 41
  Umayri
  Total

Heterogeneous data handling

Rapid prototyping: Lines of Code
  Columns: Type of Service; LOC for implementing service; LOC reused from components; Total LOC; Reuse Percentage
  Rows: Componentized; Non-componentized; Total
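The column names suggest how the reuse figure is derived. The sketch below assumes that Total LOC is the sum of newly written and reused lines and that Reuse Percentage is the reused share of that total; the slide does not state the formula, and the numbers used here are invented, not the study's results.

```python
def reuse_percentage(loc_written, loc_reused):
    """Share of a service's total code that came from reused components, in percent."""
    total = loc_written + loc_reused
    if total == 0:
        return 0.0
    return 100.0 * loc_reused / total


# Invented example: 200 new lines on top of 600 reused lines.
example = reuse_percentage(200, 600)
```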

Rapid prototyping: Service development times (chart comparing componentized services and non-componentized services)

User Analysis
  Initial comments from all 3 projects, plus others interested in ETANA-DL
  Positive feedback – users liked:
    Data integration
    Prototype cross-collection information access services
    Information structuring
    Utility of supported services
  Negative feedback – user concerns:
    Need for service enhancements
    Usability

Conclusions
  Applied 5S to the archaeological domain
  Identified requirements for future versions of the system
  Extensible and componentized approach for handling heterogeneous archaeological data from disparate sources
  Rapidly generated prototype archaeological DL
  Making primary archaeological data available without significant delay

Future Work
  Componentizing current DL services
  Creating next-generation DL services from an expanding set of requirements
  Integrating richer content
  (Semi-)automatic data mapping
  Automating the ingest of DL content
  Enhancing interface capabilities
  Formal usability studies

Visual Browsing Visual Browse By sites

Visual Browsing: Topographical Drawings
  Full site
  North west quadrant
  Square: N40/W20

Visual Browsing: Square information Loci layout Square: N40/W20 Locus: 86

Visual Browsing: locus sheet

Publications
  1. U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, E. A. Fox, J. W. Flanagan. ETANA-DL: A Digital Library for Integrated Handling of Heterogeneous Archaeological Data. To be presented at the ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7-11, 2004.
  2. U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, E. A. Fox, J. W. Flanagan. ETANA-DL: Managing Complex Information Applications – An Archaeology Digital Library. Demo to be presented at the ACM-IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, AZ, June 7-11, 2004.
  3. U. Ravindranathan, R. Shen, M. A. Goncalves, W. Fan, E. A. Fox, J. W. Flanagan. Prototyping Digital Libraries Handling Heterogeneous Data Sources – The ETANA-DL Case Study. European Conference on Digital Libraries (ECDL 2004), Bath, U.K., September 12-17, 2004 (submitted).

Questions/Feedback ??