ETANA-DL Managing complex information applications: An archaeology digital library This research is funded in part by NSF-ITR grant #IIS Edward A. Fox, Virginia Tech James W. Flanagan, Case Western Reserve University ASOR Annual Meeting, Atlanta November 21, 2003
Acknowledgements Karen Borstad, MPP Douglas Clark, Walla Walla College Joanne Eustis, CWRU Weiguo Fan, Virginia Tech Nick Fischio, CWRU Paul Gherman, Vanderbilt U. Marcos Goncalves, Virginia Tech Larry Herr, Canadian University College Christopher Holland, LRP Paul Jacobs, Mississippi State U. Douglas Knight, Vanderbilt U. Stan LaBianca, Andrews U. Ming Luo, Virginia Tech David McCreery, Willamette U. Unni Ravindranathan, Virginia Tech Jack Sasson, Vanderbilt U. Rao Shen, Virginia Tech Ricardo Torres, U. Campinas, Brazil Randall Younker, Andrews U.
Overview An Archaeology Digital Library (DL) ETANA-DL Architecture Digital Libraries, 5S Framework -> Structures Open Archives Initiative, Open Digital Libraries Canned Demonstration Conclusions Discussion: Archaeology DL Requirements
NSF ITR Funding IT Research Digital library: Integration of DB, HCI, HT, IR, LIS, MM, … Complexity! Variety! Distributed! => 5S Framework + OAI / ODL Archaeology Research Multiple sites Multiple kinds of artifacts Multiple terminologies General/special services Multiple views Hypothesis testing Rapid publication
Map courtesy: Current ETANA-DL Member Locations Virginia Tech Mississippi State University Vanderbilt University Canadian University College Walla Walla College Andrews University CWRU Willamette University
ETANA Website
Lahav Website
Nimrin Website
Umayri/MPP Website
ETANA-DL Website
Overview An Archaeology Digital Library (DL) ETANA-DL Architecture Digital Libraries, 5S Framework -> Structures Open Archives Initiative, Open Digital Libraries Canned Demonstration Conclusions Discussion: Archaeology DL Requirements
ETANA-DL Architecture UsersServicesData ETANA-DL Union ServicesUsers DigKitDigBase
ETANA Digital Library Core Components - DigKit DigKit (DK) Tools for collecting and recording archaeological data in the field Metadata will migrate to DigBase (DB) Real-time collaborative archaeology: metadata in DB will be rapidly available to others
ETANA Digital Library Core Components - DigBase DigBase (DB) Central repository - stores metadata Union catalog - for collections that are in ETANA-DL Various kinds of digital objects – excavation records, images, text collections, etc. General services - Search, Browse, Annotate, Recommend, etc. Archaeology-specific services - artifact analysis, visualizations, artifact interpretation, workflows, etc.
ETANA-DL Architecture DigBase and DigKit Lahav Nimrin Umayri Hisban Megiddo Jalul New Sites DATABASEWRAPPERSDATABASEWRAPPERS ETANA-DL UNION CATALOG Search USERINTERFACEUSERINTERFACE Browse Recommend Note Personalize Review Visualizations Archaeology Specific DigKitDigBase Work in progress …
Overview An Archaeology Digital Library (DL) ETANA-DL Architecture Digital Libraries, 5S Framework -> Structures Open Archives Initiative, Open Digital Libraries Canned Demonstration Conclusions Discussion: Archaeology DL Requirements
Content Types Text Documents Articles, Reports, Books Video Audio Speech, Music Geographic Information (Aerial) Photos Software, Programs Models Simulations Bio Information Genome Human, animal, plant Images and Graphics 2D, 3D, VR, CAT Digital Library Content
Computing (flops) - e.g., VT’s Terascale Computing Facility, 10 teraflops - 3 rd fastest in world Digital Content Communicat i ons (bandwidth, connectivity) Digital Libraries in Computing and Communications Technology Space Digital Libraries technology trajectory: intellectual access to globally distributed information lessmore
5S Model - Informally Digital libraries are complex information systems that: help satisfy info needs of users (societies) provide info services (scenarios) organize info in usable ways (structures) present info in usable ways (spaces) communicate info with users (streams)
5S in Archaeology - Structures Streams Structures Spaces Scenarios Societies 5S Regions Example: Madaba Plains
5S in Archaeology – Structures (contd.) REGION*PARTITION*SUB-PARTITION *LOCUS *CONTAINER *FIND *SITE has subdivided has contains Lahav Nimrin Umayri Lahav:Field Nimrin:Quad Umayri:Field Lahav:Area Nimrin:Quad Umayri:Square Bone Pottery Seed Figurine Lahav:Basket Nimrin:Bag Umayri:Pail Below, Above, Co-existing *Specific- FINDing Human Mandible … planned
SitePartitionSub- partition LocusContainer LahavField I Area A8 Locus A8074 Basket 224 NimrinQuadrant NW Quadrant Value N25/W50 Locus 96 Bag 240 UmayriField A Square 7J59 Locus 001 Pail 12 5S Structural Model Organization
Data Organization in ETANA-DL Bone Seed Figurine ETANA-DL Object Count Animal …… Species Name …… Description Dimensions …… Owner Subpartition Partition Locus ID Container Collection ……
Database Representation QUADNW/EWLocusAnimalBone NWN40/W25178SHEEP/GOATMETAPODIAL SWS40/W1701HOMO SAPIENS- NEN50/E50-UNIDENTIFIED …..
A Sample Bone Record in XML 1 Nimrin Bone NW N40/W IRON II BC METAPODIAL SHEEP/GOAT ……
Overview An Archaeology Digital Library (DL) ETANA-DL Architecture Digital Libraries, 5S Framework -> Structures Open Archives Initiative, Open Digital Libraries Canned Demonstration Conclusions Discussion: Archaeology DL Requirements
Open Archives Initiative OAI
Discovery Current Awareness Preservation Service Providers Data Providers Metadata harvesting The World According to OAI
Some OAI Data Providers Analytical Sciences Digital Library California Digital Library Repository Caltech Archives Oral Histories Online Carnegie Mellon U Informedia Public Domain Video Archive DSpace at MIT Library of Congress Open Archive Initiative Repository Perseus Digital Library The University of Michigan Library The University of Tennessee Library University of Illinois Library U of Pittsburgh Electronic Thesis and Dissertation Archive Virginia Tech ImageBase
repository repositoryrepository OAI protocol harvesterharvester support data harvesting data items
selective harvesting - datestamps repositoryrepository harvest within date range record
Data and Service Providers Data Providers possess metadata and share it (internally / externally) via well-defined OAI protocols (e.g., database servers) Service Providers harvest data from Data Providers provide higher-level services to users (e.g., search engines) Who will fit where in ETANA-DL? Data Provider – YOUR PROJECT Service Provider – ETANA-DL
What then is an Open Archive? Any WWW-based system accessed through the well-defined interface of Open Archives Protocol for Metadata Harvesting Also known as OAI-Compliant Repository No implications for: Physical storage of data Cost of data Metadata and data formats Access control to server Will my current digital system be affected? NO An Open Archive is built separately without disturbing the data or the current system
Introduction to ODL (Open Digital Libraries) Open Digital Libraries Framework for componentized Digital Libraries Design principles for components Protocols for inter-component communications Built upon OAI
Traditional Digital Libraries ? Program Document Document Document Program Program Image Image Image Video Video Video ? Monolithic and/or Custom-built web-based application UsersDigital Library Digital Objects
Open Digital Libraries Approach UsersETANA-DLSites Bone Search Filter Union Recent Browse USER INTERFACE Filter Seed Figurine Pottery
Basic ODL Model: An application for Archaeology OAI Data Provider OAI-PMH ODL Protocol User Interface Nimrin ETANA-DL Union Catalog OAI-PMH ETANA-DL Search Engine ODL Service Provider Component WWW Interface ODL Protocol
Overview An Archaeology Digital Library (DL) ETANA-DL Architecture Digital Libraries, 5S Framework -> Structures Open Archives Initiative, Open Digital Libraries Canned Demonstration Conclusions Discussion: Archaeology DL Requirements
Home Page
Login Page New Account
New Account Creation Page
Navigations ETANA-DL Tutorial Collections Description Items of Interest, Marked Items, Browse, Search, etc.
Collections Description Page More information about the site
Search Results Links to result pages Objects belonging to site: Lahav, area:G6, and locus:G6006 Objects belonging to site: Lahav and Area:K7 Search Query / Number of Hits
Detailed Display (Lahav Figurine) From Lahav data collection Lahav Terms Field Area Locus Basket
Detailed Display (Nimrin Seed) Nimrin Terms Quad Quadrant Locus Bag
Detailed Display (Umayri Bone)* *Data Integration in progress Umayri Terms Field Square Locus Pail …..
Advanced Search Page
SitePartitionSub-partitionLocusContainer LahavField I Area A8 Locus A8074 Basket 224 NimrinQuadrant NW Quadrant Value N25/W50 Locus 96 Bag 240 UmayriField A Square 7J59 Locus 001 Pail 12 5S Structural Model Organization
Advanced Search Options Example
Search Results Query
Browsing Fields in Umayri’s bone collection
Browsing - II Squares in Field ‘H’ Records in Field ‘H’ for Umayri
Adding to Items of Interest Add item to personal collection
Items of Interest Display Objects user is interested
Marking an Item Mark Items
Marking – writing notes for a specific user Marking Items
Marked Items Display Sender, Date, Object OAI ID Sender Comments Options: View Record, Add record to Items Of Interest, Re-mark item (Redirect), Unmark item (Remove item from list)
Discussions Page Discussions about an object View/Post messages, create new threads
Recommendations Items recommended on the basis of similar interests
Overview An Archaeology Digital Library (DL) ETANA-DL Architecture Digital Libraries, 5S Framework -> Structures Open Archives Initiative, Open Digital Libraries Canned Demonstration Conclusions Discussion: Archaeology DL Requirements
Conclusions ETANA-DL: integrated services built upon many archaeology projects Harvesting (, OAI, ODL) 5S Framework: Model, Tailored Generation Links for more information Welcome collaboration!
Harvesting vs. Federation Competing approaches to interoperability Federation is when services are run remotely on remote data (e.g., Meta-searching) Harvesting is when data/metadata is transferred from the remote source to the destination where the services are located (e.g. Union catalogues) Federation requires more effort at each remote source but is easier for the local system and vice versa for harvesting OAI currently focuses on harvesting
5S Model ModelsExamplesObjectives Stream Text; video; audio; imageDescribes properties of the DL content such as encoding and language for textual material or particular forms of multimedia data Structures Collection; catalog; hypertext; document; metadata; organization tools Specifies organizational aspects of the DL content Spaces Measure; measurable, topological, vector, probabilistic Defines logical and presentational views of several DL components Scenarios Searching, browsing, recommending,Details the behavior of DL services Societies Service managers, learners, Teachers, etc. Defines managers, responsible for running DL services; actors, that use those services; and relationships among them
5SLGen: Automatic Digital Library Generation
Links to Resources ETANA Home Page ETANA-DL Home Page ETANA-DL Prototype Proposal submitted to NSF-ITR Open Archives Initiative OAI Metadata Harvesting Protocol Virginia Tech DLRL Projects
Overview An Archaeology Digital Library (DL) ETANA-DL Architecture Digital Libraries, 5S Framework -> Structures Open Archives Initiative, Open Digital Libraries Canned Demonstration Conclusions Discussion: Archaeology DL Requirements