Foundations of, and Experiences with, Componentized Digital Libraries OCKHAM Panel ECDL Rome, Italy Edward A. Fox Digital Library Research Laboratory
Acknowledgements Thanks go to many colleagues: Su-Shing Chen, Joe Futrelle, David Garza, Marcos Goncalves, Eberhard Hilf, Rohit Kelapure, Neill Kipp, Ming Luo, Ryan Richardson, Alfredo Sanchez, Rao Shen, Hussein Suleman, Jun Wang, Layne Watson, Qinwei Zhu, the OCKHAM team, … Thanks go to DLF for supporting the OCKHAM meeting Portions of this work were funded in part by the US National Science Foundation through grants DUE , , , , and ; and IIS , and Among these are subcontracts with original funding to UNC Wilmington, U. of Arizona, and U. of Florida. Portions of this work were funded in part by the Mellon Foundation, especially through a subcontract with original funding to SOLINET for AmericanSouth.org.
2 Groups of Applications Mellon funded OAI effort –AmericanSouth.org –With Martin Halbert, Emory team, SOLINET, … NSF funded research on DLs –NSF-CONACyT project with Puebla, Monterrey –NSF-DFG project with U. Oldenburg, Germany –CITIDEL (NSDL collection project) –DL-in-a-box (NSDL services sub-project)
2 Approaches 5S, 5SL, 5SLgen –MARIAN and its component pool –5S as a framework for DL description, … –Specifications using 5S language (5SL) Interface: design specific DL (from general schema) –Generation from specifications (5SLgen) OAI, XOAI, ODL –Develop component pool –Extend OAI as the glue to connect components
DL Requirements (selected from 5S paper) ACTORSACTIVITIESCOMPONENTSSOC/ECO/LEGALENVIRNMENT HumansCreatingDocumentsPoliciesDisciplines LearnerCollectingRepositories Rights mngmnt Business ReaderOrganizingKnow. Org. Srcs Privacy CS LibrarianSelectingClusters Billing Engineering AgentsDisseminatingHandlesStandardsPurposes CrawlerRequestingSubstrate Description Nat’l Library MediatorPreserving Communictn Transmission Education DistributedEvaluating ProtocolsQualitiesScope ClientsAbstracting Modules Interoperability Personal ServersPersonalizing OS, UI Sustainability Worldwide
5S Layers Societies Scenarios Spaces Structures Streams
Definition: Digital Libraries are complex systems that help satisfy info needs of users (societies) provide info services (scenarios) organize info in usable ways (structures) present info in usable ways (spaces) communicate info with users (streams)
open digital library Program Document Document Document Program Program Image Image Image Video Video Video OA PMH XPMH Concept of Open Digital Library
ETD Digital Library Program Document Document ETD Program ETD Image Image ETD Video Video ETD-4 Search Filter Union Recent Browse PMH ODLRecent ODLBrowse ODLUnion ODLSearch ODLUnion PMH USER INTERFACE Students and researchers ETD collections Example of Open Digital Library
Open Digital Libraries Components Running now –XML-File (data provider from file system) –Union, search, browse, recent, filter –E-journal/review, Submit, Edit, Annotation Class projects –High performance multilingual search –Recommender, Rating; Mirroring (see JCDL’02) –Working with NCSA: from DB, unstructured text Others discussed –Classification/categorization and browsing
Harvest from data providers DBUnion Archive Merger Component DBBrowse Browse Engine IRDB Search Engine As Metadata Search Service Provider XML File Collection 1 As Metadata Browse Service Provider XML File Data Provider 1 XML File Collection 2 XML File Data Provider 2 XML File Collection 3 XML File Data Provider 3 Open Digital Library: Basic
Harvest from data providers DBUnion Archive Merger Component DBBrowse Browse Engine IRDB Search Engine As Metadata Search Service Provider As Metadata Browse Service Provider XML File Coll. & Data Provider 1 XML File Coll. & Data Provider 2 XML File Coll. & Data Provider 3 Open Digital Library: Intermediate What’s New Engine As What’s New Service Provider OAI-PMH Data Provider Submit Archive OAIB (NCSA: from RDBMS) Filter
Harvest from data providers DBUnion Archive Merger Component DBBrowse Browse Engine IRDB-1 Search Engine As Metadata Search Service Provider As Metadata Browse Service Provider XML File Coll. & Data Provider 1 XML File Coll. & Data Provider 2 XML File Coll. & Data Provider 3 Open Digital Library: Extended What’s New Engine As What’s New Service Provider OAI-PMH Data Provider Submit Archive OAIB (NCSA: from RDBMS) Filter Recommend Rate Engine Annotation Engine IRDB-2 Search Engine As Annotation Search Service Provider As Recommend & Rate Service Provider
Digital library architecture for local and interoperable CITIDEL services
NSDL Information Architecture Developed by the Technical Infrastructure Workgroup referenced items & collections referenced items & collections Special Databases NSDL Services NSDL Services Other NSDL Services CI Services annotation CI Services discussion CI Services personalization CI Services authentication CI Services browsing Core Services: information retrieval Core Collection- Building Services harvesting Core Collection- Building Services protocols Core Services: metadata gathering Portals & Clients Portals & Clients Portals & Clients Usage Enhancement Collection Building User Interfaces NSDL Collections NSDL Collections NSDL Collections Core NSDL “Bus”
NSDL Interconnection Schemes (initial and my suggested additions) Harvesting: OAI PMH Federated Search: SDLIP Access: Shibboleth Portal: Uportal Some of My Suggestions: –XOAI (see Suleman & Fox, start of Session 13A) –Logging (See Goncalves et al., end of Session 3B) –DL–Viz (VIDI: recall Rao JCDL Viz. Wkshp)
What’s Next Discussions inside NSDL, … Sharing of components, evaluation, refinement of components & process Integration with related fields –Content management –Courseware management –Hypertext, Information retrieval –Knowledge management –Multimedia …