Download presentation
Presentation is loading. Please wait.
Published byAugustine Bryan Modified over 9 years ago
1
June 3-6, 2003E-Society Lisbon Automatic Metadata Discovery from Non-cooperative Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University (shi,maly,zubair)@cs.odu.edu
2
June 3-6, 2003E-Society Lisbon Overview Introduction Background Architecture & Design Experimentation & Implementation Conclusion & Future Works
3
June 3-6, 2003E-Society Lisbon Introduction Many approaches for DL Interoperation Harvesting and distributed search Earlier work on LFDL – Lightweight Federated Digital Library Universal search interface DL specification in DLDL DL registration Query mapping Limitations Organizing result set and performance Enhanced LFDL Interactive user-centered search
4
June 3-6, 2003E-Society Lisbon Background Levels of interoperability Technical: protocol, format Contents: data, metadata, messages Organizational: rules for access, payment, authentication General models Federation complete, but requires more from data providers Harvesting some efforts from both data and service providers Gathering Little from data providers
5
June 3-6, 2003E-Society Lisbon LFDL Introduction General principle Aim at non-cooperating digital libraries Distributed search Lightweight: both to data and service providers Basic solution DL specification definition language Dynamic DL metadata registration Universal interface Dynamic Query mapping Local repository
6
June 3-6, 2003E-Society Lisbon Limitations and Issues Limited service usability Search results presented in flat structure Need metadata to present rich search results Performance Caching is neither flexible nor efficient Need local metadata repository to generate intelligent cache Solution Retrieve metadata from remote digital libraries
7
June 3-6, 2003E-Society Lisbon Metadata Retrieval - Approach Available metadata sources List page of search results Detail page of a selected document/record Approach Define specification on how metadata are presented in those pages Use Dublin Core as common metadata mapping set Develop metadata parser to extract metadata Store parsed metadata in local repository
8
June 3-6, 2003E-Society Lisbon Architecture
9
June 3-6, 2003E-Society Lisbon
10
June 3-6, 2003E-Society Lisbon Metadata Retrieval Workflow Define metadata parsing rules in DL specification in DLDL Start parsing when search results arrive from remote DL Parse list page If metadata available at record level, parse record page for each document of results list Metadata are merged and presented to users Metadata are saved to a local repository
11
June 3-6, 2003E-Society Lisbon Metadata Parsing Rules Definition Extended DLDL Two levels: list page and record page String parsing: separate raw string to segments corresponding to metadata fields
12
June 3-6, 2003E-Society Lisbon Part of DTD for DL parsing rules specification
13
June 3-6, 2003E-Society Lisbon Sample Specification for CogPrints null name="DC.title" " name="DC.creator" /><meta content=" " name="DC.creator" ; CREATOR
14
June 3-6, 2003E-Society Lisbon Local Metadata Repository All searches are served locally first A secondary in memory metadata cache for better performance and system reliability Cache grouped by metadata instead of query string
15
June 3-6, 2003E-Society Lisbon Results
16
June 3-6, 2003E-Society Lisbon
17
June 3-6, 2003E-Society Lisbon Populate metadata repository more efficiently Richer functions, more user-friendly in presenting results Cache maintenance: size, consistency… Conclusion and Future Works
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.