Download presentation
Presentation is loading. Please wait.
Published byCharlene Flowers Modified over 6 years ago
1
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
Digital Libraries Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
2
Need of Digital Library
Now days, researchers are making their work available online in the form of postscript or PDF documents. To access this growing body of scientific literature we need Digital Libraries.
3
What is Digital Library
A digital library is an integrated set of services for capturing, cataloging, storing, searching, protecting, and retrieving information, which provide coherent organization and convenient access to typically large amounts of digital information. As a consequence of the huge amounts of digital content becoming available, modern search engine technologies are now being introduced in digital libraries to retrieve the relevant content.
4
Architecture of Digital Library Search System
5
Modules of System Crawler Document Parser Indexing Module
Database Search & Browsing Sub-Agent Web Browser Interface
6
Crawler The main component of the digital library search system is a crawler that traverses the hypertext structure in the web, downloads the web pages or harvest the desired papers published in specific venue (e.g. a conference or a journal) and stores them in database. It is an agent to automatically locate and acquire research publications.
7
Document Parser It is a document parser and database creator.
It extracts the semantic features from the downloaded documents and places them into a database as parsed documents.
8
Indexing Module The parsed documents are routed to an indexing module that builds the index based on the keywords present in the pages. Various ranking methods are also implemented in this module to present relevant results to users according to their needs.
9
Database Search & Browsing Sub-Agent
It consists of a query processing sub-agent which takes a user query of proper syntax and returns an HTML formatted response to the user. The query processing sub-agent provides several different browsing capabilities that allow a user to easily navigate through the document database. Although search by keyword is supported, there is emphasis on using the links between “citing” and “cited” documents to find related research papers.
10
Web Browser Interface It is the interface between user and the main system. User fires a query in the form of keywords on the web browser interface of a digital library search engine. Results are also displayed on this interface to the user.
11
Advantages of Digital Libraries
Digital Libraries improves upon manual search process in three ways: It automates the tedious, repetitive, and slow process of finding and retrieving Web based publications. Once potentially relevant papers are retrieved, it guides the user towards interesting papers by making them searchable. When a relevant paper is found, it helps the user by suggesting other related papers using similarity measures derived from semantic features of the retrieved documents.
12
CONCLUSION In this I presented an agent that automates and enhances the task of finding interesting and relevant research publications on the World Wide Web. It can save researchers a great deal of time and effort in the process of a literature search.
13
REFERENCES A Comparative Study of Page Ranking Algorithms for Online Digital Libraries by Sumita Gupta, Neelam Duhan, Poonam Bansal. Citeseer-An Autonomous Web Agent for Automatic Retrieval and Identification of Interesting Publications By Kurt D. Bollacker, Steve Lawrence and C. Lee Giles.
14
THANK YOU
15
QUERIES???
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.