Digital libraries and web- based information systems Mohsen Kamyar.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Logics for Data and Knowledge Representation Projects and thesis introduction.
 Andisheh Keikha Ryerson University Ebrahim Bagheri Ryerson University May 7 th
Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
CS580: Building Web Based Information Systems Roger Alexander & Adele Howe The purpose of the course is to teach theory and practice underlying the construction.
DL systems DL and the Web Ilie Savga
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Overview of Search Engines
Web Searching. Web Search Engine A web search engine is designed to search for information on the World Wide Web and FTP servers The search results are.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Survey of Semantic Annotation Platforms
Computational Linguistics WTLAB ( Web Technology Laboratory ) Mohsen Kamyar.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Organizing Internet Resources OCLC’s Internet Cataloging Project -- funded by the Department of Education -- from October 1, 1994 to March 31, 1996.
Master Thesis Defense Jan Fiedler 04/17/98
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
10/18/2015Page 1 Introduction to Semantic Web Design B. Ramamurthy.
CORPORUM-OntoExtract Ontology Extraction Tool Author: Robert Engels Company: CognIT a.s.
LIR 10 Week 7 Boolean Searching and Online Periodical Databases.
Chapter 6: Information Retrieval and Web Search
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Advanced topics in software engineering (Semantic web)
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
Search Engine Architecture
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Project Overview Vangelis Karkaletsis NCSR “Demokritos” Frascati, July 17, 2002 (IST )
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Progress Report (Concept Extraction) Presented by: Mohsen Kamyar.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Summarizing Encyclopedic Term Descriptions on the Web from Coling 2004 Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Basics of Information Retrieval and Query Formulation Bekele Negeri Duresa Nuclear Information Specialist.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Information Retrieval
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Basics of Web Based Computing. The Architecture The user’s system A Web Server What’s inside? Server software Apache or other Resources to be accessible.
Web Search Architecture & The Deep Web
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
Semantic Web 06 T 0006 YOSHIYUKI Osawa. Problem of current web  limits of search engines Most web pages are only groups of character strings. Most web.
© 1990—2006 Visual Knowledge Software® | Private and Confidential | 2 Semantic Agent Wikis For Engineering.
General Architecture of Retrieval Systems 1Adrienn Skrop.
©2003 Paula Matuszek CSC 9010: AeroText, Ontologies, AeroDAML Dr. Paula Matuszek (610)
Knowledge and Information Retrieval Dr Nicholas Gibbins 32/4037.
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Search Engine Architecture
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Information Retrieval on the World Wide Web
Search Engine Architecture
Chaitali Gupta, Madhusudhan Govindaraju
Presentation transcript:

Digital libraries and web- based information systems Mohsen Kamyar

Outline What is a Digital Library? Brief history Enabling Semantic Web Some projects

What is a Digital Library? A collection of books A collection of documents Web?

Brief History The goal is to make web contents understandable for machine Last efforts can be categorized in digital libraries scope In a library there are some cards that contain brief information about each book. (making this information as wide as possible and understandable for machine)

Brief History (continued) And according to Milne “providing structures is one of the things that Description Logic do best!” Web is a large sample of digital libraries Web has documents that there are not any information about them and we should provide some way for describing the underlying semantic

SGML to Semantic Web One of the first efforts in the field of Digital Library was the SGML (Standard Generalized Markup Language). This language uses tags for describing the meaning of each word ( Washington ) At last SGML and HTML were combined and XML was generated for describing the semantic as “meta data”.

SGML to Semantic Web To prevent the web community from repeating its mistakes in knowledge representation DLs community became very active and offered many languages for describing the semantic.

Untangle A system for representing card-catalog information It has a web-based interface and we can say that it is the first DL system with web-based interface It used Lisp and its HTTP Server for providing a web-based interface The goal was using DL in Digital Library (classification and information retrieval)

Untangle And now one of the main concerns in the web scope is the classification of the information There are some practical experiences in this project: we should define the objects that are naturally distinguishable as primitive concepts and others as defined concepts; for example book and journal are two primitive concepts. This project also exposed some limitations of DL for modelling.

Untangle One of the limitations is the trade-off between defining objects as concepts or as individuals; for example a person should be defined as an individual, but some persons are subject of many books then for efficiency purposes we should define them as concepts. For this reasons in this projects some special concepts was added to DL.

FindUR The main goal of this project was query expansion. In query expansion we expand a query with its synonyms and hyponyms. There are two main criterion in information retrieval field: Ratio of the desired to undesired returned pages (precision, false positive) Ration of desired missed pages to desired pages (recall, false negative)

FindUR If we use synonyms and hyponyms in a search we can reduce the false-negative ratio. In this project, Wordnets were used for specifying the synonyms and hyponyms. This project was expanded. It found many complicated applications; for example complex search phrases in news and etc.

Alvis Superpeer Semantic Search Engine Partners Helsinki university of technology EPFL (Switzerland)... This project has three stages: Crawling the web using a subject-specific crawler and converting the documents to a simple XML format

Alvis Analyzing documents using a domain-specific linguistic analyzer and annotating the documents. In this stage semantic tagging can be included. In this stage the generated document will be indexed according to its domain and will take a score. In this stage some categories will be assigned to each document, and finally probabilistic approaches will be used for searching.