INFORMATION RETRIEVAL

Slides:



Advertisements
Similar presentations
Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
Advertisements

Natural Language Processing WEB SEARCH ENGINES August, 2002.
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System How to organize the information and make it accessible and useful ? Li-Yan Yuan.
Search Engines and Information Retrieval
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
1 Chapter 19: Information Retrieval. ©Silberschatz, Korth and Sudarshan19.2Database System Concepts - 5 th Edition, Sep 2, 2005 Chapter 19: Information.
Search engines. The number of Internet hosts exceeded in in in in in
Chapter 19: Information Retrieval
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
1 MARG-DARSHAK: A Scrapbook on Web Search engines allow the users to enter keywords relating to a topic and retrieve information about internet sites (URLs)
Information Retrieval
Chapter 5: Information Retrieval and Web Search
 Search engines are programs that search documents for specified keywords and returns a list of the documents where the keywords were found.  A search.
RSS Feeds in AquaBrowser Library Staff Training Upper Midwest Users Group Conference 18 October 2011 Nina Mentzel, SDLN
An Application of Graphs: Search Engines (most material adapted from slides by Peter Lee) Slides by Laurie Hiyakumoto.
Ryerson University Library and Archives Searching the Deep Web Winter 2012.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
1 Chapter 19: Information Retrieval Chapter 19: Information Retrieval Relevance Ranking Using Terms Relevance Using Hyperlinks Synonyms., Homonyms,
Computing & Information Sciences Kansas State University Monday, 04 Dec 2006CIS 560: Database System Concepts Lecture 41 of 42 Monday, 04 December 2006.
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Overview What is a Web search engine History Popular Web search engines How Web search engines work Problems.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
CSE 6331 © Leonidas Fegaras Information Retrieval 1 Information Retrieval and Web Search Engines Leonidas Fegaras.
CSCI-235 Micro-Computer in Science Internet Search.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Chapter 6: Information Retrieval and Web Search
Search engines are used to for looking for documents. They compile their databases by employing "spiders" or "robots" to crawl through web space from.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Ranking in Information Retrieval Systems Prepared by: Mariam John CSE /23/2006.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Search Engines.
به نام خدا مهندسي اينترنت جوانمرد اسلايد پنجم.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Search Strategies for the World Wide Web. Search Engines  SearchEngineWatch.com will periodically rank the most effective search engines.  Some Search.
Ranking Definitions with Supervised Learning Methods J.Xu, Y.Cao, H.Li and M.Zhao WWW 2005 Presenter: Baoning Wu.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Basics of Databases and Information Retrieval1 Databases and Information Retrieval Lecture 1 Basics of Databases and Information Retrieval Instructor Mr.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
Implementation of Vector Space Model March 27, 2006.
Usefulness of Quality Click- through Data for Training Craig Macdonald, ladh Ounis Department of Computing Science University of Glasgow, Scotland, UK.
Definition, purposes/functions, elements of IR systems Lesson 1.
Database System Concepts, 5th Ed. ©Sang Ho Lee Chapter 19: Information Retrieval.
Search Engine Optimization
Information Retrieval
Aidan Hogan CC Procesamiento Masivo de Datos Otoño 2017 Lecture 7: Information Retrieval II Aidan Hogan
Information Retrieval and Web Search
Prepared by Rao Umar Anwar For Detail information Visit my blog:
Aidan Hogan CC Procesamiento Masivo de Datos Otoño 2018 Lecture 7 Information Retrieval: Ranking Aidan Hogan
Information Retrieval on the World Wide Web
Thanks to Bill Arms, Marti Hearst
Information Retrieval
Introduction to Search Engines
DBMS with probabilistic model
Data Mining Chapter 6 Search Engines
Web Mining Department of Computer Science and Engg.
Introduction to Information Retrieval
Chapter 5: Information Retrieval and Web Search
Chapter 31: Information Retrieval
Information Retrieval and Web Design
Information Retrieval and Web Design
Information Retrieval and Web Design
Chapter 19: Information Retrieval
Introduction to Search Engines
Presentation transcript:

INFORMATION RETRIEVAL -Monday March 21, 2006 -Arjun Dasgupta

IR DOMAINS Ranking functions in Traditional IR Ranking functions used in the WWW Ranking functions used in databases

RANKING FUNCTOINS IN TRADITIONAL IR Started as a branch of library science Definitions: Corpus: a collection of documents Document: a set/bag of words Keyword Query: small set of keywords

An Example…. Query over the web = {Microsoft, Corporation} An obvious approach would be to retrieve all documents containing “Microsoft” and “Corporation” ordered by the “ranking function” Ranking function determines the order in which query results are presented

What constitutes a good ranking function??? Relative frequency of occurrence of keywords Proximity of keywords (this cannot be done in a bag of words model) Specificity or importance Links to/from other documents Popularity/ relevance of page with respect to the query