T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)

Slides:



Advertisements
Similar presentations
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Advertisements

Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
Basic IR: Modeling Basic IR Task: Slightly more complex:
Modern Information Retrieval Chapter 1: Introduction
Query Languages. Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Modern Information Retrieval by R. Baeza-Yates and B. Ribeiro-Neto
Web Search - Summer Term 2006 II. Information Retrieval (Basics Cont.)
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
IR Models: Overview, Boolean, and Vector
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
IR Models: Structural Models
Models for Information Retrieval Mainly used in science and research, (probably?) less often in real systems But: Research results have significance for.
Query Languages: Patterns & Structures. Pattern Matching Pattern –a set of syntactic features that must occur in a text segment Types of patterns –Words:
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Chapter 2Modeling 資工 4B 陳建勳. Introduction.  Traditional information retrieval systems usually adopt index terms to index and retrieve documents.
Chapter 4 : Query Languages Baeza-Yates, 1999 Modern Information Retrieval.
1 Query Language Baeza-Yates and Navarro Modern Information Retrieval, 1999 Chapter 4.
IR Models: Latent Semantic Analysis. IR Model Taxonomy Non-Overlapping Lists Proximal Nodes Structured Models U s e r T a s k Set Theoretic Fuzzy Extended.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Vector Space Model CS 652 Information Extraction and Integration.
Modern Information Retrieval Chapter 4 Query Languages.
IR Models: Review Vector Model and Probabilistic.
Other IR Models Non-Overlapping Lists Proximal Nodes Structured Models Retrieval: Adhoc Filtering Browsing U s e r T a s k Classic Models boolean vector.
 IR: representation, storage, organization of, and access to information items  Focus is on the user information need  User information need:  Find.
Chapter 4 Query Languages.... Introduction Cover different kinds of queries posed to text retrieval systems Keyword-based query languages  include simple.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
LIS618 lecture 1 Thomas Krichel economic rational for traditional model In olden days the cost of telecommunication was high. database use.
Information Retrieval Introduction/Overview Material for these slides obtained from: Modern Information Retrieval by Ricardo Baeza-Yates and Berthier Ribeiro-Neto.
Information Retrieval Chapter 2: Modeling 2.1, 2.2, 2.3, 2.4, 2.5.1, 2.5.2, Slides provided by the author, modified by L N Cassel September 2003.
Information Retrieval Models - 1 Boolean. Introduction IR systems usually adopt index terms to process queries Index terms:  A keyword or group of selected.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
IR Models J. H. Wang Mar. 11, The Retrieval Process User Interface Text Operations Query Operations Indexing Searching Ranking Index Text quer y.
Chapter 6: Information Retrieval and Web Search
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Information Retrieval Model Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
CSCE 5300 Information Retrieval and Web Search Introduction to IR models and methods Instructor: Rada Mihalcea Class web page:
1 University of Palestine Topics In CIS ITBS 3202 Ms. Eman Alajrami 2 nd Semester
Search Engine Architecture
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Modern Information Retrieval Presented by Miss Prattana Chanpolto Faculty of Information Technology.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Recuperação de Informação Cap. 01: Introdução 21 de Fevereiro de 1999 Berthier Ribeiro-Neto.
Information Retrieval
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
C.Watterscsci64031 Classical IR Models. C.Watterscsci64032 Goal Hit set of relevant documents Ranked set Best match Answer.
Set Theoretic Models 1. IR Models Non-Overlapping Lists Proximal Nodes Structured Models Retrieval: Adhoc Filtering Browsing U s e r T a s k Classic Models.
Information Retrieval and Web Search Introduction to IR models and methods Rada Mihalcea (Some of the slides in this slide set come from IR courses taught.
Introduction n IR systems usually adopt index terms to process queries n Index term: u a keyword or group of selected words u any word (more general) n.
Information Retrieval Models School of Informatics Dept. of Library and Information Studies Dr. Miguel E. Ruiz.
Information Retrieval and Web Search
Search Engine Architecture
Information Retrieval and Web Search
Information Retrieval on the World Wide Web
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Information Retrieval
موضوع پروژه : بازیابی اطلاعات Information Retrieval
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
CSE 635 Multimedia Information Retrieval
Introduction to Information Retrieval
Search Engine Architecture
Models for Retrieval and Browsing - Structural Models and Browsing
Recuperação de Informação B
Information Retrieval and Web Design
Recuperação de Informação B
Recuperação de Informação
Advanced information retrieval
Presentation transcript:

T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)

2 T.Sharon - A.Frank Classical IR  Deals with Textual Information Retrieval.  Exists for a few decades, mostly for text repositories.  Pushed strongly with the development of the WWW for search engines.

3 T.Sharon - A.Frank IR Topics and their Relationships Retrieved Models & Evaluation Improvements on Retrieval Efficient Processing Bibliographic Systems The Web Digital Libraries Interfaces & Visualization Multimedia Modeling & Searching TEXTUAL IR HUMAN COMPUTER- INTERACTION FOR IR APPLICATIONS FOR IR IR Vocabulary:

4 T.Sharon - A.Frank Basic Architecture of an IR System DocumentsQueries Document Representation Query Representation Comparison

5 T.Sharon - A.Frank Interaction of the User with the IR System Retrieval Browsing database

6 T.Sharon - A.Frank What is a Query? Input: –query terms/words, should appear in the text –possibly conditions between them Output: –relevant documents –possibly ranked

7 T.Sharon - A.Frank Information Retrieval Systems Generic information retrieval system select and return to the user desired documents from a large set of documents in accordance with criteria specified by the user. Retrieval Functions –document search (ad-hoc) the selection of documents from an existing collection of documents. –document routing (filtering) the dissemination of incoming documents to appropriate users on the basis of user interest profiles.

8 T.Sharon - A.Frank The Process of Retrieving Information Text Databases Index DB Manager Module Indexing Text Operations Query Operations Searching Ranking User Interface Text Logical view Inverted file User feedback Retrieved docs User need

9 T.Sharon - A.Frank IR Ranking Ranking algorithms –The central problem regarding IR systems is the issue of predicting which documents are relevant and which are not. –Ranking algorithms are at the core of IR systems. –A ranking algorithm operates on basic premises regarding document relevance according to distinct IR model.

10 T.Sharon - A.Frank A Taxonomy of IR Models UserTaskUserTask Retrieval: Search Routing Browsing Boolean Vector Probabilistic Non-Overlapping Lists Proximal Nodes Classic Models Structured Models Flat Structure Guided Hypertext Browsing Fuzzy Extended Boolean Set Theoretic Generalized Vector Latent Semantic Index Neural Networks Algebraic Inference Network Belief Network Probabilistic

11 T.Sharon - A.Frank Retrieval Models Associations Full Text + Structure Full TextIndex Terms Structured Classic: Set Theoretic Algebraic Probabilistic Retrieval Structure Guided Hypertext Flat Hypertext Flat Browsing Logical View of Documents USERTASKUSERTASK

12 T.Sharon - A.Frank Query Language (1) Keyword-based Querying –Single-word Queries –Context Queries Phrase Proximity –Boolean Queries –Natural Language

13 T.Sharon - A.Frank Query Language (2) Pattern Matching –Words –Prefixes –Suffixes –Substring –Ranges –Allowing errors –Regular expressions

14 T.Sharon - A.Frank Query Language (3) Structural Queries –Form-like fixed structures –Hypertext structure –Hierarchical structure

15 T.Sharon - A.Frank Structural Queries (a)form-like fixed structure, (b) hypertext structure, and (c) hierarchical structure

16 T.Sharon - A.Frank Hierarchical Structure An example of a hierarchical structure: the page of a book, its schematic view, and a parsed query to retrieve the figure