Special Topics in Computer Science Advanced Topics in Information Retrieval Chapter 1: Introduction Alexander Gelbukh www.Gelbukh.com.

Slides:



Advertisements
Similar presentations
Special Topics in Computer Science The Art of Information Retrieval Chapter 10: User Interfaces and Visualization Alexander Gelbukh
Advertisements

Special Topics in Computer Science Advanced Topics in Information Retrieval Chapter 2: Modeling Alexander Gelbukh
Special Topics in Computer Science The Art of Information Retrieval Chapter 1: Introduction Alexander Gelbukh
1 Alexander Gelbukh Moscow, Russia. 2 Mexico 3 Computing Research Center (CIC), Mexico.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Chapter 5: Introduction to Information Retrieval
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
Multimedia Database Systems
Modern Information Retrieval Chapter 1: Introduction
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Search Engines and Information Retrieval
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
ISP 433/533 Week 2 IR Models.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Modern Information Retrieval Chapter 1: Introduction
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
A Flexible Workbench for Document Analysis and Text Mining NLDB’2004, Salford, June Gulla, Brasethvik and Kaada A Flexible Workbench for Document.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Information Retrieval in Practice
INFORMATION RETRIEVAL WEEK 1 AND 2
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
1 Information Retrieval and Web Search Introduction.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Advance Information Retrieval Topics Hassan Bashiri.
Modern Information Retrieval Chapter 1 Introduction.
Basic IR Concepts & Techniques ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
CS580: Building Web Based Information Systems Roger Alexander & Adele Howe The purpose of the course is to teach theory and practice underlying the construction.
Recuperação de Informação. IR: representation, storage, organization of, and access to information items Emphasis is on the retrieval of information (not.
Chapter 5: Information Retrieval and Web Search
 IR: representation, storage, organization of, and access to information items  Focus is on the user information need  User information need:  Find.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Search Engines and Information Retrieval Chapter 1.
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Modern Information Retrieval Computer engineering department Fall 2005.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Information Retrieval Introduction/Overview Material for these slides obtained from: Modern Information Retrieval by Ricardo Baeza-Yates and Berthier Ribeiro-Neto.
Information Retrieval Models - 1 Boolean. Introduction IR systems usually adopt index terms to process queries Index terms:  A keyword or group of selected.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Chapter 6: Information Retrieval and Web Search
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Search Engine Architecture
Structure of IR Systems INST 734 Module 1 Doug Oard.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Modern Information Retrieval Presented by Miss Prattana Chanpolto Faculty of Information Technology.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
1 Information Retrieval LECTURE 1 : Introduction.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Recuperação de Informação Cap. 01: Introdução 21 de Fevereiro de 1999 Berthier Ribeiro-Neto.
Information Retrieval
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Information Retrieval in Practice
Information Retrieval (in Practice)
Modern Information Retrieval
Information Retrieval and Web Search
Search Engine Architecture
Information Retrieval and Web Search
Multimedia Information Retrieval
Information Retrieval
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
CSE 635 Multimedia Information Retrieval
Introduction to Information Retrieval
Chapter 5: Information Retrieval and Web Search
Search Engine Architecture
Information Retrieval and Extraction
Recuperação de Informação
Presentation transcript:

Special Topics in Computer Science Advanced Topics in Information Retrieval Chapter 1: Introduction Alexander Gelbukh

2 Motivation First for libraries, but now WWW!!! Info: representation, storage, organization, access Search Engines (IR systems) User information need oPlain English description query Concerns of modern IR: omodeling oclassification, categorization, filtering osystem architecture ouser interfaces, visualization, query languages

3 Data vs. Information Retrieval Data Retrieval Precise description Well-structured data Precise results Yes-or-no results Science Information Retrieval Vague information need Natural Language, images,... Semantic interpretation Approximate results Relevance ranking Art!

4 Basic Concepts User task (search) oCan formulate what they need: Retrieval (classical) oCant (or does not know): Browsing (new to IR) Still not very well integrated oFiltering (user passive, contents active) Logical view of docs o... Added linguistic info... not clear if helps oFull text oText operations: reduce complexity to index terms Keywords, stopwords Stemming, noun groups (linguistic processing needed) oCategories Slow, good Fast, bad

5 Past, Present, and Future Since clay tablets oAlphabetical index (formal) oTable of Contents (by storing order) oClassifications (by meaning) Libraries oAutomation of classical techniques. Catalogs. oSearch by fields (exact match: author, title, keywords) Web & Digital Libraries: interactive oCheaper huge amount of data oNetworks remote access, wider audience oFree publishing unprepared, heterogeneous data Artificial Intelligence and Linguistic methods

6 Main concerns Open audience oHelp people to formulate their information need oImprove retrieval quality. Intelligent methods Efficiency (speed) oDevelopment of fast techniques Interaction oWatch user behavior to improve quality oPrivacy! Open content oLegal issues. Copyright. Responsibility for info quality oIntelligent methods

7 Retrieval process Database oDefine the logical view: text operations, text model Index (e.g., inverted file) User query oQuery operations (users are not good at this!) Retrieved docs oRanked by likelihood (relevance) Feedback cycle

9 The Textbook: Text IR Models and Evaluation oModeling (basic concepts) oRetrieval Evaluation Improvements on Retrieval oQuery Languages oQuery Operations oText Languages and Properties oText Operations Efficiency oIndexing and Searching

10 Conferences & Journals Confs on IR oIR oACM SIGIR oTREC oSPIRE Journal oIR General conferences on text processing oACL oCOLING oCICLing oDEXA (databases) oNLDB

11 Conclusions User Information Need oVague oSemantic, not formal Document Relevance oOrder, not retrieve Huge amount of information oEfficiency concerns oTradeoffs IR is art more than science

12 Thank you!