Information Retrieval

Information Retrieval: First lessons

Basic ideas
There are users who need information, and there are information sources. Distinguish data, information, and knowledge. Information sources may be very well organized, indexed, and controlled; totally unorganized, uncharacterized, and uncontrolled; or something in between. The task is to connect the two in a way that matches information needs to the information available.

The role of databases
Databases hold specific data items. Organization is explicit, and keys relate items to each other. Queries are constrained, but effective in retrieving the data that is there. Databases generally respond to specific queries with specific results. Browsing is difficult, and searching for items not anticipated by the designers can be difficult.

The Web
The extreme opposite of a database: no organization, no overall structure, no index or key to the content. Searching and browsing are supported, but generally are not complete. (You will not know if you got every good response to your request. You may be able to tell that you got a response that meets your need, but may not know if you got the best response available.)

Digital Library
Something in between the very structured database and the unstructured Web. Content is controlled: someone makes the entries. (Many people may make the entries, but there are rules for admission.) Searching and browsing are somewhat open, not controlled by fixed keys and anticipated queries. The nature of the collection regulates indexing somewhat.

How do we know the response is good?
Precision: of the results returned, what percentage are meaningful to the goal of the query?
Recall: of the materials available that match the query, what percentage were returned?
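
As a rough illustration of the two measures, here is a minimal Python sketch. The function name `precision_recall` and the item ids are made up for the example, and it assumes we already know which items in the collection are relevant, which in practice is the hard part.

```python
def precision_recall(returned, relevant):
    """Return (precision, recall) for a single query.

    returned: ids of the items the system returned
    relevant: ids of the items in the collection that match the query
    """
    returned, relevant = set(returned), set(relevant)
    hits = returned & relevant  # returned items that are actually relevant
    precision = len(hits) / len(returned) if returned else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical query: 4 items returned, 3 of them relevant,
# while 6 relevant items exist in the collection.
p, r = precision_recall(
    ["d1", "d2", "d3", "d9"],
    ["d1", "d2", "d3", "d4", "d5", "d6"],
)
print(p, r)  # 0.75 0.5
```

Here three of the four returned items are relevant (precision 75%), but only three of the six relevant items were found (recall 50%).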

Text: Retrieval process

The process sequence
Query entered, query interpreted, index searched, items retrieved, results ranked.
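
The sequence can be sketched in a few lines of Python. This is only a toy under stated assumptions: the collection `DOCS`, the helper names, and the ranking rule (count how often the query terms occur in each item) are all invented for illustration; real systems interpret queries and rank results in far more elaborate ways.

```python
from collections import Counter

# Hypothetical three-item collection; ids and text are made up.
DOCS = {
    "d1": "boxer dogs are a breed of dog",
    "d2": "famous boxers in the history of boxing",
    "d3": "caring for a boxer puppy",
}


def interpret(query):
    """Query interpreted: lowercase the query and split it into terms."""
    return query.lower().split()


def build_index(docs):
    """Map each term to the set of item ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for term in set(text.lower().split()):
            index.setdefault(term, set()).add(doc_id)
    return index


def search(query, docs, index):
    terms = interpret(query)
    # Index searched: look up every query term.
    candidates = set()
    for term in terms:
        candidates |= index.get(term, set())
    # Items retrieved, then results ranked by how often the query terms occur.
    scores = {}
    for doc_id in candidates:
        counts = Counter(docs[doc_id].lower().split())
        scores[doc_id] = sum(counts[t] for t in terms)
    # Highest score first; ties are broken by id so the order is stable.
    return sorted(scores, key=lambda d: (-scores[d], d))


INDEX = build_index(DOCS)
print(search("boxer puppy", DOCS, INDEX))  # ['d3', 'd1']
```

Note that "d2" is not retrieved: this sketch matches exact terms only, so "boxers" does not match "boxer".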

The collection
Where does the collection come from? How is the index created? Those are important distinguishing characteristics.
Inverted index: an ordered list of terms related to the collected materials. Each term has an associated pointer to the related material(s).
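
A minimal Python sketch of an inverted index, assuming the simplest possible treatment of text (lowercase, split on whitespace). The function name and the sample items are hypothetical; the "pointers" here are just item ids.

```python
def build_inverted_index(docs):
    """Build an ordered mapping from each term to the ids of the items containing it."""
    index = {}
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index.setdefault(term, set()).add(doc_id)
    # Order the terms alphabetically and the pointers numerically.
    return {term: sorted(ids) for term, ids in sorted(index.items())}


# Hypothetical collection of three short items.
docs = {
    1: "digital library search",
    2: "web search engine",
    3: "digital web",
}

index = build_inverted_index(docs)
for term, ids in index.items():
    print(term, ids)
# digital [1, 3]
# engine [2]
# library [1]
# search [1, 2]
# web [2, 3]
```

Looking up a term is then a single dictionary access, for example `index["search"]`, rather than a scan of every item in the collection.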

CITIDEL
An example digital library. All items are relevant to computing education. Visit at http://www.citidel.org. CITIDEL is part of the National Science Digital Library, http://www.nsdl.org.