Structure of IR Systems

Slides:



Advertisements
Similar presentations
Recuperação de Informação B Cap. 10: User Interfaces and Visualization 10.1,10.2,10.3 November 17, 1999.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
System Concepts. What is a System? Set of inter-related components with a clearly defined boundary Working together to achieve objectives.
Search and Ye Shall Find (maybe) Seminar on Emergent Information Technology August 20, 2007 Douglas W. Oard.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Ranked Retrieval INST 734 Module 3 Doug Oard. Agenda  Ranked retrieval Similarity-based ranking Probability-based ranking.
Overview of Collaborative Information Retrieval (CIR) at FIRE 2012 Debasis Ganguly, Johannes Leveling, Gareth Jones School of Computing, CNGL, Dublin City.
Information Retrieval in Practice
Search Engines and Information Retrieval
Information Retrieval Review
The Vector Space Model LBSC 796/CMSC828o Session 3, February 9, 2004 Douglas W. Oard.
Design The goal is to design a modular solution, using the techniques of: Decomposition Abstraction Encapsulation In Object Oriented Programming this is.
Information Retrieval Interaction CMSC 838S Douglas W. Oard April 27, 2006.
Modern Information Retrieval Chapter 5 Query Operations.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Project Life Cycle Jon Ivins DMU. Introduction n Projects consist of many separate components n Constraints include: time, costs, staff, equipment n Assets.
Evaluating the Performance of IR Sytems
Advance Information Retrieval Topics Hassan Bashiri.
Information Access Douglas W. Oard College of Information Studies and Institute for Advanced Computer Studies Design Understanding.
Learning Techniques for Information Retrieval We cover 1.Perceptron algorithm 2.Least mean square algorithm 3.Chapter 5.2 User relevance feedback (pp )
Overview of Search Engines
Search Engines and Information Retrieval Chapter 1.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2006.
Evaluation INST 734 Module 5 Doug Oard. Agenda  Evaluation fundamentals Test collections: evaluating sets Test collections: evaluating rankings Interleaving.
By Chung-Hong Lee ( 李俊宏 ) Assistant Professor Dept. of Information Management Chang Jung Christian University 資料庫與資訊檢索系統的整合 - 一個文件資料庫系統的開發研究.
INTRO TO USABILITY Lecture 12. What is Usability?  Usability addresses the relationship between tools and their users. In order for a tool to be effective,
Software Engineering Saeed Akhtar The University of Lahore Lecture 6 Originally shared for: mashhoood.webs.com.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
IR Theory: Relevance Feedback. Relevance Feedback: Example  Initial Results Search Engine2.
Exploratory Visualization of Infectious Disease Propagation Ben Houston, Neuralsoft Zack Jacobson, Health Canada NX-Workshop on Social Network Analysis.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
The Structure of Information Retrieval Systems LBSC 708A/CMSC 838L Douglas W. Oard and Philip Resnik Session 1: September 4, 2001.
1 Opinion Retrieval from Blogs Wei Zhang, Clement Yu, and Weiyi Meng (2007 CIKM)
Structure of IR Systems INST 734 Module 1 Doug Oard.
Web Search Module 6 INST 734 Doug Oard. Agenda The Web Crawling  Web search.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
Information Retrieval Techniques Israr Hanif M.Phil QAU Islamabad Ph D (In progress) COMSATS.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Jane Reid, AMSc IRIC, QMUL, 30/10/01 1 Information seeking Information-seeking models Search strategies Search tactics.
Performance Measurement. 2 Testing Environment.
Structure of IR Systems LBSC 796/INFM 718R Session 1, September 10, 2007 Doug Oard.
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Structure of IR Systems LBSC 796/INFM 718R Session 1, January 26, 2011 Doug Oard.
Evaluation INST 734 Module 5 Doug Oard. Agenda Evaluation fundamentals Test collections: evaluating sets Test collections: evaluating rankings Interleaving.
Why IR test collections are so bad Mark Sanderson University of Sheffield.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Interaction LBSC 734 Module 4 Doug Oard. Agenda Where interaction fits Query formulation Selection part 1: Snippets Selection part 2: Result sets  Examination.
Indexing LBSC 796/CMSC 828o Session 9, March 29, 2004 Doug Oard.
Search Engines Session 5 INST 301 Introduction to Information Science.
November 8, 2005NSF Expedition Workshop Supporting E-Discovery with Search Technology Douglas W. Oard College of Information Studies and Institute for.
Search Strategies Session 7 INST 301 Introduction to Information Science.
Cross-Language Information Retrieval Applied Natural Language Processing October 29, 2009 Douglas W. Oard.
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
Information Retrieval in Practice
Information Organization: Overview
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
What is Information Retrieval (IR)?
Search Engine Architecture
Tingdan Luo 05/02/2016 Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem Tingdan Luo
Thanks to Bill Arms, Marti Hearst
Usability Techniques Lecture 13.
Evidence from Behavior
CHAPTER 9 (part a) BASIC INFORMATION SYSTEMS CONCEPTS
Lecture 8 Information Retrieval Introduction
Search Engine Architecture
Information Organization: Overview
Information Retrieval and Web Design
Lab 2: Information Retrieval
Introduction to Search Engines
Presentation transcript:

Structure of IR Systems INST 734 Module 1 Doug Oard 1

Segments The nature of Information Retrieval (IR) What IR systems do The structure of interactive IR systems 2

Taylor’s Model of Question Formation Q1 Visceral Need End-user Search Q2 Conscious Need Intermediated Search Q3 Formalized Need Q4 Compromised Need (Query)

Iterative Search Searchers often don’t clearly understand The problem they are trying to solve What information is needed to solve the problem How to ask for that information The query results from a clarification process Dervin’s “sense making”: Need Gap Bridge 8

Design Strategies Foster human-machine synergy Divide-and-conquer Exploit complementary strengths Accommodate shared weaknesses Divide-and-conquer Divide task into stages with well-defined interfaces Continue dividing until problems are easily solved Co-design related components Iterative process of joint optimization 16

Human-Machine Synergy Machines are good at: Doing simple things accurately and quickly Scaling to larger collections in sublinear time People are better at: Accurately recognizing what they are looking for Evaluating intangibles such as “quality” Both are pretty bad at: Mapping consistently between words and concepts 17

Divide and Conquer Strategy: use encapsulation to limit complexity Approach: Define interfaces (input and output) for each component Define the functions performed by each component Build each component (in isolation) See how well each component works Then redefine interfaces to exploit strengths / cover weakness See how well it all works together Then refine the design to account for unanticipated interactions Result: a hierarchical decomposition Okay, I’ve presented a very complex field… how do we actually go about studying it? Example of a decomposition that doesn’t make sense: separating engine from transmission 19

Supporting the Search Process Predict Source Selection Nominate Choose Query Formulation IR System Source Reselection Search Query Query Reformulation and Relevance Feedback Selection Ranked List Examination Document Delivery Document

Supporting the Search Process Source Selection Query Formulation IR System Search Query Selection Ranked List Indexing Index Examination Document Acquisition Collection Delivery Document

Process/System Co-Design

Looking Ahead Modules 2: Indexing Module 3: Ranking Module 4: Interaction Module 5: Evaluation