Information Retrieval Techniques Israr Hanif M.Phil QAU Islamabad Ph D (In progress) COMSATS.

Slides:



Advertisements
Similar presentations
Special Topics in Computer Science Advanced Topics in Information Retrieval Chapter 1: Introduction Alexander Gelbukh
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Chapter 5: Introduction to Information Retrieval
INTRODUCTION TO MODELING
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
LBSC 796/INFM 718R: Week 1 Introduction to Information Retrieval Jimmy Lin College of Information Studies University of Maryland Monday, January 30, 2006.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Ranked Retrieval INST 734 Module 3 Doug Oard. Agenda  Ranked retrieval Similarity-based ranking Probability-based ranking.
Kalervo Järvelin – Issues in Context Modeling – ESF IRiX Workshop - Glasgow THEORETICAL ISSUES IN CONTEXT MODELLING Kalervo Järvelin
Search Engines and Information Retrieval
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
CAP 252 Lecture Topic: Requirement Analysis Class Exercise: Use Cases.
Information Retrieval February 24, 2004
Information Retrieval in Practice
INFORMATION RETRIEVAL WEEK 1 AND 2
Information Retrieval: Human-Computer Interfaces and Information Access Process.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 1: Introduction to Decision Support Systems Decision Support.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
1 CS 430 / INFO 430 Information Retrieval Lecture 24 Usability 2.
Advance Information Retrieval Topics Hassan Bashiri.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.
Chapter 8: Introduction to High-level Language Programming Invitation to Computer Science, C++ Version, Third Edition.
CSC230 Software Design (Engineering)
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 16 Slide 1 User interface design.
Search Engines and Information Retrieval Chapter 1.
The Cognitive Perspective in Information Science Research Anthony Hughes Kristina Spurgin.
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
Modern Information Retrieval Computer engineering department Fall 2005.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2006.
Problem Solving Techniques. Compiler n Is a computer program whose purpose is to take a description of a desired program coded in a programming language.
Search Engine Architecture
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
The Structure of Information Retrieval Systems LBSC 708A/CMSC 838L Douglas W. Oard and Philip Resnik Session 1: September 4, 2001.
Structure of IR Systems INST 734 Module 1 Doug Oard.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
Structure of IR Systems INST 734 Module 1 Doug Oard.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Measuring How Good Your Search Engine Is. *. Information System Evaluation l Before 1993 evaluations were done using a few small, well-known corpora of.
Jane Reid, AMSc IRIC, QMUL, 30/10/01 1 Information seeking Information-seeking models Search strategies Search tactics.
Structure of IR Systems LBSC 796/INFM 718R Session 1, September 10, 2007 Doug Oard.
Information Retrieval
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Jhu-hlt-2004 © n.j. belkin 1 Information Retrieval: A Quick Overview Nicholas J. Belkin
L&I SCI 110: Information science and information theory Instructor: Xiangming(Simon) Mu Sept. 9, 2004.
Structure of IR Systems LBSC 796/INFM 718R Session 1, January 26, 2011 Doug Oard.
1 13/05/07 1/20 LIST – DTSI – Interfaces, Cognitics and Virtual Reality Unit The INFILE project: a crosslingual filtering systems evaluation campaign Romaric.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Relevance Feedback Prof. Marti Hearst SIMS 202, Lecture 24.
INFORMATION STROAGE AND RETRIEVAL SYSTEM By Ms. Preeti Patel Lecturer School of Library And Information Science DAVV, Indore
Seminar on Information Retrieval 정보검색론특강
Definition, purposes/functions, elements of IR systems Lesson 1.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
Information Retrieval in Practice
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Information Retrieval (in Practice)
What is Information Retrieval (IR)?
Search Engine Architecture
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Introduction into Knowledge and information
Introduction to Information Retrieval
Lecture 8 Information Retrieval Introduction
Search Engine Architecture
Structure of IR Systems
Presentation transcript:

Information Retrieval Techniques Israr Hanif M.Phil QAU Islamabad Ph D (In progress) COMSATS

Information Retrieval Techniques MS(CS) Lecture 1 AIR UNIVERSITY MULTAN CAMPUS

Information Retrieval Systems Information – What is “information”? Retrieval – What do we mean by “retrieval”? – What are different types information needs? Systems – How do computer systems fit into the human information seeking process?

Dictionary says… Oxford English Dictionary – information: informing, telling; thing told, knowledge, items of knowledge, news – knowledge: knowing familiarity gained by experience; person’s range of information; a theoretical or practical understanding of; the sum of what is known Random House Dictionary – information: knowledge communicated or received concerning a particular fact or circumstance; news

Intuitive Notions Information must – Be something, although the exact nature (substance, energy, or abstract concept) is not clear; – Be “new”: repetition of previously received messages is not informative – Be “true”: false or counterfactual information is “mis-information” – Be “about” something Robert M. Losee. (1997) A Discipline Independent Definition of Information. Journal of the American Society for Information Science, 48(3),

Information Hierarchy Data InformationKnowledge Wisdom More refined and abstract

Information Hierarchy Data – The raw material of information Information – Data organized and presented in a particular manner Knowledge – “Justified true belief” – Information that can be acted upon Wisdom – Distilled and integrated knowledge – Demonstrative of high-level “understanding”

A (Facetious) Example Data – 98.6º F, 99.5º F, 100.3º F, 101º F, … Information – Hourly body temperature: 98.6º F, 99.5º F, 100.3º F, 101º F, … Knowledge – If you have a temperature above 100º F, you most likely have a fever Wisdom – If you don’t feel well, go see a doctor

What types of information? Text (Documents and portions thereof) XML and structured documents Images Audio (sound effects, songs, etc.) Video Source code Applications/Web services

“Retrieval?” “Fetch something” that’s been stored Recover a stored state of knowledge Search through stored messages to find some messages relevant to the task at hand SenderRecipient EncodingDecoding storage message noise indexing/writingRetrieval/reading

What is IR? Information retrieval is a problem-oriented discipline, concerned with the problem of the effective and efficient transfer of desired information between human generator and human user Information retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers). Anomalous States of Knowledge as a Basis for Information Retrieval. (1980) Nicholas J. Belkin. Canadian Journal of Information Science, 5,

What is Information Retrieval ? The process of actively seeking out information relevant to a topic of interest (van Rijsbergen) – Typically it refers to the automatic (rather than manual) retrieval of documents Information Retrieval System (IRS) – “Document” is the generic term for an information holder (book, chapter, article, webpage, etc)

Hopkins IR Workshop 2005Copyright © Victor Lavrenko What is Information Retrieval? Most people equate IR with web-search – highly visible, commercially successful endeavors – leverage 3+ decades of academic research IR: finding any kind of relevant information – web-pages, news events, answers, images, … – “relevance” is a key notion (details in Part II)

14

15

The formalized IR process Collection of documents Real world Document representations Query Information need Anomalous state of knowledge Matching Results

What do we want from an IRS ? Systemic approach – Goal (for a known information need): Return as many relevant documents as possible and as few non-relevant documents as possible Cognitive approach – Goal (in an interactive information-seeking environment, with a given IRS): Support the user’s exploration of the problem domain and the task completion.

The role of an IR system – a modern view – Support the user in – exploring a problem domain, understanding its terminology, concepts and structure – clarifying, refining and formulating an information need – finding documents that match the info need description As many relevant docs as possible As few non-relevant documents as possible

How does it do this ? User interfaces and visualization tools for – exploring a collection of documents – exploring search results Query expansion based on – Thesauri – Lexical/statistic analysis of text / context and concept formation – Relevance feedback Indexing and matching model

How well does it do this ? Evaluation – Of the components Indexing / matching algorithms – Of the exploratory process overall Usability issues Usefulness to task User satisfaction

Role of the user interface in IR Problem definition Source selection Problem articulation Examination of results Extraction of information Integration with overall task INPUT OUTPUT Engine

The Big Picture The four components of the information retrieval environment: – User – Process – System – Collection What computer geeks care about! What we care about!

The Information Retrieval Cycle Source Selection Search Quer y Selection Ranked List Examination Documents Delivery Documents Query Formulation Resource query reformulation, vocabulary learning, relevance feedback source reselection

Supporting the Search Process Source Selection Search Quer y Selection Ranked List Examination Documents Delivery Documents Query Formulation Resource Indexing Index Acquisition Collection

Simplification? Source Selection Search Quer y Selection Ranked List Examination Documents Delivery Documents Query Formulation Resource query reformulation, vocabulary learning, relevance feedback source reselection Is this itself a vast simplification?

The IR Black Box Documents Query Hits

Inside The IR Black Box Documents Query Hits Representation Function Representation Function Query RepresentationDocument Representation Comparison Function Index