Introduction Information Management systems are designed to retrieve information efficiently. Such systems typically provide an interface in which users.

Slides:



Advertisements
Similar presentations
Introduction to Information Retrieval Introduction to Information Retrieval Lecture 7: Scoring and results assembly.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Designing Scoring Rubrics. What is a Rubric? Guidelines by which a product is judged Guidelines by which a product is judged Explain the standards for.
Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.
Evaluating Search Engine
Deanery of Business & Computer Sciences Research Methods Week 1 Collecting, Processing and Analyzing Data.
1 Ranked Queries over sources with Boolean Query Interfaces without Ranking Support Vagelis Hristidis, Florida International University Yuheng Hu, Arizona.
Retrieval Evaluation: Precision and Recall. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity.
Case-based Reasoning System (CBR)
Computer comunication B Information retrieval. Information retrieval: introduction 1 This topic addresses the question on how it is possible to find relevant.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
The Vector Space Model …and applications in Information Retrieval.
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
1 MARG-DARSHAK: A Scrapbook on Web Search engines allow the users to enter keywords relating to a topic and retrieve information about internet sites (URLs)
Recommender systems Ram Akella November 26 th 2008.
IR Models: Review Vector Model and Probabilistic.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Overview of Search Engines
Indexing Overview Approaches to indexing Automatic indexing Information extraction.
LBTO IssueTrak User’s Manual Norm Cushing version 1.3 August 8th, 2007.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
Minimal Test Collections for Retrieval Evaluation B. Carterette, J. Allan, R. Sitaraman University of Massachusetts Amherst SIGIR2006.
Samhaa R. El-Beltagy, Wendy Hall, David De Roure, and Leslie Carr Intelligence, Agents, Multimedia Department of Electronics and Computer Science University.
Word Weighting based on User’s Browsing History Yutaka Matsuo National Institute of Advanced Industrial Science and Technology (JPN) Presenter: Junichiro.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Information Retrieval Models - 1 Boolean. Introduction IR systems usually adopt index terms to process queries Index terms:  A keyword or group of selected.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
University of Malta CSA3080: Lecture 4 © Chris Staff 1 of 14 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
You Are What You Tag Yi-Ching Huang and Chia-Chuan Hung and Jane Yung-jen Hsu Department of Computer Science and Information Engineering Graduate Institute.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
Alexey Kolosoff, Michael Bogatyrev 1 Tula State University Faculty of Cybernetics Laboratory of Information Systems.
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Intro to Critiquing Research Your tutorial task is for you to critique several articles so that you develop skills for your Assignment.
21/11/20151Gianluca Demartini Ranking Clusters for Web Search Gianluca Demartini Paul–Alexandru Chirita Ingo Brunkhorst Wolfgang Nejdl L3S Info Lunch Hannover,
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
Clustering C.Watters CS6403.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
The folksonomy tag cloud: when is it useful? James Sinclair and Michael Cardew-Hall Department of Engineering, The Australian National University, Canberra,
Measuring How Good Your Search Engine Is. *. Information System Evaluation l Before 1993 evaluations were done using a few small, well-known corpora of.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Information Retrieval
A Knowledge-Based Search Engine Powered by Wikipedia David Milne, Ian H. Witten, David M. Nichols (CIKM 2007)
Advantages of Query Biased Summaries in Information Retrieval by A. Tombros and M. Sanderson Presenters: Omer Erdil Albayrak Bilge Koroglu.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
Citation-Based Retrieval for Scholarly Publications 指導教授:郭建明 學生:蘇文正 M
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
A Multilingual Hierarchy Mapping Method Based on GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Designing Scoring Rubrics
Evaluation of IR Systems
Content analysis, thematic analysis and grounded theory
Augmenting (personal) IR
Multimedia Information Retrieval
Information Retrieval
The Starting Point: Asking Questions
Data Mining Chapter 6 Search Engines
Web Mining Department of Computer Science and Engg.
Magnet & /facet Zheng Liang
Chapter 11 user support.
Navigation-Aided Retrieval
Information Retrieval and Web Design
Connecting the Dots Between News Article
Presentation transcript:

Introduction Information Management systems are designed to retrieve information efficiently. Such systems typically provide an interface in which users formulate a search query to get a list of possible results. If users do not find what they are looking for, they must carefully refine and resubmit their query, and review the results again. Human memory tends to retrieve information in an associative way [1]. Our hypothesis is that storing an information set in a similar way— by using an associative semantic network to represent data as nodes and the relations between them as links [2]— will improve the effectiveness of information browsing behaviours through associative retrieval. Direct search is fast when appropriate keywords are used, but it can fail when users do not have a clear idea of their target. With associative networks, users begin as before with a direct search, but can then pick one of the result nodes and browse from there to other related nodes. By following links in the “semantic direction” of the target, users can “home in” on what they seek. Browsing can also be useful when searching for topics only vaguely defined in the user’s mind, or for getting a general idea of the thematic content of an information space. Method We examined whether similarity networks can aid retrieval tasks in large document corpora. We built an associative similarity network from two corpora of 2000 documents each, based on document keywords extracted with the tfidf algorithm. Each network node represents a single document, and is connected to other similar nodes with links weighted to reflect the degree of similarity. A common set of browsing and searching tasks were presented to 24 subjects, requesting them to retrieve documents which fit certain requirements. The interface used for the experiment is presented in Figure 1. For each task, half of the subjects used a standard search engine only, while the other half used the search engine enhanced by a browsing option that showed them the nearest neighbours in the network ranked by similarity. Quantitative and qualitative performance measures were taken to compare the two conditions. Analysis Performance in the tasks was evaluated using two metrics: 1.Correct answers to tasks were determined by two referees who reviewed user selections to judge whether each selection answered the task correctly. 2.Each selection was given an automatic score based on the frequency with which it was selected as an answer. There were 3 types of task: 1. Direct search tasks in which the target is well defined, so that it is easy to find using a conventional search engine. 2. Browsing tasks in which the target is not well defined, so that it is difficult find using a conventional search engine. 3. Topic tasks which ask the user to find all topics related to a particular theme discussed in the document corpus. Conclusions The results indicate that user performance improved when they were offered associative browsing in addition to conventional search. Improved performance was indicated by both performance measures, with significantly better results for browsing-type tasks. Contrary to our initial expectations, users did not tend to use the browse button to “home in” on targets. Instead, users used the browse button to find similar documents to one already found. Users typically found the first answer using a conventional search, and then used the browsing option to look for similar results. We conclude that semantic similarity networks can be used to improve user browsing behaviour, particularly in tasks for which the target is not well defined. More specifically, they were used to search exhaustively for related information given a known target. Improved Browsing with Semantic Networks Joel Lanir, Mike Huggett, Holger Hoos, Ronald Rensink Department of Computer Science, University of British Columbia Fig. 1. Experimental interface. Users were presented with a task (at the bottom of the screen) and were required to find the articles best suited to the task. References [1] Anderson, J. R. (1983) A spreading activation theory of memory. Journal of Verbal Learning and Verbal Behavior 22(3):261–295. [2] Crestani, F. (1997) Applications of spreading activation techniques in Information Retrieval, Artificial Intelligence Review 11(6): calculated score results of search and browse interfaces example: Find articles that discuss reporters (e.g. Judith Miller) going to jail for not revealing their source. Subjects used the browse interface button in 85.8% of assigned browsing tasks. 80.0% of browse-button usage involved documents already selected as answers for a task. example: Find articles criticizing U.S. policies in the Iraqi war. example: Determine the most common topics related to technology. Results Acknowledgements We would like to thank Precarn for the financial support.