Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Natural Language Processing WEB SEARCH ENGINES August, 2002.
Introduction to Information Retrieval (Part 2) By Evren Ermis.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Search Engines and Information Retrieval
Information Retrieval Review
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
FACT: A Learning Based Web Query Processing System Hongjun Lu, Yanlei Diao Hong Kong U. of Science & Technology Songting Chen, Zengping Tian Fudan University.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Information Access Douglas W. Oard College of Information Studies and Institute for Advanced Computer Studies Design Understanding.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Basic IR Concepts & Techniques ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
University of Kansas Data Discovery on the Information Highway Susan Gauch University of Kansas.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Web Data Mining and Applications Part I
Overview of Search Engines
AdWords Instructor: Dawn Rauscher. Quality Score in Action 0a2PVhPQhttp:// 0a2PVhPQ.
The 2nd International Conference of e-Learning and Distance Education, 21 to 23 February 2011, Riyadh, Saudi Arabia Prof. Dr. Torky Sultan Faculty of Computers.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
© 2007 Tom Beckman Features:  Are autonomous software entities that act as a user’s assistant to perform discrete tasks, simplifying or completely automating.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Technovation Incorporating Feedback Week 4. Check-in: paper prototype By now, your paper prototype should be complete, so that you can begin creating.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
Search Engine Architecture
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
The Structure of Information Retrieval Systems LBSC 708A/CMSC 838L Douglas W. Oard and Philip Resnik Session 1: September 4, 2001.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Information Retrieval
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
B. Trousse, R. Kanawati - JTE : Advanced Services on the Web, Paris 7 may 1999 Broadway: a recommendation computation approach based on user behaviour.
WIRED Future Quick review of Everything What I do when searching, seeking and retrieving Questions? Projects and Courses in the Fall Course Evaluation.
User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d)
KMS & Collaborative Filtering Why CF in KMS? CF is the first type of application to leverage tacit knowledge People-centric view of data Preferences matter.
IST 497G Ron Grzywacz November 2002 Personalization of Information Retrieval.
UOS Personalized Search Zhang Tao 장도. Zhang Tao Data Mining Contents Overview 1 The Outride Approach 2 The outride Personalized Search System 3 Testing.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
Autonomous Interface Agents Henry Lieberman Media Laboratory, MIT Presented by Sumit Taank Vishal Mishra.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Information Retrieval in Practice
Recommender Systems & Collaborative Filtering
Search Engine Architecture
Search Engine Architecture
Augmenting (personal) IR
موضوع پروژه : بازیابی اطلاعات Information Retrieval
Lecture 8 Information Retrieval Introduction
Search Engine Architecture
Presentation transcript:

Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time

Relevance Feedback in IR Already in most systems - Improved query formulations - System evaluation of system Works from natural characteristics in documents - More interesting to work from the NC of people - Personal Relevance Feedback If you don’t know what the document set is, how do you reformulate a query? - Browse by query, then search - (Bibliometric) chaining

Old School Query Reformulation Identify core terms in document database Deemphasize (not index?) less core terms “We know what’s good for you” - Small set of documents - Accurate knowledge of users Small steps, building to quality documents Weights of queries are shifted - Preferred terms - Partial weights 0 - 1

It’s all about vectors Remember VectorSpace? Documents Queries How similar is the query to a document? - Averages and weights give final set - Length and location

Probability Relevance Feedback Document-based not term based Ranking documents be their content - And tweaked weights Depends on variety of documents in database - Wider variance = harder to predict - More processing power can help - Means to average and normalize values Ad hoc adjustment, relative weighting - Using the found documents as additional queries How do you evaluate RFS as doc db changes? - Previous retrieval is key, but not with changes - Adding common terms may help (in general)

IR & Filtering Are they the same? - Is a filter a proactive search? - Does filtering lead to better browsing, which leads to less need for searching? Good for lots of changing text (Web) Active use What about push media with filters? - RSS -

What do we mean by augment? Douglas Englebart’s system - GUI - Interaction - Connectivity - Management Improve upon Extend user capabilities Do what you want, but faster “Do what I mean, not what I say” What are some ways to augment?

What is Personalization? In computing? - Optimized - System specific In interfaces? - Modes of interaction - Appropriate for user level For IR? - Results - Time - Mode - (Relevance) Feedback

Personalized IR system design How would you design a personal IR system? Who would use it? How would you learn about them? - Interests - Sources - Preferences How do you evaluate a personal system? Understanding users is the key to personalizing search or search interfaces.

Letizia Interleaving browsing with (automated) search Augmented browsing = less searching? Understanding your usage preferences - “Behavior based” - Letizia explores for you “doing concurrent, autonomous exploration of links from the user’s current position” p1 - PageRank for individuals? - PageRank for the exact situation? - Smart crawling based on a profile?

Letizia’s Inferences What you do tells the systems your interests and habits List of keywords about your interests Persistence of interest issues - Shifts - Time to restate interest Automated queries, keyword matches Doesn’t get in the way (much) What about the interface? Making Web search better?

Siteseer “Personalized navigation for the Web” Isn’t this a CF system? Bookmarks are key indicators of interest Category fits Implicit recommendations

How to personalize the Web: WBI Interests are bookmarks or home pages - Links - Text Proxy-like between the user the Web Agent like functions - Monitor - records features - Editor - tweaks retrieved information - Generator - request to response - Autonomous agent - triggers

Outride Data mining for personalized search Fast model fitting for profiles Search keyword augmentation - Interests - Preferences “Contextual Computing” - Just in time information - Situational More than content analysis “Author relevancy”

Personalized Search Efficiency Contextualization - Activity - Availability Individualization - User goals (models of Iseek) - (Past) behavior Interface Awareness and customization

Personalization vs. Customization What’s the difference? - For a system, for a user - Interaction methods, selection methods My.yahoo.com vs. amazon.com AskJeeves vs. a Reference Librarian

WIRED System Evaluations Install IR software Set up documents for indexing - What types of documents - Sizes, formats, time to index? Perform some searches - Note search functionality - Describe (screen shot?) interface for search Examine results - Describe (screen shot?) results page/screen - Rotate, use subset of documents - Note differences in queries What model, index, system do you think the system uses (based on class discussions & readings)?

System Evaluation Questions What do these systems seem to offer? How would you use them? How would a group use them? Can it affect the way you search? - The way you work? - The way you store/organize information? What’s different than you expected? - Better or Worse? - From your deign ideas?