1 Personalized IR Reloaded Xuehua Shen

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

Relevance Feedback User tells system whether returned/disseminated documents are relevant to query/information need or not Feedback: usually positive sometimes.
Chapter 5: Introduction to Information Retrieval
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Data Mining Feature Selection. Data reduction: Obtain a reduced representation of the data set that is much smaller in volume but yet produces the same.
Supervised Learning Techniques over Twitter Data Kleisarchaki Sofia.
Clustering approaches for high- throughput data Sushmita Roy BMI/CS 576 Nov 12 th, 2013.
Search Engines and Information Retrieval
Learning to Rank: New Techniques and Applications Martin Szummer Microsoft Research Cambridge, UK.
K nearest neighbor and Rocchio algorithm
Carnegie Mellon 1 Maximum Likelihood Estimation for Information Thresholding Yi Zhang & Jamie Callan Carnegie Mellon University
Clustering… in General In vector space, clusters are vectors found within  of a cluster vector, with different techniques for determining the cluster.
Learning Techniques for Information Retrieval Perceptron algorithm Least mean.
Information Retrieval Ch Information retrieval Goal: Finding documents Search engines on the world wide web IR system characters Document collection.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
Presented by Zeehasham Rasheed
Exploration & Exploitation in Adaptive Filtering Based on Bayesian Active Learning Yi Zhang, Jamie Callan Carnegie Mellon Univ. Wei Xu NEC Lab America.
An investigation of query expansion terms Gheorghe Muresan Rutgers University, School of Communication, Information and Library Science 4 Huntington St.,
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Information Retrieval in Practice
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
Processing of large document collections Part 2 (Text categorization) Helena Ahonen-Myka Spring 2006.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
Personalized Web Search by Mapping User Queries to Categories Fang Liu Presented by Jing Zhang CS491CXZ February 26, 2004.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Clustering Personalized Web Search Results Xuehua Shen and Hong Cheng.
22 nd January 2004 UITV 2004 NewsBoy: an interactive news retrieval system Joemon M Jose The Information Retrieval Group Department of Computing Science.
Clustering Supervised vs. Unsupervised Learning Examples of clustering in Web IR Characteristics of clustering Clustering algorithms Cluster Labeling 1.
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
Personalized Search Xiao Liu
UCAIR Project Xuehua Shen, Bin Tan, ChengXiang Zhai
1 Large-Scale Information Filtering Systems Fatma Ozcan May 9, 2000 University of Maryland, College Park.
1 Automatic Classification of Bookmarked Web Pages Chris Staff Second Talk February 2007.
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
1 01/10/09 1 INFILE CEA LIST ELDA Univ. Lille 3 - Geriico Overview of the INFILE track at CLEF 2009 multilingual INformation FILtering Evaluation.
Research Topics/Areas. Adapting search to Users Advertising and ad targeting Aggregation of Results Community and Context Aware Search Community-based.
Carnegie Mellon Novelty and Redundancy Detection in Adaptive Filtering Yi Zhang, Jamie Callan, Thomas Minka Carnegie Mellon University {yiz, callan,
A Novel Visualization Model for Web Search Results Nguyen T, and Zhang J IEEE Transactions on Visualization and Computer Graphics PAWS Meeting Presented.
Active Feedback in Ad Hoc IR Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
An Adaptive User Profile for Filtering News Based on a User Interest Hierarchy Sarabdeep Singh, Michael Shepherd, Jack Duffy and Carolyn Watters Web Information.
Context-Sensitive IR using Implicit Feedback Xuehua Shen, Bin Tan, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Relevant Document Distribution Estimation Method for Resource Selection Luo Si and Jamie Callan School of Computer Science Carnegie Mellon University
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
PREDICTING STOCK MARKET MOVEMENT USING SENTIMENTS For EECSE 6898-From Data to Solutions class Presented by-Tulika Bhatt(tb2658)
1 INFILE - INformation FILtering Evaluation Evaluation of adaptive filtering systems for business intelligence and technology watch Towards real use conditions.
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Sampath Jayarathna Cal Poly Pomona
Information Organization: Overview
MMS Software Deliverables: Year 1
Author: Kazunari Sugiyama, etc. (WWW2004)
Document Visualization at UMBC
Paraskevi Raftopoulou, Euripides G.M. Petrakis
موضوع پروژه : بازیابی اطلاعات Information Retrieval
Ying Dai Faculty of software and information science,
Authors: Wai Lam and Kon Fan Low Announcer: Kyu-Baek Hwang
Lecture 4. Niching and Speciation (1)
Semi-Automatic Data-Driven Ontology Construction System
Information Organization: Overview
Presentation transcript:

1 Personalized IR Reloaded Xuehua Shen

2 Self-Adaptive User Profiles for Large Scale Data Delivery U. Cetintemel, M. J. Franklin, C. L. Giles UMCP UCB PSB ICDE2000 A DB conference paper that uses IR techniques

3 Usage of User Profile Personalized Search Information Filtering Push-based Data Delivery

4 Focus of the Presentation Representation, Construction and Maintenance of User Profile Experiment Design

5 Representation of User Profile Multi-Modal Approach,  multiple vectors (clusters) for multiple user interests  correlated vectors

6 Construction of User Profile Given a new judged document Select the most similar vector Decide whether to incorporate doc or create a new vector  If incorporation, do linear update

7 Maintenance of User Profile Merge For the updated vector, if similar vector exists, Do Merge Deletion (Interest Shift) Strength of vector If strength is below deletion threshold, Delete this vector

8 Design of Experiment No Real User: User Simulation No TREC evaluation: Yahoo! Category Training and then Testing

9 Set of Experiments Performance Comparison with existing algorithm RI and RG RI and RG single vector representation

10 Several Parameters of Experiment User Interest (User Profile) Complexity Threshold Change of User Interest  Interest Shifting  Interest Addition  Interest Deletion

11 Results

12 Threshold

13 Interest Shift

14 Summary Multiple vectors for multiple user interests Automatic and Adaptive maintenance (Insertion, Merge, Deletion)

15 Discussion Parameters [Yi Zhang ICML2003] Other Representation of User Profile? Explicit Feedback