COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

Recommender Systems & Collaborative Filtering
Prediction Modeling for Personalization & Recommender Systems Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Chapter 5: Introduction to Information Retrieval
A Graph-based Recommender System Zan Huang, Wingyan Chung, Thian-Huat Ong, Hsinchun Chen Artificial Intelligence Lab The University of Arizona 07/15/2002.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Recommendations via Collaborative Filtering. Recommendations Relevant for movies, restaurants, hotels…. Recommendation Systems is a very hot topic in.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
CS 5831 CS583 – Data Mining and Text Mining Course Web Page 05/cs583.html.
Chapter 5: Information Retrieval and Web Search
CS583 – Data Mining and Text Mining Course Web Page 07/cs583.html.
Mining and Searching Opinions in User-Generated Contents Bing Liu Department of Computer Science University of Illinois at Chicago.
A Holistic Lexicon-Based Approach to Opinion Mining
Chapter 12 (Section 12.4) : Recommender Systems Second edition of the book, coming soon.
Item-based Collaborative Filtering Recommendation Algorithms
Chapter 11. Opinion Mining. Bing Liu, UIC ACL-07 2 Introduction – facts and opinions Two main types of information on the Web.  Facts and Opinions Current.
CES 514 – Data Mining Spring 2010 Sonoma State University.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
A Holistic Lexicon-Based Approach to Opinion Mining Xiaowen Ding, Bing Liu and Philip Yu Department of Computer Science University of Illinois at Chicago.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
CS 5831 CS583 – Data Mining and Text Mining Course Web Page 06/cs583.html.
Chapter 6: Information Retrieval and Web Search
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
Evaluation of Recommender Systems Joonseok Lee Georgia Institute of Technology 2011/04/12 1.
1 Collaborative Filtering & Content-Based Recommending CS 290N. T. Yang Slides based on R. Mooney at UT Austin.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
McCormick Northwestern Engineering 1 Electrical Engineering & Computer Science Mining Millions of Reviews: A Technique to Rank Products Based on Importance.
ASSOCIATIVE BROWSING Evaluating 1 Jinyoung Kim / W. Bruce Croft / David Smith for Personal Information.
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
User Modeling and Recommender Systems: recommendation algorithms
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
COMP423 Summary Information retrieval and Web search  Vecter space model  Tf-idf  Cosine similarity  Evaluation: precision, recall  PageRank 1.
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
Item-Based Collaborative Filtering Recommendation Algorithms
Data Mining and Text Mining. The Standard Data Mining process.
Overview Issues in Mobile Databases – Data management – Transaction management Mobile Databases and Information Retrieval.
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
CF Recommenders.
Data mining (KDD) process
Web Mining Ref:
Methods and Metrics for Cold-Start Recommendations
Machine Learning With Python Sreejith.S Jaganadh.G.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Adopted from Bin UIC Recommender Systems Adopted from Bin UIC.
כריית מידע -- מבוא ד"ר אבי רוזנפלד.
Author: Kazunari Sugiyama, etc. (WWW2004)
CS583 – Data Mining and Text Mining
Multimedia Information Retrieval
Movie Recommendation System
CSE 635 Multimedia Information Retrieval
Web Mining Department of Computer Science and Engg.
Recommender Systems Copyright: Dietmar Jannah, Markus Zanker and Gerhard Friedrich (slides based on their IJCAI talk „Tutorial: Recommender Systems”)
Chapter 5: Information Retrieval and Web Search
Recommendation Systems
Presentation transcript:

COMP423 Intelligent Agents

Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set of items in the past – Content based filtering (e.g SmartMuseum) Based on how well the contend of the target item matches the user’s preferred content pattern, which is learnt from the user’s own past ratings and the content pattern of the rated items. – Hybrid

User-based Collaborative Filtering Nearest Neighbor Collaborative Filtering – Calculate user similarities Pearson’s correlation – Define the effective neighborhood – Compute the predicted ratings The correlation of two users ken and lee, they both rated n items K(1..n) L (1..n) Prediction on Ken’s rating for m

Item-based Collaborative filtering The same Item rating Matrix Item vectors: the columns Item similarity – Pearson’s Correlation – Cosine similarity – Adjusted Cosine similarity

typical Collaborative Filtering Memory based collaborative filtering – Nearest-neighbor based – User similarity – Item similarity Clustering for collaborative filtering – Kmeans – HAC – Naïve Bayes clustering – Group oriented, less personalized, can be addressed by reducing cluster size

Content based filtering Content – Features: Movie: directors, actor/actress, producers., editors, distributors, editors, keywords, review, …. Text recommendation: a set of extracted keywords Classification problem

Hybrid Collaborative filtering: – Require other users rating data (cold start problem) – Can do cross domain – Non-transitive association problem: users are linked by common items and items are linked by common users. Content Based – Require one user’s rating data – Require item’s content data – Not cross domain Sequential Hybridization Combinational Hybridization

Evaluation Binary: change rates to positive or negative – Precision – Top N precision – Recall – F-measure – MAP: consider ranking, precision, recall Mean of the Average Precision for all queries Average Precision: the mean of the precision when each relevant document is retrieved. (M is the No of relevant documents) Average precision is roughly the area under the precision and recall curve

Evaluation Consider ranking score MAE: mean absolute error

projects New data, new task – Online dating recommendation – Using community data: what is popular in my peers Combine two or more areas – Personalised Search and recommendation Building profile from click through data Query expansion based on profile – Knowledge based reasoning: model of “what I need” – Decision support systems: heuristics, personality New constrains – Consider time in building user profile – Multiple profiles Children book Chinese novel

A PhD thesis Completed in 2012 User similarities – Do not consider the relevance of items – Aware of item similarity User rating and prediction – Number between 1-5 – Probability: (0.2, 0.3, 0.6, 0.1, 0.1) Hybrid – Sequential – Diamond shape

Research projects Recommender systems combined with personalized search – Building profile from click through data – Query expansion based on profile Two way recommendation – Online dating systems Knowledge-based, Personalized recommendation

Projects Image recommendation Image search without a name Computational advertising Honors project – Recommender for courses

Opinion mining Document level Sentence level Feature level

Bing Liu, UIC ACL Feature-based Summary (Hu and Liu, KDD-04) GREAT Camera., Jun 3, 2004 Reviewer: jprice174 from Atlanta, Ga. I did a lot of research last year before I bought this camera... It kinda hurt to leave behind my beloved nikon 35mm SLR, but I was going to Italy, and I needed something smaller, and digital. The pictures coming out of this camera are amazing. The 'auto' feature takes great pictures most of the time. And with digital, you're not wasting film if the picture doesn't come out. … …. Feature Based Summary : Feature1: picture Positive: 12 The pictures coming out of this camera are amazing. Overall this is a good camera with a really good picture clarity. … Negative: 2 The pictures come out hazy if your hands shake even for a moment during the entire process of taking a picture. Focusing on a display rack about 20 feet away in a brightly lit room during day time, pictures produced by this camera were blurry and in a shade of orange. Feature2: battery life …

Bing Liu, UIC ACL Visual summarization & comparison Summary of reviews of Digital camera 1 PictureBatterySizeWeightZoom + _ Comparison of reviews of Digital camera 1 Digital camera 2 _ +

Opinion mining and sentiment analysis Classification Extraction Summarization Supervised, unsupervised Corpus based, dictionary based

Opinion mining Opinion holder, object and opinions(P, N) Comparative relations – A is cheaper than B Temporal opinion mining and summarization

Projects Web of things Hardware and software Cross domain learning Personalized search learning large Knowledge base Cross checking with Cyc, wordnet Privacy

Web data mining Web Content mining Web structure mining Web usage mining

Two projects on security Intrusion detection by clustering Web log files – New similarity measure Malicious Web pages Automatic detection