1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong.

Slides:



Advertisements
Similar presentations
Item Based Collaborative Filtering Recommendation Algorithms
Advertisements

Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.
Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest Neighbor Approach.
Exploring Latent Features for Memory- Based QoS Prediction in Cloud Computing Yilei Zhang, Zibin Zheng, and Michael R. Lyu
Oct 14, 2014 Lirong Xia Recommender systems acknowledgment: Li Zhang, UCSC.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
Collaborative filtering with ordinal scale-based implicit ratings for mobile music recommendations S.-K. Lee et al., KAIST,
The Wisdom of the Few A Collaborative Filtering Approach Based on Expert Opinions from the Web Xavier Amatriain Telefonica Research Nuria Oliver Telefonica.
Intro to RecSys and CCF Brian Ackerman 1. Roadmap Introduction to Recommender Systems & Collaborative Filtering Collaborative Competitive Filtering 2.
Memory-Based Recommender Systems : A Comparative Study Aaron John Mani Srinivasan Ramani CSCI 572 PROJECT RECOMPARATOR.
EigenTaste: A Constant Time Collaborative Filtering Algorithm Ken Goldberg Students: Theresa Roeder, Dhruv Gupta, Chris Perkins Industrial Engineering.
1 Collaborative Filtering and Pagerank in a Network Qiang Yang HKUST Thanks: Sonny Chee.
1 Collaborative Filtering Rong Jin Department of Computer Science and Engineering Michigan State University.
Collaborative Filtering CMSC498K Survey Paper Presented by Hyoungtae Cho.
Algorithms for Efficient Collaborative Filtering Vreixo Formoso Fidel Cacheda Víctor Carneiro University of A Coruña (Spain)
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Chapter 12 (Section 12.4) : Recommender Systems Second edition of the book, coming soon.
Item-based Collaborative Filtering Recommendation Algorithms
Performance of Recommender Algorithms on Top-N Recommendation Tasks
Performance of Recommender Algorithms on Top-N Recommendation Tasks RecSys 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering.
Liang Xiang, Quan Yuan, Shiwan Zhao, Li Chen, Xiatian Zhang, Qing Yang and Jimeng Sun Institute of Automation Chinese Academy of Sciences, IBM Research.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Item Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karpis, Joseph KonStan, John Riedl (UMN) p.s.: slides adapted from:
Presented by Tienwei Tsai July, 2005
Wancai Zhang, Hailong Sun, Xudong Liu, Xiaohui Guo.
Improved Gene Expression Programming to Solve the Inverse Problem for Ordinary Differential Equations Kangshun Li Professor, Ph.D Professor, Ph.D College.
Clustering-based Collaborative filtering for web page recommendation CSCE 561 project Proposal Mohammad Amir Sharif
A Hybrid Recommender System: User Profiling from Keywords and Ratings Ana Stanescu, Swapnil Nagar, Doina Caragea 2013 IEEE/WIC/ACM International Conferences.
EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.
Toward the Next generation of Recommender systems
1 Recommender Systems Collaborative Filtering & Content-Based Recommending.
1 Social Networks and Collaborative Filtering Qiang Yang HKUST Thanks: Sonny Chee.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
EigenRank: A Ranking-Oriented Approach to Collaborative Filtering IDS Lab. Seminar Spring 2009 강 민 석강 민 석 May 21 st, 2009 Nathan.
Badrul M. Sarwar, George Karypis, Joseph A. Konstan, and John T. Riedl
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
EigenRank: A ranking oriented approach to collaborative filtering By Nathan N. Liu and Qiang Yang Presented by Zachary 1.
Improving Recommendation Lists Through Topic Diversification CaiNicolas Ziegler, Sean M. McNee,Joseph A. Konstan, Georg Lausen WWW '05 報告人 : 謝順宏 1.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction Jieming Zhu, Yu Kang, Zibin Zheng and Michael R. Lyu The.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
Marin Silic, Goran Delac and Sinisa Srbljic Prediction of Atomic Web Services Reliability Based on K-means Clustering Consumer Computing Laboratory Faculty.
Collaborative Filtering Zaffar Ahmed
Pearson Correlation Coefficient 77B Recommender Systems.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Recommendation Algorithms for E-Commerce. Introduction Millions of products are sold over the web. Choosing among so many options is proving challenging.
A Clustering-based QoS Prediction Approach for Web Service Recommendation Shenzhen, China April 12, 2012 Jieming Zhu, Yu Kang, Zibin Zheng and Michael.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
ICONIP 2010, Sydney, Australia 1 An Enhanced Semi-supervised Recommendation Model Based on Green’s Function Dingyan Wang and Irwin King Dept. of Computer.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
Experimental Study on Item-based P-Tree Collaborative Filtering for Netflix Prize.
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
Reputation-aware QoS Value Prediction of Web Services Weiwei Qiu, Zhejiang University Zibin Zheng, The Chinese University of HongKong Xinyu Wang, Zhejiang.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
The Wisdom of the Few Xavier Amatrian, Neal Lathis, Josep M. Pujol SIGIR’09 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Collaborative Filtering: Searching and Retrieving Web Information Together Huimin Lu December 2, 2004 INF 385D Fall 2004 Instructor: Don Turnbull.
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
ItemBased Collaborative Filtering Recommendation Algorithms 1.
Item-Based Collaborative Filtering Recommendation Algorithms
Collaborative Filtering With Decoupled Models for Preferences and Ratings Rong Jin 1, Luo Si 1, ChengXiang Zhai 2 and Jamie Callan 1 Language Technology.
Hao Ma, Dengyong Zhou, Chao Liu Microsoft Research Michael R. Lyu
WSRec: A Collaborative Filtering Based Web Service Recommender System
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.
Adopted from Bin UIC Recommender Systems Adopted from Bin UIC.
Collaborative Filtering Nearest Neighbor Approach
Movie Recommendation System
ITEM BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHEMS
Collaborative Filtering Non-negative Matrix Factorization
Huifeng Sun 1, Zibin Zheng 2, Junliang Chen 1, Michael R. Lyu 2
Presentation transcript:

1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong Sun School of Computer Science and Engineering Beihang University Beijing, China

2 Outline Introduction Motivation RegionKNN Algorithm Experiments Conclusion and Future Work

3 1. Introduction

4 Introduction Current situation –More than 25,000 public available services (seekda.com) –About 200,000 related documents Goal of service recommendation –Optimal QoS –User preference Current method: Collaborative Filtering (CF) –predict and recommend the potential favorite items for a particular user by using rating data collected from similar users. If Alice and Bob both like X and Alice likes Y then Bob is more likely to like Y Problems –Characteristics of QoS are neglected –Online performance need to be improved

5 2. Motivation

6 A Motivating Scenario Some QoS properties (e.g. availability, response time) highly correlate to users’ physical locations. Filtering WS Filtering WS

7 3. RegionKNN Algorithm

8 What’s RegionKNN Hybrid CF Algorithm –recommend web services with optimal QoS to the active user with consideration of the region factor Two phases of RegionKNN –Region model building (offline) Region-sensitive services identification Region aggregation –Service recommendation (online) (modified KNN) Neighbor selection QoS Prediction I take response time/round trip time (RTT) as an example to describe our algorithm

9 3.1 Region model

10 Region Model Region –a group of users who are closely located with each other and have similar RTT values u5u5 u 19 u 2 u 22, u 8 u 1, u 3 Service A Service B Service X

11 Input Dataset User-Service RTT Matrix: m services, n users The set of non-zero RTTs of service s {R 1 (s), R 2 (s),…, R k (s)} collected from all users is a sample from population R. s1s1 s2s2 … smsm u1u … u2u …539 …………… unun … 498 RTT is much longer than others

12 Region-sensitive Services Identification To estimate the mean μ and the standard deviation σ of R, we use: Median: the numeric value separating the higher half of a sample from the lower half. e.g. {120, 128, 200, 250, 258, 2000, 3500} median = 250 MAD: the Median of the Absolute Deviations from the sample's median. e.g. {120, 128, 200, 250, 258, 2000, 3500}  {8, 50, 122, 130, 1750, 2250} MAD = 130

13 Region-Sensitive Service –Let R = {R 1 (s), R 2 (s),…, R k (s)} be the set of RTTs of service s provided by users from all regions. Service s is a sensitive service to region M iff {120, 128, 200, 250, 258, 2000, 3500} u 1 u 3 u 5 u 19 u 2 u 22 u 8 u5u5 u 19 u 2 u 22 u 8 u 1, u3 Service A Region-sensitive services Identification

14 Definition Region Sensitivity Sensitive Region –Region M is a sensitive region iff regSen >λ. Region center – the median vector of all the RTT vectors provided by users in a region

15 Region Aggregation Why? –Users only provide limited number of QoS values, the sparse dataset always leads to poor recommendation. How? –It treats users with similar IP addresses as a region at the outset – In each iteration, the two most similar and non- sensitive regions are selected and aggregated, if their similarity exceeds threshold μ. –It executes at most N-1 steps (N is the number of regions at the outset), in case that all regions are non- sensitive, extremely correlates to each other and finally aggregates into one region.

16 Region Similarity The similarity between region M and N is measured by the similarity of the two centers. Similarity by Pearson Correlation Coefficient (PCC) s1s1 s2s2 s3s3 s4s4 s5s5 cmcm cncn By PCC, the similarity is of the two regions is 1

17 Region Similarity PCC often overestimates the similarity when the two regions have few co-invoked services. To adjust it, we use: s1s1 s2s2 s3s3 s4s4 s5s5 cmcm cncn By adjustment, the similarity of the two regions is 0.2

Service Recommendation

19 Neighbor Selection Neighbors: users with similar QoS experiences Advantages of region-based neighbor selection –Do not need to search the entire dataset, thousands of users are clustered into a certain number of regions –The feature of the group of users in a region is represented by the region center

20 QoS Prediction To calculate the RTT prediction for the active user u and service s i Get the active user’s IP address and find the region the user belongs to. If no appropriate region is found, the active user will be treated as a member of a new region. Identify whether service s i is sensitive to the specific region. If it is region-sensitive, then the prediction is generated from the region center:

21 QoS Prediction (cont.) Otherwise, use adjusted PCC to compute the similarity between the active user and each region center that has evaluated service s i, and find up to k most similar centers {c 1, c 2,…, c k }. If the active user’s region center has the RTT value of s i, i.e., the prediction is computed using the equation:

22 QoS Prediction (cont.) Otherwise, Previous CF-based web service recommendation algorithms use the following equation, to predict the missing QoS value. This equation is based on the assumption that each user’s rating range is subjective and comparatively fixed, while it is not applicable in our context.

23 Time complexity Model building (offline) –The time complexity of region aggregation algorithm is O(N 2 logN), and N is the number of regions at the outset. QoS prediction (online) – Let l be the number of regions, m the number of web services, and n the number of users. In the online part, O(l) similarity weight calculations are needed, each of which takes O(m) time. Therefore, the online time- complexity is O(lm)≈O(m). Previous user-based CF algorithm has O(mn) online time complexity.

24 4. Experiments

25 Experiments Dataset –a subset of WSRec with 300,000 RTT records –3000 users –100 services Evaluation Metric – R u (s) denotes the actual RTT of web service s given by user u – denotes the predicted one – L denotes the number of tested services Dataset:

26 MAE Performance

27 Impact of λ and μ

28 Impact of neighborhood size K

29 Impact of Data Sparsity

30 5.Conclustion and Future Work

31 Conclusion and Future Work Conclusion –a new region model for clustering users and identifying region-sensitive web services –a hybrid model-based and memory-based CF algorithm for web service recommendation, which significantly improves the recommendation accuracy –We demonstrate RegionKNN’s scalability advantage over traditional CF algorithms via time-complexity analysis Future Work –Investigation of more QoS properties and their variation with time –Internal relations between QoS properties

32