A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci A P2P REcommender system based on Gossip Overlays.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Recommender Systems & Collaborative Filtering
Google News Personalization: Scalable Online Collaborative Filtering
Google News Personalization Scalable Online Collaborative Filtering
Detecting Malicious Flux Service Networks through Passive Analysis of Recursive DNS Traces Roberto Perdisci, Igino Corona, David Dagon, Wenke Lee ACSAC.
SocioPlug Polystyrene: Survivable Shape for Self-Organising Data François Taïani Joint work with: Hoel Kervadec (INSA Rennes) Simon Bouget (ENS Rennes)
20/10/2006ALPAGE1 Ordered slicing of very large scale overlay networks Mark Jelasity University of Bologna, Italy Anne-Marie Kermarrec INRIA Rennes/IRISA,
Building of P2P Overlay Networks via Voronoi and Gossip Ranieri Baraglia.
Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest Neighbor Approach.
VDR: Proactive element Conclusions VDR reaches 3.5% more nodes than VDR-R and 9% more nodes than our modified random walk routing strategy (RWR) VDR shows.
Gossip Algorithms and Implementing a Cluster/Grid Information service MsSys Course Amar Lior and Barak Amnon.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
Modeling and Analysis of Random Walk Search Algorithms in P2P Networks Nabhendra Bisnik, Alhussein Abouzeid ECSE, Rensselaer Polytechnic Institute.
Active Learning and Collaborative Filtering
Democratizing personalization Anne-Marie Kermarrec Joint work with A. Boutet, D. Frey, R. Guerraoui, A. Jégou, H. Ribeiro.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Small-world Overlay P2P Network
CSCE 715 Ankur Jain 11/16/2010. Introduction Design Goals Framework SDT Protocol Achievements of Goals Overhead of SDT Conclusion.
1 LINK STATE PROTOCOLS (contents) Disadvantages of the distance vector protocols Link state protocols Why is a link state protocol better?
1 Data Persistence in Large-scale Sensor Networks with Decentralized Fountain Codes Yunfeng Lin, Ben Liang, Baochun Li INFOCOM 2007.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
1 Collaborative Filtering and Pagerank in a Network Qiang Yang HKUST Thanks: Sonny Chee.
1 AINA 2006 Wien, April th 2006 DiVES: A DISTRIBUTED SUPPORT FOR NETWORKED VIRTUAL ENVIRONMENTS The IEEE 20th International Conference on Advanced.
Computing Trust in Social Networks
XtreemOS IP project is funded by the European Commission under contract IST-FP XtreemOS WP3.2 - T3.2.3 Scalable Directory Service Design State.
Peer-to-peer file-sharing over mobile ad hoc networks Gang Ding and Bharat Bhargava Department of Computer Sciences Purdue University Pervasive Computing.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
CS401 presentation1 Effective Replica Allocation in Ad Hoc Networks for Improving Data Accessibility Takahiro Hara Presented by Mingsheng Peng (Proc. IEEE.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Multicast Communication Multicast is the delivery of a message to a group of receivers simultaneously in a single transmission from the source – The source.
Item-based Collaborative Filtering Recommendation Algorithms
Models of Influence in Online Social Networks
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.
P2P RECOMMENDER SYSTEMS: A (SMALL) SURVEY Giulio Rossetti.
1 Distributed Monitoring of Peer-to-Peer Systems By Serge Abiteboul, Bogdan Marinoiu Docflow meeting, Bordeaux.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Focused Matrix Factorization for Audience Selection in Display Advertising BHARGAV KANAGAL, AMR AHMED, SANDEEP PANDEY, VANJA JOSIFOVSKI, LLUIS GARCIA-PUEYO,
1 BitHoc: BitTorrent for wireless ad hoc networks Jointly with: Chadi Barakat Jayeoung Choi Anwar Al Hamra Thierry Turletti EPI PLANETE 28/02/2008 MAESTRO/PLANETE.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
PRIVACY PRESERVING SOCIAL NETWORKING THROUGH DECENTRALIZATION AUTHORS: L.A. CUTILLO, REFIK MOLVA, THORSTEN STRUFE INSTRUCTOR DR. MOHAMMAD ASHIQUR RAHMAN.
Prediction Assisted Single-copy Routing in Underwater Delay Tolerant Networks Zheng Guo, Bing Wang and Jun-Hong Cui Computer Science & Engineering Department,
Content Addressable Network CAN. The CAN is essentially a distributed Internet-scale hash table that maps file names to their location in the network.
Gennaro Cordasco - How Much Independent Should Individual Contacts be to Form a Small-World? - 19/12/2006 How Much Independent Should Individual Contacts.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Kaleidoscope – Adding Colors to Kademlia Gil Einziger, Roy Friedman, Eyal Kibbar Computer Science, Technion 1.
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
1 Privacy-Enhanced Collaborative Filtering Privacy-Enhanced Personalization workshop July 25, 2005, Edinburgh, Scotland Shlomo Berkovsky 1, Yaniv Eytani.
SocialVoD: a Social Feature-based P2P System Wei Chang, and Jie Wu Presenter: En Wang Temple University, PA, USA IEEE ICPP, September, Beijing, China1.
Google News Personalization Big Data reading group November 12, 2007 Presented by Babu Pillai.
Peer-to-Peer Systems: An Overview Hongyu Li. Outline  Introduction  Characteristics of P2P  Algorithms  P2P Applications  Conclusion.
Hybrid Content and Tag-based Profiles for recommendation in Collaborative Tagging Systems Latin American Web Conference IEEE Computer Society, 2008 Presenter:
Speaker : 童耀民 MA1G /3/21 1 Authors: Phone Lin and Pai-Chun Chung, National Taiwan University Yuguang Fang, University of Florida.
CMSC 691B Multi-Agent System A Scalable Architecture for Peer to Peer Agent by Naveen Srinivasan.
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
1 “Hybrid Search Schemes for Unstructured Peer- to-Peer Networks” “Random Walks in Peer-to-Peer Networks” Christos Gkantsidis, Milena Mihail, Amin Saberi.
Talal H. Noor, Quan Z. Sheng, Lina Yao,
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
WSRec: A Collaborative Filtering Based Web Service Recommender System
Using Friendship Ties and Family Circles for Link Prediction
Providing Secure Storage on the Internet
DHT Routing Geometries and Chord
Collaborative Filtering Nearest Neighbor Approach
M.Sc. Project Doron Harlev Supervisor: Dr. Dana Ron
Effective Replica Allocation
Presentation transcript:

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci A P2P REcommender system based on Gossip Overlays (PREGO) Ranieri Baraglia, Patrizio Dazzi, Matteo Mordacchini ISTI,CNR, Pisa,Italy Laura Ricci University of Pisa,Italy 10 th IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY Bradford,UK, 29 June - 1 July, 2010

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci PRESENTATION OUTLINE Recommender Systems: general characteristics Our Approach: PREGO Gossip Protocols: basic issues PREGO – User profile definition – Similarity metrics – Architecture of the P2P overlays – Recommendations spreading Experimental Results Conclusions

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci RECOMMENDER SYSTEMS: CHARACTERISTICS Recommender systems (recommendation systems): – information filtering systems attempting to recommend information items that are likely to be of interest to the user – compare the user profile to some reference characteristics to predict the 'rating' that a user would give to an item he does not know yet Content Based Approaches – The user profile is compared with some characteristics of the item Collaborative Filtering (Social Information Filtering) ‏ – Exploits the social environment of the user – Automates the 'word-of-mouth' process: items are recommended to a user by other people with similar tastes

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci RECOMMENDER SYSTEMS: PREGO A P2P system exploiting the collaborative exchange of information between users in order to build a system able to – cluster users with similar interests – spread useful suggestions among them PREGO distinguishing features: – builds a user profile exploiting content accessed from the user (purchase items, visited pages,...). Based on the fact that a user will show interest in the future for content accessed in the past – exploits a set of gossip protocols to cluster users with similar interests – defines a distributed recommandation service no centralized authority storing users profiles and ratings: preserve user privacy no centralized/controlled recommandations: a 'democratic' suggestion system

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci GOSSIP PROTOCOLS: CENTRALIZED APPROACHES Gossip (epidemic) protocols: originally introduced in 1987 for propagating updates in loosely replicated databases to maintain replica consistency The original gossip protocol: each node – has a complete view of the nodes of the network – periodically picks a random node N from the view and exchange some data with N Main characteristics of the approach – information known by any one node of the network is spread to all other nodes with very high probability. – requires a complete knowledge of the network at each node and this is infeasible for large scale, highly dynamic networks, like P2P systems.

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci A P2P RANDOM SAMPLING SERVICE A P2P gossip protocol requires the definition of a distributed service that returns to each peer a random sample of the nodes A distributed Random Samplice Service implemented through a gossip approach: each peer P – knows a small, continuously changing set of other peers (its neighbours) defining its view – periodically chooses a random node Q from its view – P and Q exchange a set of neighbours randomly picked from their views. the continous mixing of neighbours generates random overlays The random sampling service is exploited by higher level gossiping protocols to epidemically spread information over the network.

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci A P2P RANDOM SAMPLING SERVICE w hen a peer P (ex: 2) choose a peer Q (ex: 9) to gossip, the neighbour relation between P and Q is reversed P sends its descriptor to Q, and deletes the descriptor of Q from its view

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci CYCLON: AN ENHANCED P2P SAMPLING SERVICE Cyclon [Voulgaris 2006]: A P2P random sampling service improving the quality of the overlay in terms of randomness View = set of node descriptors including – the contact information of a peer – a numeric age field: number of gossip cycles since the moment the descritor was created (descriptor age) ‏ Enhanced protocol: A peer P – select the neighbour Q with the highest age – sends its 'fresh descriptor' to Q, i.e. a descriptor of age 0 With respect to the basic gossip protocol, Cyclon – prevents links to dead nodes from lingering indefinitively – imposes a predictible lifetime to each descriptor this implies an even spreading of links across the overlay

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci OUR PROPOSAL Extract a profile for each peer according to the past interests of the user. The profile may be constructed by considering – resources recently accessed – items recently purchased – pages visited Exploit a gossip protocol to define clusters of peers characterized by similar interests – definition of a proper similarity metrics – exploits the distributed sampling service defined by Cyclon Spread recommendations of new acquired content to proper cluster of peers

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci PREGO: THE OVERLAYS A peer may be characterized by a set of interests An overlay for each interest (sport, music,films..) within a single overlay clusters correspond to highly similar peers, for instance: an overlay for peers interested in films different clusters for different film genres (action, westerns,...) ‏

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci THE DEFINITION OF THE USER PROFILE Let  be the set of items of the system and  p  the subset of items belonging to a peer p The profile  p is defined as follows:  p = { ( i, C( i ), R( i ) ) | i   p } C( i ) is the content associated with the item i R( i ) is the rating given by p to i The set of items  p can be partitioned according to the set of interests of the peer p – if the peer p has a set I p = {I p 1,..., I p k } of interests,  p (I p j ) is the set of items associated with the interest I p j Different overlays associated to different interests

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci THE DEFINTION OF A SIMILARITY METRICS Jaccard similarity. Given a pair of peer p 1 and p 2 and a pair of interests

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci THE GOSSIP PROTOCOLS Each interest based P2P overlay is constructed through a gossip protocol A two-layered gossip framework PREGO PEER SAMPLING The Sampling Layer is responsible for feeding the PREGO-layer with nodes uniformely randomly selected from the whole overlay The PREGO Layer in charge of discovering peers characterized by similar interests

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci GOSSIP PROTOCOL: THE ACTIVE THREAD

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci GOSSIP PROTOCOL: THE PASSIVE THREAD

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci SPREADING RECOMMENDATIONS when an overlay is stabilized, a peer can consider its neighbours as the representatives of a community of 'friends' with similar interests when a peer p of the community becomes acquainted with a new item h – p discovers the interest group IG corresponding to h – p forwards a recommendation for h on the overlay corresponding to IG – p exploits the similarity function to evaluate the interest of its neighbours for the item h recommendation is forwarded only iff the similarity between h and the neighbour is above a given threshold. – for instance, a new film is recommended to a neighbour iff the intersection among the genres of the film and the profile of the neighbour is above a given threshold.

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci EXPERIMENTAL ENVIRONMENT A prototype implemented by the Overlay Weaver framework – rapid prototyping – high scalability – experiments replication – possibility of monitoring the status of the network Experiments exploit the Movie-lens Data Set – A DataBase of movies used for user recommendations – Includes 1 millions ratings for 3900 movies by 6040 users – Each user rates a few movies a very sparse matrix of preferences a complex scenario to build communities – User profile: movies seen by the corresponding user

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci GOSSIP PROTOCOLS COMPARISON Vicinity, TwinFinder two similar gossip algorithm based on interest similarity Reports average similarity (Jaccard) between a peer and its neighbourhood

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci GOSSIP PROTOCOLS COMPARISON Twinfinder obtains the smallest deviation, i.e. a more uniform overlay with respect to similarity

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci COVERAGE EVALUATION Goal of the test: measuring the ability of gossip protocols to provide recommendations to interested communities Coverage: percentage of film genres which may be recommended to a user by its neighbours

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci COVERAGE EVALUATION Coverage measure for different network sizes Best result for networks including high number of nodes

A P2P REcommender system based on Gossip Overlays (PREGO) ‏ R.Baraglia, P.Dazzi M.Mordacchini, L.Ricci CONCLUSIONS Definition of a gossip algorithm for the clustering of peers characterized by similar interests Creation of communities characterized by similar interests Spreading of recommendations Future work: – definition of a distributed algorithm to detect a profile characterizing a community of peers distributed voting algorithms to detect the 'most representative peer' of a community – investigating the structured overlays (DHT) approach to register community profiles – LSH (Locality Sensitive Hashing) to support the search of community with profiles similar to that of a given user