Preferential Publish/Subscribe Marina Drosou Department of Computer Science University of Ioannina, Greece Joint work with Evaggelia Pitoura and Kostas.

Slides:



Advertisements
Similar presentations
Google News Personalization: Scalable Online Collaborative Filtering
Advertisements

Constraint Based Reasoning over Mutex Relations in Graphplan Algorithm Pavel Surynek Charles University, Prague Czech Republic.
Fast Algorithms For Hierarchical Range Histogram Constructions
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Preference-Aware Publish/Subscribe Delivery with Diversity Marina Drosou Department of Computer Science University of Ioannina, Greece Joint work with.
How-To & Search Strategies April 16 th, Contents Using Grant Forward Search Results Filtering your Search – Keywords – Categories – Sponsors – Deadlines.
DSPIN: Detecting Automatically Spun Content on the Web Qing Zhang, David Y. Wang, Geoffrey M. Voelker University of California, San Diego 1.
CS345 Data Mining Page Rank Variants. Review Page Rank  Web graph encoded by matrix M N £ N matrix (N = number of web pages) M ij = 1/|O(j)| iff there.
On Novelty in Publish/Subscribe Delivery Kostas Stefanidis * Dept. of Computer Science and Engineering, Chinese University of Hong Kong, Hong Kong *Work.
1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.
Suggestion of Promising Result Types for XML Keyword Search Joint work with Jianxin Li, Chengfei Liu and Rui Zhou ( Swinburne University of Technology,
Subscription Subsumption Evaluation for Content-Based Publish/Subscribe Systems Hojjat Jafarpour, Bijit Hore, Sharad Mehrotra, and Nalini Venkatasubramanian.
Carnegie Mellon University Complex queries in distributed publish- subscribe systems Ashwin R. Bharambe, Justin Weisz and Srinivasan Seshan.
Design and Evaluation of a Wide-Area Event Notification Service Antonio Carzaniga David S. Rosenblum Alexander L. Wolf.
Preference-Aware Publish/Subscribe Delivery with Diversity Marina Drosou Master Thesis Supervisor: Evaggelia Pitoura.
1 AINA 2006 Wien, April th 2006 DiVES: A DISTRIBUTED SUPPORT FOR NETWORKED VIRTUAL ENVIRONMENTS The IEEE 20th International Conference on Advanced.
Randomized Planning for Short Inspection Paths Tim Danner Lydia E. Kavraki Department of Computer Science Rice University.
“You May Also Like” Results in Relational Databases Kostas Stefanidis Department of Computer Science University of Ioannina, Greece Joint work with Marina.
SEQUOIAS YR-SOC'07 - Leicester June A NOVEL APPROACH TO WEB SERVICES DISCOVERY Marco Comerio Università di Milano-Bicocca
ReDrive: Result-Driven Database Exploration through Recommendations Marina Drosou, Evaggelia Pitoura Computer Science Department University of Ioannina.
Lecture 11. Matching A set of edges which do not share a vertex is a matching. Application: Wireless Networks may consist of nodes with single radios,
Background Notification services in LAN Provides Notification Selection Notification Delivery Done on a centralized server (hence not scalable) Challenge.
Marina Drosou Department of Computer Science University of Ioannina, Greece Thesis Advisor: Evaggelia Pitoura
Christos Tryfonopoulos MPII Saarbrücken David Midgley INSEAD Fontainebleau Pricing Information Goods in an Agent-based Information Filtering System Laura.
CHAPTER 5: CONSTRUCTING OPEN- AND CLOSED-ENDED QUESTIONS Damon Burton University of Idaho University of Idaho.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Mining Association Rules of Simple Conjunctive Queries Bart Goethals Wim Le Page Heikki Mannila SIAM /8/261.
Achieving fast (approximate) event matching in large-scale content- based publish/subscribe networks Yaxiong Zhao and Jie Wu The speaker will be graduating.
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
On P2P Collaboration Infrastructures Manfred Hauswirth, Ivana Podnar, Stefan Decker Infrastructure for Collaborative Enterprise, th IEEE International.
MIDDLEWARE SYSTEMS RESEARCH GROUP Denial of Service in Content-based Publish/Subscribe Systems M.A.Sc. Candidate: Alex Wun Thesis Supervisor: Hans-Arno.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Probabilistic Ranking of Database Query Results Surajit Chaudhuri, Microsoft Research Gautam Das, Microsoft Research Vagelis Hristidis, Florida International.
Clustering-based Collaborative filtering for web page recommendation CSCE 561 project Proposal Mohammad Amir Sharif
1 Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007)
Query Routing in Peer-to-Peer Web Search Engine Speaker: Pavel Serdyukov Supervisors: Gerhard Weikum Christian Zimmer Matthias Bender International Max.
Lecture 10: Class Review Dr John Levine Algorithms and Complexity March 13th 2006.
1 Efficient Search Ranking in Social Network ACM CIKM2007 Monique V. Vieira, Bruno M. Fonseca, Rodrigo Damazio, Paulo B. Golgher, Davi de Castro Reis,
Querying Structured Text in an XML Database By Xuemei Luo.
Searching for Extremes Among Distributed Data Sources with Optimal Probing Zhenyu (Victor) Liu Computer Science Department, UCLA.
April 14, 2003Hang Cui, Ji-Rong Wen and Tat- Seng Chua 1 Hierarchical Indexing and Flexible Element Retrieval for Structured Document Hang Cui School of.
 2003 Prentice Hall, Inc. All rights reserved. CHAPTER 3 JavaScript 1.
Chapter 8: Arrays.
Distributed Maintenance of Cache Freshness in Opportunistic Mobile Networks Wei Gao and Guohong Cao Dept. of Computer Science and Engineering Pennsylvania.
Reverse Top-k Queries Akrivi Vlachou *, Christos Doulkeridis *, Yannis Kotidis #, Kjetil Nørvåg * *Norwegian University of Science and Technology (NTNU),
CS 533 Information Retrieval Systems.  Introduction  Connectivity Analysis  Kleinberg’s Algorithm  Problems Encountered  Improved Connectivity Analysis.
Dynamic Diversification of Continuous Data Marina Drosou and Evaggelia Pitoura Computer Science Department University of Ioannina, Greece
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Early Profile Pruning on XML-aware Publish- Subscribe Systems Mirella M. Moro, Petko Bakalov, Vassilis J. Tsotras University of California VLDB 2007 Presented.
Marina Drosou, Evaggelia Pitoura Computer Science Department
Content-based Publish-Subscribe Over Structured P2P Networks Peter Triantafillou and Ioannis Aekaterinidis Presented by Jesse Chen 4/10/07.
Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering Mark A. Greenwood and Robert Gaizauskas Natural Language.
Information-Centric Networks10b-1 Week 10 / Paper 2 Hermes: a distributed event-based middleware architecture –P.R. Pietzuch, J.M. Bacon –ICDCS 2002 Workshops.
2015/12/251 Hierarchical Document Clustering Using Frequent Itemsets Benjamin C.M. Fung, Ke Wangy and Martin Ester Proceeding of International Conference.
Information-Centric Networks Section # 10.2: Publish/Subscribe Instructor: George Xylomenos Department: Informatics.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED.
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Distributed Ranked Data Dissemination in Social Networks Joint work with: Mo Sadoghi Vinod Muthusamy Hans-Arno.
Leveraging Knowledge Bases for Contextual Entity Exploration Categories Date:2015/09/17 Author:Joonseok Lee, Ariel Fuxman, Bo Zhao, Yuanhua Lv Source:KDD'15.
03/02/20061 Evaluating Top-k Queries Over Web-Accessible Databases Amelie Marian Nicolas Bruno Luis Gravano Presented By: Archana and Muhammed.
On Frequent Chatters Mining Claudio Lucchese 1 st HPC Lab Workshop 6/15/12 1st HPC Workshp - Claudio Lucchese.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
INFORMATION RETRIEVAL MEASUREMENT OF RELEVANCE EFFECTIVENESS 1Adrienn Skrop.
Computing Full Disjunctions
Algorithms Chapter 3 With Question/Answer Animations
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
Storing and Replication in Topic-Based Pub/Sub Networks
Introduction to Computer Science
Presentation transcript:

Preferential Publish/Subscribe Marina Drosou Department of Computer Science University of Ioannina, Greece Joint work with Evaggelia Pitoura and Kostas Stefanidis

PersDB Auckland Publish/Subscribe is an alternative to typical searching Users do not need to repeatedly search for new interesting data They specify their interests once and the system automatically notifies them whenever relevant data is made available Example: ▫Google Alerts Publish/Subscribe Systems DMOD Laboratory, University of Ioannina 2

PersDB Auckland Parts of a Publish/Subscribe system: ▫Subscribers: consumers of events ▫Publishers: generators of events ▫Event-notification service:  Store subscriptions (user interests)  Match events to subscriptions  Deliver event notifications Publish/Subscribe Systems 3 DMOD Laboratory, University of Ioannina

PersDB Auckland Publish/Subscribe Example DMOD Laboratory, University of Ioannina 4 title = The Godfather genre = drama showing time = 21:10 title = Ratatouille genre = comedy showing time = 21:15 title = Fight Club genre = drama showing time = 23:00 title = Casablanca genre = drama showing time = 23:10 title = Vertigo genre = drama showing time = 23:20 title = The Godfather genre = drama showing time = 21:10 title = Fight Club genre = drama showing time = 23:00 title = Casablanca genre = drama showing time = 23:10 title = Vertigo genre = drama showing time = 23:20 Published events Matching events User subscriptions genre = drama genre = horror

PersDB Auckland Typically, all subscriptions are considered equally important Users may receive overwhelming amounts of notifications In such cases, users would like to receive only a fraction of those notifications, the most interesting ones Say John is more interested in horror movies than comedies John would like to receive notifications about comedies only if there are no (or just a few) notifications about horror movies Current publish/subscribe systems do not allow John to express this different degree of interest Motivation DMOD Laboratory, University of Ioannina 5

PersDB Auckland To express some form of ranking among subscriptions, we define priorities among them To do this, we use preferences among subscriptions (preferential subscriptions) Based on preferential subscriptions, we deliver to users only the k most interesting events Motivation DMOD Laboratory, University of Ioannina 6

PersDB Auckland Publish/Subscribe Preliminaries Preferential Subscriptions & Time-Valid Notifications Ranking in Publish/Subscribe Evaluation Summary Outline DMOD Laboratory, University of Ioannina 7

PersDB Auckland Publish/Subscribe Preliminaries Preferential Subscriptions & Time-Valid Notifications Ranking in Publish/Subscribe Evaluation Summary Outline DMOD Laboratory, University of Ioannina 8

PersDB Auckland There are two kind of used schemes for specifying interesting events: ▫Topic-based  Each event belongs to a number of topics (e.g. “music”, “sport”)  Users subscribe to topics and receive all relevant events ▫Content-based  Users subscribe to the actual content of the events  More expressive In this work we use the content-based scheme Publish/Subscribe Variations DMOD Laboratory, University of Ioannina 9

PersDB Auckland A notification is a set of typed attributes consisting of: ▫A type ▫A name ▫A value Notifications DMOD Laboratory, University of Ioannina 10 stringtitle = LOTR: The Return of the King string director = Peter Jackson time release date = 1 Dec 2003 string genre = fantasy integer oscars = 11

PersDB Auckland A subscription is a set of typed attribute constraints consisting of: ▫A type ▫A name ▫A binary operator ▫A value Subscriptions DMOD Laboratory, University of Ioannina 11 string director = Peter Jackson time release date > 1 Jan 2003

PersDB Auckland string director = Steven Spielberg string genre = fantasy string release date > 1 Jan 2003 Given a notification n and a subscription s, s covers n (or n matches s) if and only if every attribute constraint of s is satisfied by some attribute of n Cover Relation DMOD Laboratory, University of Ioannina 12 string title = LOTR: The Return of the King string director = Peter Jackson time release date = 1 Dec 2003 string genre = fantasy integer oscars = 11 Event notification Subscriptions string director = Peter Jackson time release date > 1 Jan 2003

PersDB Auckland Publish/Subscribe Preliminaries Preferential Subscriptions & Time-Valid Notifications Ranking in Publish/Subscribe Evaluation Summary Outline DMOD Laboratory, University of Ioannina 13

PersDB Auckland To define priorities among subscriptions: add preferences Two ways to express preferences: ▫Quantitative approach  Preferences are expressed by using scoring functions ▫Qualitative approach  Preferences are expressed by using preference relations between pairs of subscriptions Preferential Subscriptions DMOD Laboratory, University of Ioannina 14 genre = dramagenre = horror ≻ 0.7 genre = drama 0.9

PersDB Auckland A preferential subscription is a subscription enhanced with a numeric score within the range [0, 1] preferential subscription = subscription + score Preferential Subscriptions DMOD Laboratory, University of Ioannina 15 string director = Peter Jackson time release date > 1 Jan

PersDB Auckland Given a set of user preferential subscriptions and a published notification ▫Find out how important the notification is to the user How? ▫The notification score is computed based on the scores of the matching subscriptions ▫In the case of one matching subscription:  Notification score = subscription score Notification Score DMOD Laboratory, University of Ioannina 16

PersDB Auckland For user X, a notification n has score: sc(n, X) = max {score 1, …, score m } where score 1, …, score m are the scores of X’s preferential subscriptions that cover n Notification Score DMOD Laboratory, University of Ioannina 17 genre = adventure 0.9 director = Peter Jackson 0.7 string title = King Kong string director = Peter Jackson time release date = 14 Dec 2005 string genre = adventure 0.9

PersDB Auckland Based on the user’s preferential subscriptions, we rank published notifications and deliver only the top-k, i.e. the notifications with the k highest scores Publication of events is continuous A newly published notification n is delivered to a user X, if and only if: ▫It is covered by some subscription s issued by X and ▫X has not already received k notifications more preferable to n Top-k Notifications DMOD Laboratory, University of Ioannina 18

PersDB Auckland New notifications are constantly produced It is possible for very old but highly preferable notifications to prevent newer ones from reaching the user Stale Notifications DMOD Laboratory, University of Ioannina 19

PersDB Auckland Example DMOD Laboratory, University of Ioannina 20 title = The Godfather genre = drama showing time = 21:10 title = Ratatouille genre = comedy showing time = 21:15 title = Fight Club genre = drama showing time = 21:20 title = Casablanca genre = drama showing time = 21:25 title = Vertigo genre = drama showing time = 23:20 Published events Top-2 events User subscriptions title = The Apartment genre = comedy showing time = 21:00 genre = comedy 0.9 genre = drama 0.8 title = The Apartment genre = comedy showing time = 21:00 title = The Godfather genre = drama showing time = 21:10 title = Ratatouille genre = comedy showing time = 21:15 20:00 20:10 20:15 20:20 20:25 22:

PersDB Auckland Old but highly preferable notifications may prevent newer notifications from reaching the user To overcome this problem, we use the notion of time validity: ▫Notifications are associated with an expiration time ▫Only valid notifications can prevent others from reaching the user A newly published notification n is delivered to a user X, if and only if: ▫It is covered by some subscription s issued by X and ▫X’s top-k results do not contain k valid notifications more preferable to n Time-Valid Notifications DMOD Laboratory, University of Ioannina 21

PersDB Auckland Example DMOD Laboratory, University of Ioannina 22 title = The Godfather genre = drama showing time = 21:10 title = Ratatouille genre = comedy showing time = 21:15 title = Fight Club genre = drama showing time = 21:20 title = Casablanca genre = drama showing time = 21:25 title = Vertigo genre = drama showing time = 23:20 Published events Top-2 events User subscriptions title = The Apartment genre = comedy showing time = 21:00 genre = comedy 0.9 genre = drama 0.8 title = The Apartment genre = comedy showing time = 21:00 title = The Godfather genre = drama showing time = 21:10 title = Ratatouille genre = comedy showing time = 21:15 20:00 20:10 20:15 20:20 20:25 22:20 title = Vertigo genre = drama showing time = 23: Notifications expire at their showing time

PersDB Auckland Publish/Subscribe Preliminaries Preferential Subscriptions & Time-Valid Notifications Ranking in Publish/Subscribe Preferential Subscription Graph Forwarding Notifications Evaluation Summary Outline DMOD Laboratory, University of Ioannina 23

PersDB Auckland Whenever a new notification n is published: ▫Locate users with matching subscriptions ▫Check whether n belongs to their top-k valid results  If so, forward n Match Notifications to Subscriptions DMOD Laboratory, University of Ioannina 24

PersDB Auckland How does the event-notification service carry out the matching process? Notifications need to be checked against user subscriptions ▫All of them? Exploit cover relations between subscriptions Match Notifications to Subscriptions DMOD Laboratory, University of Ioannina 25 string director = Peter Jackson string genre = fantasy

PersDB Auckland We organize preferential subscriptions using a directed acyclic graph Preferential subscription graph (PSG): ▫Nodes correspond to subscriptions ▫Edges correspond to cover relations between subscriptions ▫A score set is assigned to each subscription  Pairs of the form (user, score) Preferential Subscription Graph DMOD Laboratory, University of Ioannina 26

PersDB Auckland Preferential subscription graph (PSG): ▫Nodes correspond to subscriptions ▫Edges correspond to cover relations between subscriptions ▫A score set is assigned to each subscription  Pairs of the form (user, score) Preferential Subscription Graph DMOD Laboratory, University of Ioannina 27 cinema = ster John, 0.5 genre = drama showing time > 21:00 John, 0.7 cinema = ster genre = drama showing time > 21:00 John, 0.9 Anna, 0.6 genre = drama Anna, 0.5

PersDB Auckland Publish/Subscribe Preliminaries Preferential Subscriptions & Time-Valid Notifications Ranking in Publish/Subscribe Preferential Subscription Graph Forwarding Notifications Evaluation Summary Outline DMOD Laboratory, University of Ioannina 28

PersDB Auckland A server maintains: ▫A Preferential Subscription Graph ▫A list of k elements of the form (score, expiration) for each user j (list j ) Each such element represents a notification previously forwarded to the user: ▫score is the notification’s score ▫expiration is the notification’s expiration time Forwarding Notifications DMOD Laboratory, University of Ioannina 29

PersDB Auckland Initially, all lists are empty Upon receiving a notification n Walk through PSG and locate all subscriptions that cover n For each user j associated with at least one such subscription Compute n’s notification score sc(n,j) for j Remove expired elements from list j If | list j | minimum score in list j Add (sc(n,j), n.expiration) to list j Forward n to j Forwarding Notifications DMOD Laboratory, University of Ioannina 30

PersDB Auckland Forwarding Notifications DMOD Laboratory, University of Ioannina 31 cinema = ster John, 0.5 genre = drama showing time > 21:00 John, 0.7 Anna, 0.6 cinema = ster genre = drama showing time > 21:00 John, 0.9 Anna, 0.7 genre = drama Anna, 0.5 John 0.910: :00 Anna 0.711: :00 Current time: 08:00 Published notification n: title = The Godfather genre = drama cinema = odeon showing time = 21:10 expiration 21:10 sc(n, John) = 0.7 sc(n, Anna) = :10 Top-2 lists

PersDB Auckland Up to now, we have used the quantitative approach to express preferences ▫Subscriptions augmented with interest scores The qualitative approach can also be used ▫Expressing priorities through preference relations Preferential Subscriptions DMOD Laboratory, University of Ioannina 32 genre = dramagenre = horror ≻ 0.7 genre = drama 0.9

PersDB Auckland Assume a set of priority conditions among a user’s subscriptions We use the winnow operator to extract the most preferable subscriptions, i.e. the subscriptions that are not covered by any other subscription Qualitative Approach DMOD Laboratory, University of Ioannina 33 genre = dramagenre = horror ≻ genre = comedygenre = romance ≻ genre = action ≻ genre = dramagenre = comedy genre = romancegenre = horror genre = action

PersDB Auckland Instead of a score, each subscription is now associated with a rank Qualitative Approach DMOD Laboratory, University of Ioannina 34 genre = dramagenre = comedy genre = romancegenre = horror genre = action rank = 1 rank = 2 rank = 3

PersDB Auckland Qualitative Approach DMOD Laboratory, University of Ioannina 35 cinema = ster John, 3 genre = drama showing time > 21:00 John, 2 Anna, 2 cinema = ster genre = drama showing time > 21:00 John, 1 Anna, 1 genre = drama Anna, 3 The maintained lists now contain elements of the form (rank, expiration) rank(n, X) = min{rank 1, …, rank m } In this case, subscriptions are augmented with a rank instead of a numeric score

PersDB Auckland Publish/Subscribe Preliminaries Preferential Subscriptions & Time-Valid Notifications Ranking in Publish/Subscribe Evaluation Summary Outline DMOD Laboratory, University of Ioannina 36

PersDB Auckland We have fully implemented our approach ▫PrefSIENA: PrefSIENA is an extension of SIENA, a multi-threaded, distributed publish/subscribe system Evaluation DMOD Laboratory, University of Ioannina 37

PersDB Auckland Real movie dataset, derived from IMDB. Data about movies: ▫Title, year, budget, length, rating, MPAA, genre We measure the number of notifications delivered to the users: ▫Out goal is to deliver a portion of the matching notifications, the most interesting for each user This number depends on k, the expiration time and the order of the published events Experimental Setup DMOD Laboratory, University of Ioannina 38

PersDB Auckland 100 matching notifications scenario 1: highly preferable notifications first scenario 2: highly preferable notifications last τ as a function of the time length t = 500ms between publications Notifications forwarded to a specific user DMOD Laboratory, University of Ioannina 39 Scenario 1 and larger expiration time result in more pruning

PersDB Auckland Publish/Subscribe Preliminaries Preferential Subscriptions & Time-Valid Notifications Ranking in Publish/Subscribe Evaluation Summary Outline DMOD Laboratory, University of Ioannina 40

PersDB Auckland We extend publish/subscribe systems to incorporate ranking capabilities Notifications are ranked based on preferential subscriptions To maintain the freshness of data, we associate expiration times with notifications Preferential subscriptions are organized in a graph which is utilized to forward notifications to the users We have fully implemented our approach (PrefSIENA) Summary DMOD Laboratory, University of Ioannina 41

PersDB Auckland Other ways of computing notification scores: ▫Mean, minimum, weighted sum ▫Using the scores of the most specific preferential subscriptions Alternatives to expiration time Future work DMOD Laboratory, University of Ioannina 42 genre = adventure 0.9 director = Peter Jackson genre = adventure 0.7 string title = King Kong string director = Peter Jackson time release date = 14 Dec 2005 string genre = adventure 0.7

PersDB Auckland Kresimir Pripuzic, Ivana Podnar Zarko, Karl Aberer “Top-k/w Publish/Subscribe: Finding k Most Relevant Publications in Sliding Time Window w” 2nd International Conference on Distributed Event-Based Systems, Rome, July 1-4, 2008 Ashwin Machanavajjhala, Erik Vee, Minos Garofalakis, Jayavel Shanmugasundaram “Scalable Ranked Publish/Subscribe” 34th International Conference on Very Large Data Bases, Auckland, August 23-28, 2008 Christian Zimmer, Christos Tryfonopoulos, Klaus Berberich, Gerhard Weikum, Manolis Koubarakis “Node Behavior Prediction for LargeScale Approximate Information Filtering” 1st Workshop on Large-Scale Distributed Systems for Information Retrieval, Amsterdam, July 27, 2007 Related Work DMOD Laboratory, University of Ioannina 43

PersDB Auckland Thank you! DMOD Laboratory, University of Ioannina 44

PersDB Auckland Matching notifications: ▫User 1: 51% ▫User 2: 27% ▫User 3: 15% k = 1 t = 500ms Number of forwarded notifications DMOD Laboratory, University of Ioannina 45 Larger expiration times result in more pruning for all users