Referral Systems Formulation and Emergent Properties Munindar P. Singh (joint work with Bin Yu, pInar Yolum (mainly), Yathi Udupi) Department of Computer.

Slides:



Advertisements
Similar presentations
Research Issues in Web Services CS 4244 Lecture Zaki Malik Department of Computer Science Virginia Tech
Advertisements

Distributed Advice-Seeking on an Evolving Social Network Dept Computer Science and Software Engineering The University of Melbourne - Australia Golriz.
1 The PageRank Citation Ranking: Bring Order to the web Lawrence Page, Sergey Brin, Rajeev Motwani and Terry Winograd Presented by Fei Li.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
8.2 Discretionary Access Control Models Weiling Li.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Apr 30, 2002Mårten Trolin1 Previous lecture – passwords Passwords for authentication –Storing hashed passwords –Use of salt Passwords for key generation.
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Expertise Networks in Online Communities: Structure and Algorithms Jun Zhang Mark S. Ackerman Lada Adamic University of Michigan WWW 2007, May 8–12, 2007,
Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.
Chapter 20: Social Service Selection Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Integration of Applications MIS3502: Application Integration and Evaluation Paul Weinberg Adapted from material by Arnold Kurtz, David.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Link Structure and Web Mining Shuying Wang
Recommender systems Ram Akella November 26 th 2008.
Text-Based Content Search and Retrieval in ad hoc P2P Communities Francisco Matias Cuenca-Acuna Thu D. Nguyen
Peer-to-peer archival data trading Brian Cooper and Hector Garcia-Molina Stanford University.
Improving Data Access in P2P Systems Karl Aberer and Magdalena Punceva Swiss Federal Institute of Technology Manfred Hauswirth and Roman Schmidt Technical.
Yimam & Kobsa July 13, 2000TWIST 2000 Centralization vs. Decentralization Issues in Internet-based KMS: Experiences from Expertise Recommender Systems.
Building a Strong Foundation for a Future Internet Jennifer Rexford ’91 Computer Science Department (and Electrical Engineering and the Center for IT Policy)
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Web Data Mining and Applications Part I
CS246 Link-Based Ranking. Problems of TFIDF Vector  Works well on small controlled corpus, but not on the Web  Top result for “American Airlines” query:
Chapter 20: Social Service Selection Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 9.1 Chapter 9 : Social Networks What is a social.
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
Making the most of social historic data Aleksander Kolcz Twitter, Inc.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Social Networking Algorithms related sections to read in Networked Life: 2.1,
Aspects and Closures and Promises (Oh My!) Alva L. Couch Tufts University.
Ivan Osipkov Fighting Freeloaders in Decentralized P2P File Sharing Systems.
The roots of innovation Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on:
Adapted from the original presentation made by the authors Reputation-based Framework for High Integrity Sensor Networks.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
Examining Dynamic Trust Relationships in Autonomy-Oriented Partner Finding Department of Computer Science, HKBU, HK International WIC Institute, BJUT,
Rob Smith April 21, /18 GOLD Project Update Meeting GOLD an infrastructural approach to virtual organisations.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Overview of Issues and Interests in Standards and Interoperability Mary Saunders Chief, Standards Services Division NIST.
TDEC-NUATRC Workshop Strategic Risk Communication: Air Toxics Rebecca Parkin, PhD, MPH The George Washington University Washington, DC October 18, 2005.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
Most of contents are provided by the website Introduction TJTSD66: Advanced Topics in Social Media Dr.
Ranking Link-based Ranking (2° generation) Reading 21.
Chapter 20: Social Service Selection Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Introduction to Active Directory
CI.III.1 Wider Adoption, Deployment, Utilization of a Cyberinfrastructure David De Roure.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Quality Is in the Eye of the Beholder: Meeting Users ’ Requirements for Internet Quality of Service Anna Bouch, Allan Kuchinsky, Nina Bhatti HP Labs Technical.
Social Networks and Cyberinfrastructure Munindar P. Singh
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Decentralized Trust Management for Ad-Hoc Peer-to-Peer Networks Thomas Repantis Vana Kalogeraki Department of Computer Science & Engineering University.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Dynamic Network Analysis Case study of PageRank-based Rewiring Narjès Bellamine-BenSaoud Galen Wilkerson 2 nd Second Annual French Complex Systems Summer.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Yue Zhang, Nathan Vance, and Dong Wang
The Globus Toolkit™: Information Services
Service-Oriented Computing: Semantics, Processes, Agents
Service-Oriented Computing: Semantics, Processes, Agents
Presentation transcript:

Referral Systems Formulation and Emergent Properties Munindar P. Singh (joint work with Bin Yu, pInar Yolum (mainly), Yathi Udupi) Department of Computer Science North Carolina State University

2 Outline Motivation and Framework Making Referral Systems Useful Understanding Referral Systems –Authoritativeness –Structure –Evolution Directions Backup –Clustering –Power-law networks

3 Referrals for Selection How can we find a business partner in a purely distributed system? A B C D Q Q A R(D) Q Each agent models the expertise (quality of a service) and sociability (quality of the referrals) of its acquaintances Based on these models, each agent can change its set of neighbors (using its neighbor selection policy): locally, autonomously Social network: as induced by the neighborhood relation An agent represents a principal offering or searching for services An agent generates a query for a service; sends it to its neighbors Each neighbor may provide the service or refer to other agents (based on its referral policies)

4 Why a Decentralized Approach? Problems with central authorities (e.g., Verisign) or reputation systems (e.g., eBay) Context and understanding: The contexts of usage may differ Empirical basis: Best to trust experience –Did Verisign itself buy DVDs from Amazon? Privacy: Raters may not want to reveal true ratings in public Trust: Users of ratings don’t necessarily know where the ratings come from

5 Referral process is explicit; emergent structure is not Emergent structure is explicit; underlying process is not Motivation Referrals for service selection –Follow referrals from trusted parties –Self-organize based on previous interactions Web structure –Properties of its snapshot –Stochastic models for approximating in-degree distributions –Hyperlinks are assumed to be endorsements –Local interactions are not captured

6 Application Domains Commerce: Distinct service producers and consumers Producers have expertise, consumers have sociability Answers are easy to evaluate Expertise of consumers does not increase Knowledge Management: All agents can be producers and consumers Answers are harder to evaluate Expertise of consumers may increase (expertise of the producers can be cached by others)

7 Developed over several years by Bin Yu Wentao Mo Paul Palathingal Subhayu Chatterjee Good theme for an MS thesis Prototype system for helping people participate in a referral network Practical challenges: –UI: use an IM client –Communication: use an IM server –Bootstrap: Infer people’s expertise and (initial) neighbors: mine Research challenges –How to evaluate convincingly? MARS: MultiAgent Referral System

8 Vector Space Model Originated in the 1960s Still used in text retrieval Applied by Yu & Singh; Yolum & Singh; Udupi, Yolum, & Singh Representations: 1 The initial work has involved vector representations for queries and knowledge –Assume dimensions, supply values –[spicy, timely, tasty, authentic, healthy]: [0.8,0.7,0.9,0.8,0.1] Easy approach conceptually –Common in text retrieval –Supports caching results –But has well-known limitations

9 Maximilien & Singh; Maximilien developed a practical framework for QoS in Web services QoS framework works as a reputation system; not yet combined into a referral system The meanings of the dimensions are not standard Ontology (loosely, conceptual model) for qualities of service –Common QoS: price, availability –Domain-specific QoS: spiciness –Idiosyncratic QoS: enjoyment How to handle preferences –Decision theory Representations: 2

10 Yu & Singh: Applies Dempster- Shafer theory of evidence and weighted majority learning Propagation of Trust Referrals support trust management –Provide a basis for finding witnesses, who can offer evidence (pro or con) about a third party –Provide a basis for rating such witnesses –Support adapting to select the more promising witnesses and avoid those who are deceptive

11 The completed work has mostly had an empirical flavor Theoretical aspects would be great topics for further research Analysis Not just develop a system and hope it works, but understand its functioning to: –Improve its effectiveness in important settings –Find new uses for it –Study general questions of the consequences of decentralization and emergence

12 Referral Policies Refer all neighbors: Refer best neighbor: Refer all matching neighbors: Refer those neighbors with “sufficient” expertise Does not consider which neighbors would be more likely to answer (similar to Gnutella) Refer the most capable neighbor. Guarantees that at least one neighbor is referred

13 Policies: Refer All Refer All Matching Refer Best Efficiency = # of good answers # of contacted agents Efficiency of Referral Policies Not enough good answers are found Too many agents are contacted

14 Effectiveness of Referral Policies Low quality even though answers are found Low efficiency but high quality

15 P(i): PageRank of i N(j): Neighbors of j K(i): Pages that point to page I d: Damping factor Authorities Link analysis to find authorities from Web crawls PageRank: Pages pointed to by authorities are also authoritative Factors that influence the emergence of authorities

16 In referral systems, mining is used as a research tool Cannot centrally crawl a referral system in practice Exposing mined results may violate privacy Yolum & Singh Referrals and Authorities Web search engines –Mostly crawl static pages –Interpret each URL as an endorsement –Mine centrally to decide where to direct searches by all users Referral systems –A decentralized agent Obtains dynamic (custom) information Knows if it is an endorsement Decides how to use it for its user Reveals appropriate information to others –Mining is optional, after the fact

17 Emergence of Authorities through Adaptation Authorities emerge as agents change neighbors

18 Authoritativeness & Number of Experts When the population has fewer experts, the authoritativeness of these experts is higher

19 Effect of Referral Policies When more referrals are exchanged, the authorities get higher PageRank (i.e., extent of their authoritativeness is higher)

20 Neighbor Selection Policies Providers: Weighted Average: Sociables:Choose the most sociable m agents of its acquaintances Choose the best m agents whose expertise matches the agent’s interests Choose the best m based on weighing both the expertise and the sociability of the acquaintances How do the agents choose their neighbors?

21 Effect of Neighbor Selection Policies Choosing sociables does not help authorities to emerge

22 Decreasing Expertise; Then Preferring Experts Given: agents 1 and 24 lose their expertise Evolution: Yet, agent 1 remains authoritative because of its sociability

23 Increasing Expertise; Then Preferring Sociables Given: agents 79 and 237 become experts Evolution: yet, agent 79 does not become authoritative because it is pointed to by only a few

24 Winner Takes All? Conjecture: After a population becomes stable, If agents prefer experts, then the winner need not take it all (i.e., a new expert can eventually become authoritative) If agents prefer sociables, then the winner takes it all (i.e., a new expert does not become authoritative)

25 Literature Referral systems: – MINDS – ReferralWeb Service location – Directory services (WHOIS++, LDAP) No modelling of other servers Rigid referrals (if any) – Chord, CAN, Pastry: – Routing based on a distributed hash table. – No support for autonomous or heterogeneous peers

26 MS Themes PhD Themes Directions Practical –Reimplement MARS –Incorporate QoS Research –Ontologies –Policies –Virtual Organizations

27 IEEE SUO; Cyc; Language-based approaches: WordNet; LDOCE Ontologies An ontology is a knowledge representation of some domain of interest –Successful communication (or interoperation) presupposes agreement of ontologies –Currently: develop standard ontologies for each domain Time consuming; fragile Doesn’t scale; omits opinions

28 Great theme for a dissertation Big challenge: how to convincingly evaluate the contribution Consensus Referral systems are a decentralized way to achieve (or approximate) consensus –About services, as above –Why not about ontologies? –Use social network to determine who is an authority in what topic –Find a way to combine their ontologies for those topics

29 Early stages: Udupi & Singh Logic-Based Policies Referral systems appear to work, but how can –We be sure nothing bad will happen –An administrator or user configure such systems Use declarative policies to capture the agents’ behavior –Use logic programming to develop the agents

30 Challenge to combine commitments with referral systems Virtual Organizations Organizations of autonomous, heterogeneous parties collaborating some computational task –Common in scientific computing –Emerging in business settings Challenges VOs face –Interoperation of information resources as in other systems –Governance regarding allocating resources

31 Key Ideas Pure Decentralization Reputation in action –Not separated from usage Interesting properties of clustering and emergence Intuitive model underlying link analysis

32 Backup Slides

33 Sidebar AutTitle Text

34 Interests used to generate queries Query, answer, interest, and expertise are vectors from Vector Space Model where each dimension corresponds to a domain Dimension of the vectors is 4 Sociability is scalar 400 agents, with 10 to 25% service providers 8 neighbors per consumer Initial neighbors picked randomly Reselect neighbors after every 2 queries 4 to 20 neighbor changes Basic Experimental Setup

35 Metrics Qualifications: –Similarity : A symmetric relation to measure how similar two vectors are –Capability : An asymmetric relation to measure how much better a vector is compared to the other

36 Metrics Quality: –Direct: How close a match are the neighbors of an agent to it? –Nth Best: Sort them and take the highest nth value. Each agent is represented by its nth best matching neighbor PageRank:

37 Measures how similar the neighbors of an agent are as well as how similar the agent is to its neighbors Sociability increases interest clustering Clustering Agents with similar interests May be looking for similar providers May give useful referrals Thus, will be considered sociable, and kept as neighbors

38 Result: Quality decreases when interest clustering increases Clustering (2)

39 Bipartite Communities Referral Communities Co-Citation versus Referral Communities

40 Bipartite Graphs Weakly-connected components Approximate how close a graph is to being bipartite: Removing k edges Removing k vertices Graph Structures Result: In a population where each agent exercises the Providers policy, if there are more providers than the number of neighbors an agent can have, then the graph converges into a bipartite graph

41 Bipartite Graphs Weakly-connected components If there is more than one weakly-connected component, then there is at least one customer who will not be able to find a service provider Graph Structures Result: In a population where each agent exercises the Sociables policy, the graph ends up with a number of weakly-connected components

42 In-Degree Distributions Referral Policies Neighbor Selection Policies

43 On Power-Law Relationships of the Internet Topology M. Faloutsos P. Faloutsos C. Faloutsos (SIGCOMM 1999) Interacting Individuals Leading to Zipf’s Law M. Marsili Y. Zhang (Physical Review Letters, 80(12), 1998) Power Laws

44 Power-Law Distribution of In-Degree When agents are ranked based on their in-degree, the agent with the highest rank has a lot higher in-degree than the agent with the second rank, and so on

45 Agents Prefer Providers (1) With non-selective referrals, when agents prefer providers, the in-degrees are shared among service providers

46 Agents Prefer Sociables (1) 1. With selective referrals, agents become locally sociable 2. In-degree distribution becomes a power-law

47 Agents Prefer Sociables (2) Decreasing the selectivity of referrals decreases the fitness of the power-law

48 Discussion Reputation? What reputation? –Clearly being used –Clearly being built up or torn down –But not computed (except for an after-the-fact study) Directions –Richer representations: transfer reputation across services –Protection against attacks: deception, collusion Implementation

49 Reputation Consider a society of principals, potentially each having opinions about the others –The opinions are applied implicitly in whether and how different parties do business with each other Someone’s reputation is a general opinion about that party –Sometimes partially probed by asking others –Never explicitly fully aggregated, except in current computational approaches