Reputation Systems For Open Collaboration, CACM 2010 Bo Adler, Luca de Alfaro et al. Nishith Agarwal

Slides:



Advertisements
Similar presentations
Measuring Author Contributions to the Wikipedia B. Thomas Adler, Luca de Alfaro, Ian Pye and Vishwanath Raman Computer Science Dept. UC Santa Cruz, CA,
Advertisements

GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.
Structural Inference of Hierarchies in Networks BY Yu Shuzhi 27, Mar 2014.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
Example Presentation Something to Show. Basic Title & Content Point to be made Number One Point to be made Number Two Point to be made Number Three.
Introduction To System Analysis and Design
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
To trust or not, is hardly the question! Sai Moturu.
A Content-Driven Reputation System for the Wikipedia Nan Li
Computing Trust in Social Networks
Online communities 1 Theory revision Complete some of the activities in this powerpoint and use the revision book to answer questions.
Survey Monkey A “How To” Guide.
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Web 2.0: Concepts and Applications 2 Publishing Online.
PUBLISHING ONLINE Chapter 2. Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals.
Patterns, effective design patterns Describing patterns Types of patterns – Architecture, data, component, interface design, and webapp patterns – Creational,
Introduction To System Analysis and Design
Visibility Graph. Voronoi Diagram Control is easy: stay equidistant away from closest obstacles.
Characterizing the Uncertainty of Web Data: Models and Experiences Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti Università degli Studi.
Bruno Ribeiro Don Towsley University of Massachusetts Amherst IMC 2010 Melbourne, Australia.
WikiTrust: Turning Wikipedia Quantity into Quality B. Thomas Adler, Luca de Alfaro, and Ian Pye.
Algorithmic Detection of Semantic Similarity WWW 2005.
REPUTATION SYSTEMS FOR OPEN COLLABORATION CACM 2010 Bo Adler, Luca de Alfaro, Ashutosh Kulshreshtha, Ian Pye Reviewed by : Minghao Yan.
Robotics Club: 5:30 this evening
Computer Programming Application Friday 10/29/2010.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
1 NETWORKING 2012 Parallel and Distributed Systems Group, Delft University of Technology, the Netherlands May 22, 2012 Reducing the History in Decentralized.
Applying Link-based Classification to Label Blogs Smriti Bhagat, Irina Rozenbaum Graham Cormode.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
A novel approach to visualizing dark matter simulations
Lesson 7 -Collaborative Editing Objectives In this lesson we will: ● Introduce the idea of Wiki ethics, ● Explore the Recent changes page, ● and diff &
Link Prediction Class Data Mining Technology for Business and Society
Dept. of Information Systems St. Francis Xavier University
Lecture 7: Constrained Conditional Models
Your Step-by-Step Guide
Instructions and Procedures
Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals Wikis are collections of searchable,
Working in Groups in Canvas
Distributed Software Development
Author(s): Rahul Sami, 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Noncommercial.
The Beta Reputation System
Lenva Shearing Gail Mitchell
IT.CAS.Web2.0 Kyle Erickson
On Assigning Implicit Reputation Scores in an Online Labor Marketplace
Semi-Global Stereo Matching with Surface Orientation Priors
Author(s): Rahul Sami, 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Noncommercial.
Link Prediction Seminar Social Media Mining University UC3M
What is the next line of the proof?
Applications of Data Mining in Software Engineering
Aspect-based sentiment analysis
Unit 1.4 Wired and Wireless Networks Lesson 1
Motivation and Background
Motivation and Background
Fine-Grained Complexity Analysis of Improving Traveling Salesman Tours
'Linear Hierarchical Models'
Unit 1.4 Wired and Wireless Networks Lesson 3
Discriminative Frequent Pattern Analysis for Effective Classification
Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals Wikis are collections of searchable,
Correcting Error in Message Passing Systems
ICT Word Processing Lesson 5: Revising and Collaborating on Documents
Ben Jones - S Rebecca Hunter - S
Graph and Link Mining.
Social Role Evolution of an Ideological Online Community Brittany I
Lecture 6: Counting triangles Dynamic graphs & sampling
DIGITAL IDENTITY FOR COMPANIES
Shawn Stiver ARC Fall Semester, 2016 Geography 385 GIS For The Web
Wikis Skills (application development): wiki editing and management
Feature computation and classification of grating pitch.
Generalized Diagnostics with the Non-Axiomatic Reasoning System (NARS)
Presentation transcript:

Reputation Systems For Open Collaboration, CACM 2010 Bo Adler, Luca de Alfaro et al. Nishith Agarwal

What are Reputation Systems? Wikipedia Definition: A reputation system computes reputation scores for a set of objects within a community or domain, based on a collection of opinions other users hold about the objects. Why do we need them? Reputation Systems can help stem abuse of content, and can offer indications of content quality. In many ways, reputation systems are the on-line equivalent of the body of laws that regulates the real-world interaction of people. Who uses reputation systems? 2

Types of Reputation Systems? 3 WikiTrust and Crowdsensus are both content-driven systems

WikiTrust A content-driven reputation system for Wiki authors and content on Wikipedia Goals: Incentivize users to give lasting contributions Help users and editors increase the quality of content Offer content consumers a guide to content quality Components: User reputation system: Users gain reputation when they add content that is preserved by subsequent users. Content reputation system: Content gains reputation when it is revised by highly reputed authors. Firefox Extension: Lets users view content reputation by changing text background color. 4

WikiTrust – User Reputation System Contribution quality: Relies on edit distance between revisions. -1 if changes made by b are completely reverted +1 if changes made by b are completely preserved 5 Contributions are considered good quality if the change is preserved in subsequent revisions User reputation is computed according to quality and quantity of contributions they make User Reputation: Is proportional to the edit distance and contribution quality of b. r(B) ≈ d(a,b) + q(b | a,c) + r(C) r(B) being the reputation of author B of revision b r(C) being the reputation of author C of revision c

WikiTrust – Content Reputation System (TextTrust) Based on extent to which content was revised, and reputation of users who revised it. High content reputation requires consensus from reputed authors. Basic Algorithm: Content that is edited is assigned a small fraction of the revision user’s reputation. Unedited content gains more reputation. Some Tweaks: Ensures that re-arranging or deleting text leaves a low reputation mark Content reputation cannot exceed the revision user’s reputation Users cannot raise arbitrary reputation by multiple edits 6

Crowdsensus A content-driven reputation system built to analyze user edits to Google Maps  Goals: To measure accuracy of a user who contributes information To display accurate details for a business (title, address, phone etc.) Differences from WikiTrust: There exists a “ground truth” User reputation is not visible. Hence, no need to keep algorithm simple. User identity is stronger 7

Crowdsensus - Algorithm Structured as fixed point graph algorithm Vertices are users u, and business attributes a Edges are attribute values v Each user has truthfulness value q u Algorithm Details: User vertices send (q u, v u ) pairs to value vertices An attribute inference algorithm is used to derive probability distribution over values (v 1, v 2..v n ) Crowdsensus sends back to user u, the estimate probability that v u is correct Different attribute inference algorithms tailored to every attribute type Comparison to Bayesian Inference Model: For 1000 attributes, 100 users, 10 attribute values: –CrowdSensus error rate: 2.8% –Bayesian error rate: 7.9% For 1000 attributes, 100 users, 5 attribute values –CrowdSensus error rate: 12.6% –Bayesian error rate: 22% 8 a q1,v 1 q2,v 2 qn,v n

Design Considerations User vs. Content Driven Visibile to User? Weak vs. Strong Identity Existence of Ground Truth Chronological vs. Global Updates WikipediaContentYesWeakNoChronological Google Maps ContentNoStrongYesBoth YelpUserYesStrongNoChronological 9

Conclusion Reputation Systems are becoming increasing important to regulate real-world interactions of people Reputation Systems can be very different depending on the content Wikipedia’s WikiTrust relies on edit distance between revisions Google’s Crowdsensus measure’s the accuracy of contributing users using attribute inference algorithms 10 Research Directions: How can reputation systems lead to happy, active, healthy communities? How can we build reputation systems that meet multiple goals?

Pros Highlights the importance of reputation systems for online collaboration of content. Compares how two different content-driven reputation systems fulfill very different practical needs. Gives insight on how Google approaches reputation systems, where they might not explicitly share the algorithm. E.g. PageRank 11

Cons Mentions that CrowdSensus yields better results than Bayesian approach but does not elaborate on it. Provides little detail on how the CrowdSensus algorithm works for map edits. Does not provide detials about how attribute inference models work. 12