CS522: Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian

Slides:



Advertisements
Similar presentations
Ranking, Trust, and Recommendation Systems: An Axiomatic Approach Moshe Tennenholtz TechnionIsrael Institute of Technology and Microsoft Israel R&D Center.
Advertisements

A Simple Proof "There is no consistent method by which a democratic society can make a choice (when voting) that is always fair when that choice must be.
Web Information Retrieval
Approximation, Chance and Networks Lecture Notes BISS 2005, Bertinoro March Alessandro Panconesi University La Sapienza of Rome.
Voting and social choice Vincent Conitzer
IMPOSSIBILITY AND MANIPULABILITY Section 9.3 and Chapter 10.
CS 886: Electronic Market Design Social Choice (Preference Aggregation) September 20.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Arrow’s impossibility theorem EC-CS reading group Kenneth Arrow Journal of Political Economy, 1950.
5/12/2015PhD seminar CS BGU Counting subgraphs Support measures for graphs Natalia Vanetik.
Motivation: Condorcet Cycles Let people 1, 2 and 3 have to make a decision between options A, B, and C. Suppose they decide that majority voting is a good.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 By Gil Kalai Institute of Mathematics and Center for Rationality, Hebrew University, Jerusalem, Israel presented by: Yair Cymbalista.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Arrow's Impossibility Theorem Kevin Feasel December 10, 2006
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Alex Tabarrok Arrow’s Theorem.
Rank Aggregation Methods for the Web CS728 Lecture 11.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Estimating the Global PageRank of Web Communities Paper by Jason V. Davis & Inderjit S. Dhillon Dept. of Computer Sciences University of Texas at Austin.
CPS Voting and social choice
Social choice theory = preference aggregation = voting assuming agents tell the truth about their preferences Tuomas Sandholm Professor Computer Science.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 April 20, 2005
The Page Rank Axioms Based on Ranking Systems: The PageRank Axioms, by Alon Altman and Moshe Tennenholtz. Presented by Aron Matskin.
CS522: Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian
Social Choice Theory By Shiyan Li. History The theory of social choice and voting has had a long history in the social sciences, dating back to early.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Link Analysis, PageRank and Search Engines on the Web
SubSea: An Efficient Heuristic Algorithm for Subgraph Isomorphism Vladimir Lipets Ben-Gurion University of the Negev Joint work with Prof. Ehud Gudes.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Social choice theory = preference aggregation = truthful voting Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
CS 188: Artificial Intelligence Fall 2009 Lecture 19: Hidden Markov Models 11/3/2009 Dan Klein – UC Berkeley.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Introduction complexity has been suggested as a means of precluding strategic behavior. Previous studies have shown that some voting protocols are hard.
Social choice (voting) Vincent Conitzer > > > >
Overview Aggregating preferences The Social Welfare function The Pareto Criterion The Compensation Principle.
Piyush Kumar (Lecture 2: PageRank) Welcome to COT5405.
CPS Voting and social choice Vincent Conitzer
Reputations Based On Transitive Trust Slides by Josh Albrecht.
The PageRank Citation Ranking: Bringing Order to the Web Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd Presented by Anca Leuca, Antonis Makropoulos.
Center for E-Business Technology Seoul National University Seoul, Korea BrowseRank: letting the web users vote for page importance Yuting Liu, Bin Gao,
CHAPTER 15 SECTION 1 – 2 Markov Models. Outline Probabilistic Inference Bayes Rule Markov Chains.
Optimal Link Bombs are Uncoordinated Sibel Adali Tina Liu Malik Magdon-Ismail Rensselaer Polytechnic Institute.
PageRank. s1s1 p 12 p 21 s2s2 s3s3 p 31 s4s4 p 41 p 34 p 42 p 13 x 1 = p 21 p 34 p 41 + p 34 p 42 p 21 + p 21 p 31 p 41 + p 31 p 42 p 21 / Σ x 2 = p 31.
1 Elections and Manipulations: Ehud Friedgut, Gil Kalai, and Noam Nisan Hebrew University of Jerusalem and EF: U. of Toronto, GK: Yale University, NN:
TRUST-BASED RECOMMENDATION SYSTEMS : an axiomatic approach Microsoft Research, Redmond WA Jennifer Chayes Christian Borgs Reid Anderson Uri Feige Abie.
CS774. Markov Random Field : Theory and Application Lecture 02
Math for Liberal Studies.  We have seen many methods, all of them flawed in some way  Which method should we use?  Maybe we shouldn’t use any of them,
Great Theoretical Ideas in Computer Science.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Information Retrieval and Web Search Link analysis Instructor: Rada Mihalcea (Note: This slide set was adapted from an IR course taught by Prof. Chris.
Social choice theory = preference aggregation = voting assuming agents tell the truth about their preferences Tuomas Sandholm Professor Computer Science.
CS483/683 Multi-Agent Systems Lecture 2: Distributed variants of 2 important AI problems: Search and Constraint Satisfaction 21 January 2010 Instructor:
Link Analysis Algorithms Page Rank Slides from Stanford CS345, slightly modified.
Social Choice Lectures 14 and 15 John Hey. Lectures 14 and 15: Arrow’s Impossibility Theorem and other matters Plan of lecture: Aggregation of individual.
A Sublinear Time Algorithm for PageRank Computations CHRISTIA N BORGS MICHAEL BRAUTBA R JENNIFER CHAYES SHANG- HUA TENG.
Algorithms for Large Data Sets
Impossibility and Other Alternative Voting Methods
Search Engines and Link Analysis on the Web
Social choice theory = preference aggregation = voting assuming agents tell the truth about their preferences Tuomas Sandholm Professor Computer Science.
Applied Mechanism Design For Social Good
Impossibility and Other Alternative Voting Methods
Alex Tabarrok Arrow’s Theorem.
CS 188: Artificial Intelligence Spring 2007
Making Eigenvector-based Reputation Systems Robust to Collusion
Piyush Kumar (Lecture 2: PageRank)
PageRank algorithm based on Eigenvectors
Models and Algorithms for Complex Networks
Outline Rank Aggregation Computing aggregate scores
Presentation transcript:

CS522: Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian

Previously in this class Ranking using the hyperlink structure:  HITS  PageRank

Today Dealing with web spam An axiomatic approach to PageRank Next Lecture: Kamal Jain

Recap The PageRank of a page p is the probability of p in the stationary distribution of a random walk that in each stage with probability 1 – ε follows a random link from the current page, and with probability ε, starts from a random page. Typically, ε = 0.15.

The collusion problem What if a group of nodes “collude” to increase the PageRank of one or more in the group? Zhang, Goel, Govindan, Mason, and Van Roy, WAW Define “amplification” of a group of nodes, and prove that it is always at most O(1/ ε).

The collusion problem Question: Is collusion really a problem? Experiment (on a web subgraph, and blogstreet):  Take, say, the 1000 th and the 1001 th nodes in the PageRank order.  Each of these nodes removes all links to other pages, and adds a link to the other.  Compute PageRanks in the new graph. Results: Ranks of the colluding nodes increase significantly. Exercise: Go to eBay and search for PageRank.

Finding colluding groups Approach 1: Find a set S with the largest amplification. However, it can be shown that this problem is NP-hard.

Finding colluding nodes Approach 2: Identify colluding individuals Observation: If we increase ε, the PageRank of a colluding individual decreases (often proportional to 1/ ε). Heuristic: Compute PageRanks for multiple values of ε, and compute the correlation of the PageRank of each node with 1/ ε. Nodes with high correlation are probably colluding.

Dealing with collusion We can “punish” colluding individuals by increasing their ε, so that they cannot pass their reputation on to others. Experimental results

Explaining PageRank Axiomatic approach  Define a set of “natural” axioms  Prove that PageRank satisfies these axioms  Prove that any page ranking algorithm satisfying these axioms outputs the same ranking as PageRank

Axiomatic Approaches: Voting Consider a democracy where people submit preference lists over candidates. A voting rule (or social welfare function) outputs a global ordering of candidates for every set of preference lists.

Voting Axioms Unanimity: If everyone prefers the candidate x to y, then the global ordering also ranks x above y. Independence of irrelevant alternatives (IIA): For any two candidates x and y, changes in people’s rankings of candidates other than x and y should not affect the relative position of x and y in the global ordering.

Arrow’s (Im)possibility Theorem Theorem [Arrow, 1951]: The only function satisfying unanimity and IIA is dictatorship. Extensions  Similar results hold for social choice functions where a single candidate (winner) must be chosen [Muller-Satterthwaite, 1977]  Majority rule arises naturally when we relax IIA or restrict the preference domain of people (i.e., impose rules on how they can rank candidates).

Axiomatic Approach: PageRank Agents are nodes of graph. Agents output a “vote” over other agents as represented by a directed graph G. A ranking algorithm is a function mapping every directed graph to an ordering of its nodes.

PageRank Axioms Isomorphism: The ranking procedure should be independent of the names of the nodes. Self edge: Adding self loops should not harm a node and should not affect other nodes. Vote by committee: Importance a gives to b and c by voting shouldn’t change if a votes via committee. Collapsing: If two nodes vote similarly, and are linked to by disjoint sets of nodes, the ranking does not change when they are collapsed to one node. Proxy: There is an equal distribution of importance.

PageRank: Altman and Tennenholtz Theorem: PageRank satisfies axioms. Theorem: PageRank is only ranking algorithm which satisfies axioms (i.e., every other ranking algorithm which satisfies axioms outputs same ranking as PageRank).