CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.

Slides:



Advertisements
Similar presentations
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Advertisements

© 2005 IBM Corporation Discovering Large Dense Subgraphs in Massive Graphs David Gibson IBM Almaden Research Center Ravi Kumar Yahoo! Research* Andrew.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Connectivity Structure of Bipartite Graphs via the KNC-Plot Erik Vee joint work with Ravi Kumar, Andrew Tomkins.
Massively Distributed Database Systems Distributed Hash Spring 2014 Ki-Joune Li Pusan National University.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Unsupervised learning
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Cobra: Content-based Filtering and Aggregation of Blogs and RSS Feeds Ian Rose 1, Rohan Murty 1, Peter Pietzuch 2, Jonathan Ledlie 1, Mema Roussopoulos.
Search in Power-Law Networks Presented by Hakim Weatherspoon CS294-4: Peer-to-Peer Systems Slides also borrowed from the following paper Path Finding Strategies.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Information Retrieval in Practice
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg Presented by Yongqiang Li Adapted from
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
CS522: Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian
Why we search: Visualizing and predicting user behavior E Adar, D S Weld, B N Bershad, S Gribble Raju Balakrishnan.
ISP 433/633 Week 7 Web IR. Web is a unique collection Largest repository of data Unedited Can be anything –Information type –Sources Changing –Growing.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Link Analysis. 2 HITS - Kleinberg’s Algorithm HITS – Hypertext Induced Topic Selection For each vertex v Є V in a subgraph of interest: A site is very.
CS522: Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian
Overview of Search Engines
HITS – Hubs and Authorities - Hyperlink-Induced Topic Search A on the left is an authority A on the right is a hub.
Leveraging Big Data: Lecture 11 Instructors: Edith Cohen Amos Fiat Haim Kaplan Tova Milo.
Stochastic Approach for Link Structure Analysis (SALSA) Presented by Adam Simkins.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα
COMMUNITIES IN MULTI-MODE NETWORKS 1. Heterogeneous Network Heterogeneous kinds of objects in social media – YouTube Users, tags, videos, ads – Del.icio.us.
R OBERTO B ATTITI, M AURO B RUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Feb 2014.
Deduplication CSCI 572: Information Retrieval and Search Engines Summer 2010.
The PageRank Citation Ranking: Bringing Order to the Web Presented by Aishwarya Rengamannan Instructor: Dr. Gautam Das.
Graph Algorithms Ch. 5 Lin and Dyer. Graphs Are everywhere Manifest in the flow of s Connections on social network Bus or flight routes Social graphs:
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 9.1 Chapter 9 : Social Networks What is a social.
Adversarial Information Retrieval The Manipulation of Web Content.
1 Announcements Research Paper due today Research Talks –Nov. 29 (Monday) Kayatana and Lance –Dec. 1 (Wednesday) Mark and Jeremy –Dec. 3 (Friday) Joe and.
Finding dense components in weighted graphs Paul Horn
Distributed Computing Rik Sarkar. Distributed Computing Old style: Use a computer for computation.
Know your Neighbors: Web Spam Detection Using the Web Topology Presented By, SOUMO GORAI Carlos Castillo(1), Debora Donato(1), Aristides Gionis(1), Vanessa.
Using Hyperlink structure information for web search.
1 Discovering Authorities in Question Answer Communities by Using Link Analysis Pawel Jurczyk, Eugene Agichtein (CIKM 2007)
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved Chapter 2 ARCHITECTURES.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Enabling Peer-to-Peer SDP in an Agent Environment University of Maryland Baltimore County USA.
PODC Distributed Computation of the Mode Fabian Kuhn Thomas Locher ETH Zurich, Switzerland Stefan Schmid TU Munich, Germany TexPoint fonts used in.
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Services of the Formal Digital Library (FDL) NA-MKM 2004 January 6, 2004 Lori Lorigo, Cornell University.
- Murtuza Shareef Authoritative Sources in a Hyperlinked Environment More specifically “Link Analysis” using HITS Algorithm.
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
Recommendation systems Paolo Ferragina Dipartimento di Informatica Università di Pisa Slides only!
Estimating PageRank on Graph Streams Atish Das Sarma (Georgia Tech) Sreenivas Gollapudi, Rina Panigrahy (Microsoft Research)
Exponential random graphs and dynamic graph algorithms David Eppstein Comp. Sci. Dept., UC Irvine.
1 The EigenRumor Algorithm for Ranking Blogs Advisor: Hsin-Hsi Chen Speaker: Sheng-Chung Yen ( 嚴聖筌 )
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Block-level Link Analysis Presented by Lan Nie 11/08/2005, Lehigh University.
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
A Sublinear Time Algorithm for PageRank Computations CHRISTIA N BORGS MICHAEL BRAUTBA R JENNIFER CHAYES SHANG- HUA TENG.
Extrapolation to Speed-up Query- dependent Link Analysis Ranking Algorithms Muhammad Ali Norozi Department of Computer Science Norwegian University of.
Information Retrieval in Practice
Cohesive Subgraph Computation over Large Graphs
Link-Based Ranking Seminar Social Media Mining University UC3M
Lecture 22 SVD, Eigenvector, and Web Search
Information retrieval and PageRank
Jinhong Jung, Woojung Jin, Lee Sael, U Kang, ICDM ‘16
3.3 Network-Centric Community Detection
Lecture 22 SVD, Eigenvector, and Web Search
Lecture 22 SVD, Eigenvector, and Web Search
Presentation transcript:

CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian

Today More about advertisement auctions Some open questions

Generative models Have Seen  Probabilistic models for power law graphs and small world networks  Network formation games Open Questions  More realistic models for the Internet graph, hierarchical (i.e. hosts and pages)  Combining ideas from both probabilistic models and game theoretic models

Search in Social Networks Have Seen  Kleinberg’s decentralized search  Generalizations to arbitrary geographic distributions Open Questions  New Kleinberg paper, FOCS 2005, discusses payment for search in social networks  Generalizations: general graphs, cost for forwarding information, response times

Link Analysis Algorithms Have Seen  Link analysis for web graph – PageRank, HITS  Web spam  Axiomatic approaches to PageRank Open questions  Collusion in PageRank: Algorithms for boosting PageRank of a particular node that locally look “normal”

Weblogs A social network  Time-stamped links  Frequent updates  RSS feed Open Questions  Generative models  Finding “discussions”  Blogs to predict trends (sales, etc.), finding correlated topics (also in query logs) Picture from Kumar et al.

Query Logs Income Tax

Query Logs Greeting Cards

Query Log Analysis Correlation between terms

Query Log Analysis Data streams (top gainers & top losers)  Two streams S 1 and S 2 of items from universe U={1,…,n} S 1 : S 2 :  Find item i with approximately largest (or smallest) count(i,S 1 )-count(i,S 2 )  One pass, sublinear space  Charikar et al. give 2-pass algorithm

Clustering and Communities Have Seen  Spectral clustering (use eigenvectors of adjacency matrix to well-connected components)  Correlation clustering (min cut positive edges and uncut negative edges)  Metric labeling (assign labels to vertices to min assignment plus separation cost) Open Questions  Combine correlation clustering and metric labeling  Clustering in directed graphs  Finding dense bipartite subgraphs in the web (Kumar et al. have heuristic for complete subgraphs which finds small communities)

Reputation Systems Have Seen  Equilibrium analysis of a model with one seller and multiple buyers Open Questions  Better game-theoretic analysis of existing systems (multiple sellers, identity fraud, buyer reputation)  Better methods for aggregating feedbacks (taking into account value of transactions; reputation of the person leaving the feedback)

Peer-to-peer networks Have Seen  Models (Napster, Gnutella, Chord)  Content storage, search, and dynamics Open Questions  Spread of viruses, honeypots  Incentives for providing content  Self-stabilization

Recommendation Systems Have Seen  History-based systems Two models, used assumptions about clustering of topics, small number of clusters, existence of orthogonal dominant user types  Query-based systems Two models, relied on “committees”, also assumed existence of small number of user types Open Questions  Economic incentives for recommendations

The End!