Small-World File-Sharing Communities Adriana Iamnitchi, Matei Ripeanu and Ian Foster,

Slides:



Advertisements
Similar presentations
Peer-to-Peer and Social Networks Power law graphs Small world graphs.
Advertisements

Complex Network Theory
Complex Networks Advanced Computer Networks: Part1.
‘Small World’ Networks (An Introduction) Presenter : Vishal Asthana
CS8803-NS Network Science Fall 2013
Collective Dynamics of ‘Small World’ Networks C+ Elegans: Ilhan Savut, Spencer Telford, Melody Lim 29/10/13.
Emergence of Scaling in Random Networks Albert-Laszlo Barabsi & Reka Albert.
Socially-Aware Distributed Systems or Why this Class Collaboration? Anda Iamnitchi
Information Networks Small World Networks Lecture 5.
Search in Power-Law Networks Presented by Hakim Weatherspoon CS294-4: Peer-to-Peer Systems Slides also borrowed from the following paper Path Finding Strategies.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
CS728 Lecture 5 Generative Graph Models and the Web.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
 We developed a fast and tunable crawler, Cruiser.  Cruiser uses a master-slave architecture, parallel crawling, and leverages the two-tier topology.
Large-Scale Organization of Semantic Networks Mark Steyvers Josh Tenenbaum Stanford University.
School of Information University of Michigan SI 614 Search in random networks Lecture 16.
The small-world problem
Characterizing the Two-Tier Gnutella Topology  Gnutella, FastTrack, and eDonkey use two-tier overlay topologies.  Our initial study focuses on Gnutella.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Tracking User Attention in Collaborative Tagging Communities Elizeu Santos-Neto Matei Ripeanu Univesity of British Columbia Adriana Iamnitchi University.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
Summary from Previous Lecture Real networks: –AS-level N= 12709, M=27384 (Jan 02 data) route-views.oregon-ix.net, hhtp://abroude.ripe.net/ris/rawdata –
Social Media: YouTube as a Case. 2 New generation of video sharing service Feb.15th, 2005 Some statistics: 60 hours video uploaded very minute 4 billion.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
P2P Architecture Case Study: Gnutella Network
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
The Social Hourglass: Enabling Socially-aware Applications and Services Adriana Iamnitchi University of South Florida
Enabling Socially-Aware Distributed Systems or Some Ongoing Research in the Distributed Systems Group Adriana Iamnitchi
1 presentation of article: Small-World File-Sharing Communities Article: Adriana Iamnitchi, Matei Ripeanu, Ian Foster Presentation: Periklis Akritidis.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Microsoft Research1 Characterizing Alert and Browse Services for Mobile Clients Atul Adya, Victor Bahl, Lili Qiu Microsoft Research USENIX Annual Technical.
CIS : Internet-Scale Networked Systems Adriana Iamnitchi (Anda)
Complex Networks Measures and deterministic models Philippe Giabbanelli.
Peer Centrality in Socially-Informed P2P Topologies Nicolas Kourtellis, Adriana Iamnitchi Department of Computer Science & Engineering University of South.
Complex Network Theory – An Introduction Niloy Ganguly.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Practical LFU implementation for Web Caching George KarakostasTelcordia Dimitrios N. Serpanos University of Patras.
Complex Network Theory – An Introduction Niloy Ganguly.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design Authors: Matei Ripeanu Ian Foster Adriana.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Locating data in (small-world?) P2P scientific collaborations Gisik Kwon 6/11/02.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
File Grouping for Scientific Data Management: Lessons from Experimenting with Real Traces Shyamala Doraimani* and Adriana Iamnitchi University of South.
Analyzing Networks. Milgram’s Experiments “Six degrees of Separation” Milgram’s letters to various recruits in Nebraska who were asked to forward the.
Models and Algorithms for Complex Networks
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Unstructured Networks: Search Márk Jelasity. 2 Outline ● Emergence of decentralized networks ● The Gnutella network: how it worked and looked like ● Search.
Lecture 1: Complex Networks
Topics In Social Computing (67810)
Section 8.6 of Newman’s book: Clustering Coefficients
Section 8.2: Shortest path and small world effect
Shortest path and small world effect
Department of Computer Science University of York
Peer-to-Peer Information Systems Week 6: Performance
Clustering Coefficients
Lecture 9: Network models CS 765: Complex Networks
Other Random Graph Models
Presentation transcript:

Small-World File-Sharing Communities Adriana Iamnitchi, Matei Ripeanu and Ian Foster,

Selecting tradeoffs Understand users’ requirements … AND users’ behavior –Patterns exist and can be exploited: Zipf request popularity distribution (web caching) Breslau, Cao, Fan, Phillips, and Shenker. Web caching and Zipf-like distributions: Evidence and implications. Infocom, 1999 Power-law node degree distribution in graphs (search) Adamic, Lukose, Puniyani, Huberman. Search in Power-law Networks. In Physical Review E, , 2001

“No 24 in B minor, BWV 869” “Les Bonbons” “ Yellow Submarine” “Les Bonbons” “Yellow Submarine” “Wood Is a Pleasant Thing to Think About” “Wood Is a Pleasant Thing to Think About” System view: The Data-Sharing Graph G m T (V, E):  V is set of users active during interval T  An edge in E connects users that asked for at least m common files within T

Three Data-Sharing Graphs The DØ Experiment –High-energy physics data Grid –70+ institutions, 18 countries The Web: Boeing traces –Web requests from Boeing sites to the Web The KaZaA music-sharing network –Peer-to-peer system UsersRequestsDuration traces D0 3172,757, days Web 60,82616,527,19410 hours Kazaa 14,404976,1845 days

Small average path length Large clustering coefficient The DØ Collaboration Small World! CCoef = # Existing Edges # Possible Edges 6 months of traces (January – June 2002) 300+ users, 2 million requests for 200K files

Small-World Graphs Small path length, large clustering coefficient –Typically compared against random graphs Think of: –“It’s a small world!” –“Six degrees of separation” Milgram’s experiments in the 60s

Other Small Worlds Word co-occurrences Film actors LANL coauthors Internet Web Food web Power grid D. J. Watts and S. H. Strogatz, Collective dynamics of small-world networks. Nature, 393: , 1998 R. Albert and A.-L. Barabási, Statistical mechanics of complex networks, R. Modern Physics 74, 47 (2002).

Web Data-Sharing Graphs 7200s, 50files 3600s, 50files 1800s, 100files 1800s, 10file 300s, 1file

DØ Data-Sharing Graphs 7days, 1file 28 days, 1 file

KaZaA Data-Sharing Graphs 7day, 1file 28 days 1 file 2 hours 1 file 1 day 2 files 4h 2 files 12h 4 files

Motivation : Understand patterns (user behavior) to choose the right tradeoffs Understand what?! –The data-sharing graph Data-sharing communities –The D0 Experiment (physicists in a data grid) –Web –Kazaa It’s a small world! So what? –News or consequence of known patterns? –Relevance for mechanism design Outline

Exploiting Patterns: Intuition  Exploit the data-sharing graph (as overlay) Exploit its small-world properties –Large clustering coefficient: optimize existing (e.g., centralized) mechanisms –… and small average path length: design a new file location mechanism D0 Web

Information Dissemination: Local Answers D0 Web Kazaa Hit rate due to previous information dissemination within clusters (experience sharing): up to 70% of requests will find answers locally

The Message Be imaginative when looking for patterns There are small-world patterns in file-sharing communities …that can be exploited for mechanism design –Location of files –Data placement