1 CS 525 Advanced Distributed Systems Spring 2015 Indranil Gupta (Indy) Structure of Networks Apr 30, 2015 All Slides © IG.

Slides:



Advertisements
Similar presentations
Hidden Metric Spaces and Navigability of Complex Networks
Advertisements

Complex Networks Advanced Computer Networks: Part1.
Scale Free Networks.
Analysis and Modeling of Social Networks Foudalis Ilias.
School of Information University of Michigan Network resilience Lecture 20.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
Mining and Searching Massive Graphs (Networks)
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Networks FIAS Summer School 6th August 2008 Complex Networks 1.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Small World Networks Somsubhra Sharangi Computing Science, Simon Fraser University.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
The structure of the Internet. The Internet as a graph Remember: the Internet is a collection of networks called autonomous systems (ASs) The Internet.
Summary from Previous Lecture Real networks: –AS-level N= 12709, M=27384 (Jan 02 data) route-views.oregon-ix.net, hhtp://abroude.ripe.net/ris/rawdata –
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
P2P Architecture Case Study: Gnutella Network
1 Reading Report 4 Yin Chen 26 Feb 2004 Reference: Peer-to-Peer Architecture Case Study: Gnutella Network, Matei Ruoeanu, In Int. Conf. on Peer-to-Peer.
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
03/19/02Scalab Seminar Series1 Mapping the Gnutella Network Macroscopic Properties of Large Scale P2P Systems Ramaswamy N.Vadivelu Scalab, ASU.
Gennaro Cordasco - How Much Independent Should Individual Contacts be to Form a Small-World? - 19/12/2006 How Much Independent Should Individual Contacts.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Complex Network Theory – An Introduction Niloy Ganguly.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
6 December On Selfish Routing in Internet-like Environments paper by Lili Qiu, Yang Richard Yang, Yin Zhang, Scott Shenker presentation by Ed Spitznagel.
LightFlood: An Efficient Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.
Complex Network Theory – An Introduction Niloy Ganguly.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
How Do “Real” Networks Look?
CS:4980:0005 Peer-to-Peer and Social Networks Fall 2015 Introduction.
Scaling Properties of the Internet Graph Aditya Akella, CMU With Shuchi Chawla, Arvind Kannan and Srinivasan Seshan PODC 2003.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
March 3, 2009 Network Analysis Valerie Cardenas Nicolson Assistant Adjunct Professor Department of Radiology and Biomedical Imaging.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Scaling Properties of the Internet Graph Aditya Akella With Shuchi Chawla, Arvind Kannan and Srinivasan Seshan PODC 2003.
Lecture II Introduction to complex networks Santo Fortunato.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Lecture 23: Structure of Networks
CS 525 Advanced Distributed Systems Spring 2011
Peer-to-Peer and Social Networks
How Do “Real” Networks Look?
Lecture 23: Structure of Networks
How Do “Real” Networks Look?
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Topology and Dynamics of Complex Networks
Lecture 23: Structure of Networks
Lecture 9: Network models CS 765: Complex Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

1 CS 525 Advanced Distributed Systems Spring 2015 Indranil Gupta (Indy) Structure of Networks Apr 30, 2015 All Slides © IG

2 1981

3 Countries= nodes Trade= edges Graph or Network

4 1992

5 The Internet (Internet Mapping Project, color coded by ISPs) PCs= nodes Connected= edges

6 Food Web of Little Rock Lake,WI Electric Power Grid Metabolic reaction network

7 This Lecture: Common Thread Networks –Structure of, –Dynamics within, We’ll study networks at three different “levels”

8 Lowermost Level: Basics, Physical Phenomena, and Life

9 Complexity of Networks Structural: human population has ~7 B nodes, there are millions of computers on the Internet… Evolution: people make new friends all the time, ISP’s change hands all the time… Diversity: some people are more popular, some friendships are more important… Node Complexity: Endpoints have different CPUs, Windows is a complicated OS, Mobile devices … Emergent phenomena: simple end behavior  complex system-wide behavior. If we understand the basics of climate change, why is the weather so unpredictable?

10 1. Network Structure “Six degrees of Kevin Bacon” Milgram’s experiment in 1970 Watts and Strogatz Model Kleinberg’s algorithmic results Recent work on shows similarities between the structures of: Internet, WWW, p2p overlays, electric power grid, protein networks, co- authorship among scientists These networks have “evolved naturally”

11 Ring graph Fully Connected graph Random graph Power Law Graph (Degenerate: tree)

12 A Scientist’s Perspective Two important metrics –Clustering Coefficient: CC Pr(A-B edge, given an A-C edge and a C-B edge) –Path Length of shortest path Ring graph: high CC, long paths Random graph: low CC, short paths Small World Networks: high CC, short paths

13 Ring graph Random Graph Clustering Coefficient Path Length Small World Networks Convert more and more edges to point to random nodes

14 Most “natural evolved” networks are small world Network of actors  six degrees of Kevin Bacon Network of humans  Milgram’s experiment Co-authorship network  “Erdos Number” Many of these networks also “grow incrementally” [Faloutsos and Faloutsos] “Preferential” models of growth

15 Another Scientific Viewpoint That was about “nature of neighbors”; what about number of neighbors? Degree distribution – what is the probability of a given node having k edges (neighbors, friends, …) Regular graph: all nodes same degree Gaussian Random graph: Exponential Power law:

16 log (number of nodes) log (node degree=k) Basics: The Log-Log Plot Power law Exponential Heavy tailed Number of nodes with degree k is ~

17 WWW is a power law graphNCSTRL co-author graph is power law, with exponential cutoff Electric Power Grid graph is exponential Social network of Utah Mormons is Gaussian

18 Power law vs. Small World A lot of small world networks are power law graphs (Internet backbone, telephone call graph, protein networks) Not all small world networks are power law (e.g., co-author networks) Not all power law networks are small world Preferential Model for network growth generates power law distributions – special way of incremental growth –Pr of linking to a node P is proportional to P’s current degree. –e.g., Web pages linking to each other Power law networks also called scale-free

19 Power law + Small world Most nodes have small degree, but a few nodes have high degree Attacks on small world networks Killing a large number of randomly chosen nodes does not disconnect graph Killing a few high-degree nodes will disconnect graph “A few (of the many thousand) nutrients are very important to your body” “The Electric Grid is very vulnerable to attacks”

20 2. Network Dynamics Strogatz goes on to discuss dynamics of many “natural networks” We’ll focus on dynamics w.r.t. the Internet and P2P networks in the papers [Akella et al] and [Ripeanu et al] But let’s just touch a bit on oscillation dynamics in networks…

21 Networks of coupled dynamical systems If each node is a dynamical system, and is affected by its neighbors, what behaviors emerge from the entire network? E.g., Social networks, network of neurons in the brain, protein networks, … An example of emergent behavior: self- synchronization

22 A group of oscillators can self- synchronize

23 Self-Synchronizing Fireflies Synchronizing Fireflies of Malaysia and Smokies Each firefly: Is driven by an external stimulus so Can show self-sync occurs when For more details see [Strogatz’s textbook on non- linear dynamics]

24 Why the heart beats by itself Consists of a few thousand sinoatrial cells Each oscillating at its own frequency Peskin’s model: when a cell fires, all other cells have a small jump in voltage [Why does this self-synchronize?] –Think of two sinoatrial cells first For more details, see [Strogatz’s book “Sync”] time voltage fire!

25 Discussion What is one problem where a self- synchronizing system could be used to design a distributed protocol? Why is the co-authorship network different from the Internet though both follow an incremental / preferential construction?

26 A Level Up: The Internet

27 [Faloutsos et al] showed that the Internet backbone follows a power law distribution [One kind of Dynamism over such a network?]

28 [Faloutsos et al] showed that the Internet backbone follows a power law distribution [One kind of Dynamism over such a network?] Routing [Akella et al] What is the stress on Internet routers due to –Shortest path routing (“efficient”) –Policy based routing (BGP)

29 AS’s The Internet is growing How does the stress scale? Internet is a multi-level topology At the highest level, it consists of AS’s AS’s consist of subnets, then LANS, … AS-AS routing is done by BGP

30 Main Result Take a power law network (node has degree k with probability ) Shortest path routing, with ties broken by higher degree With uniform traffic model for all pairs of end nodes, maximum edge congestion grows as [Theorem 1]

31 Log-log plot again (different kind) Power-law and Tree network topologies give superlinear congestion Exponential Network has sublinear congestion [why?]

32 + Shortest-path routing has worse max edge congestion than with uniform traffic Clout Traffic Model: well-connected nodes generate more end to end traffic

33 Policy Routing Due to ISP to ISP financial contracts, AS to AS edges are either –Customer-provider edges, or –Peering edges Policy routing prefers customer  provider traffic Gives “valley free” paths: most edges are customer  provider

34 + Policy routing also gives superlinear growth, but is better than shortest path routing Clout Traffic Model: well-connected nodes generate more traffic

35 Synthetic Graphs Real Graphs Policy-based and Shortest Path Routing give similar edge congestion for the uniform traffic model

36 A Solution: Add redundant edges between selective node pairs Congestion varies linearly with n

37 Discussion Metrics: max edge congestion (why not average?) –Instability from single source could spread Think “Electric Power Grid failures” Think “self-synchronizing routers” Why is Shortest Path Routing always worse than Policy Routing? –Shortest Path Routing is supposed to be “efficient” –Outrageous Opinion: Are policies the reason why the Internet stays up and robust? Should the design of Internet be left to non-technicians?

38 Another Level Up: Applications

39 Study of Gnutella [Ripeanu et al] Gnutella –Peer to peer Overlay –Users download songs from other users, upload their own songs –Each computer host = “peer” –Completely decentralized

40 Gnutella Structure P P P P P P Servents (“Peers”) P Connected in an overlay graph Store their own files Also store “peer pointers” Queries flooded out, ttl-restricted

41 Study of Gnutella 6 month period 10/00-5/01 –Before revision of Gnutella protocol (late 2001) Characteristics –System Size –Network Traffic –Node Connectivity

42 95% of nodes in largest connected component Quick Growth over time (exponential?) Spikes

43 Churn Characteristics 40% of nodes logged in for less than 4 hours Only 25% nodes alive for more than 24 hours

44 55% ping-pong messages (membership) 36% query messages Subsequent improvements reduced these to 8%, 92%

45 95% of nodes less than 7 hops away [Implication of ttl=7 for query messages? ]

46 Traffic Volume 170 K connexns for 50 K node Gnutella 6 KBps per connexn  1 GBps total  330 TB/month 1.7% of total traffic in Internet Backbone Recall: [3/00] 25% UWisc traffic from Napster

47 Average degree of node is scale-independent On average 3.4 edges / node

48 Not power law But Heavy-tailed

49 Overlay-Network Match Does the Gnutella Overlay reflect the underlying Internet structure? Entropy technique in paper –Nodes identified with their domain names –Gnutella graph structure  clustering of nodes based on domains –Calculate entropy of above clustering and compare with entropy of a random selection of nodes from across domains –If same, Gnutella graph is random, otherwise it is more ordered Authors find Gnutella clustering entropy to be only 8% lower than random clustering entropy Gnutella structure is independent of underlying Internet [hence the term “application overlays”]

50 Discussion Do overlays really reflect the application? Are application-dependent overlays “unfriendly” to the network and other applications? What if the overlays are very large? (think PlanetLab) What if they are small and there are millions of them? (think overlay hosting services) What if they are large and many? (think overlay hosting services on top of PlanetLab-style clusters)

51 Another level Up: The Users, Humans, …

52 Summary Humans, and the networks connecting them…societal networks, actor networks, co-authorship graphs…. And we’re back full circle! We’ve discussed –Network structure –Network dynamics Many commonalities –power law, small world … among “naturally evolved” networks –social nets, metabolic nets, electric power grid, Internet, WWW,… Can look at in awe, but systems design also has to deal with it

53 Announcements Regular office hours next week (both Tue and Thu)

54 Final Report Grading Final report will be graded just like a conference reviewer would:  Importance of problem  Novelty of solution  Evaluation of solution  Clarity of Presentation  Nits (grammar, references, etc.) + Business Plan (entrepreneurial projects) Why 12 page limit? (Mark Twain) “If I had more time, I would have written a shorter letter.”

55 Semester’s Final Lecture (next Tuesday) Links from website –No reviews needed –Read as many as you can – you’ll enjoy them –None of them is technical! –By a science fiction writer, an ecologist and computer scientists Next Tuesday’s lecture (semester’s last lecture) – we’ll close the discussion we started in the semester’s first lecture Mandatory to attend