The Structure of Networks with emphasis on information and social networks T-214-SINE Summer 2011 Chapter 2 Ýmir Vigfússon.

Slides:



Advertisements
Similar presentations
CSE 211 Discrete Mathematics
Advertisements

Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
IKI 10100: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100: Lecture10.
The Structure of Networks with emphasis on information and social networks T-214-SINE Summer 2011 Chapter 3 Ýmir Vigfússon.
Graph A graph, G = (V, E), is a data structure where: V is a set of vertices (aka nodes) E is a set of edges We use graphs to represent relationships among.
Graphs Chapter 20 Data Structures and Problem Solving with C++: Walls and Mirrors, Carrano and Henry, © 2013.
Chapter 8, Part I Graph Algorithms.
Data Structure and Algorithms (BCS 1223) GRAPH. Introduction of Graph A graph G consists of two things: 1.A set V of elements called nodes(or points or.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 CHAPTER 4 - PART 2 GRAPHS 1.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
CS 206 Introduction to Computer Science II 11 / 11 / Veterans Day Instructor: Michael Eckmann.
Graph & BFS.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Graph COMP171 Fall Graph / Slide 2 Graphs * Extremely useful tool in modeling problems * Consist of: n Vertices n Edges D E A C F B Vertex Edge.
Graph & BFS Lecture 22 COMP171 Fall Graph & BFS / Slide 2 Graphs * Extremely useful tool in modeling problems * Consist of: n Vertices n Edges D.
CSE 222 Systems Programming Graph Theory Basics Dr. Jim Holten.
CSE 321 Discrete Structures Winter 2008 Lecture 25 Graph Theory.
Graphs Chapter 20 Data Structures and Problem Solving with C++: Walls and Mirrors, Frank Carrano, © 2012.
Chapter 9: Graphs Basic Concepts
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Peer-to-Peer and Social Networks Introduction. What is a P2P network Uses the vast resource of the machines at the edge of the Internet to build a network.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
GRAPH Learning Outcomes Students should be able to:
Cloud and Big Data Summer School, Stockholm, Aug., 2015 Jeffrey D. Ullman.
Lecture 13 Graphs. Introduction to Graphs Examples of Graphs – Airline Route Map What is the fastest way to get from Pittsburgh to St Louis? What is the.
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Graph Theory in Computer Science
Representing and Using Graphs
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Murtaza Abbas Asad Ali. NETWORKOLOGY THE SCIENCE OF NETWORKS.
Graphs. Definitions A graph is two sets. A graph is two sets. –A set of nodes or vertices V –A set of edges E Edges connect nodes. Edges connect nodes.
Graphs Discrete Structure CS203. Adjacency Matrix We already saw a way of representing relations on a set with a Boolean matrix: R digraph(R) M R1234.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
The Structure of the Web. Getting to knowing the Web How big is the web and how do you measure it? How many people use the web? How many use search engines?
Data Structures CSCI 132, Spring 2014 Lecture 38 Graphs
Graphs A graphs is an abstract representation of a set of objects, called vertices or nodes, where some pairs of the objects are connected by links, called.
How Do “Real” Networks Look?
CS:4980:0005 Peer-to-Peer and Social Networks Fall 2015 Introduction.
1 Directed Graphs Chapter 8. 2 Objectives You will be able to: Say what a directed graph is. Describe two ways to represent a directed graph: Adjacency.
Graphs Slide credits:  K. Wayne, Princeton U.  C. E. Leiserson and E. Demaine, MIT  K. Birman, Cornell U.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Graphs and Paths : Chapter 15 Saurav Karmakar
Class 2: Graph Theory IST402.
Analyzing Networks. Milgram’s Experiments “Six degrees of Separation” Milgram’s letters to various recruits in Nebraska who were asked to forward the.
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Programming Abstractions Cynthia Lee CS106B. Upcoming Topics Graphs! 1.Basics  What are they? How do we represent them? 2.Theorems  What are some things.
Graphs David Kauchak cs302 Spring Admin HW 12 and 13 (and likely 14) You can submit revised solutions to any problem you missed Also submit your.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Lecture 20. Graphs and network models 1. Recap Binary search tree is a special binary tree which is designed to make the search of elements or keys in.
Introduction to Graph & Network Theory Thinking About Networks: From Metabolism to the Genome to Social Conflict Summer Workshop for Teachers June 27 th.
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
How Do “Real” Networks Look?
Community detection in graphs
CSE 5245 Lecture 1 Srinivasan Parthasarathy
The Structure of Information Networks
Network Science: A Short Introduction i3 Workshop
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Why Social Graphs Are Different Communities Finding Triangles
Graphs Chapter 11 Objectives Upon completion you will be able to:
How Do “Real” Networks Look?
Graphs Discrete Structure CS203.
Chapter 9: Graphs Basic Concepts
Graph Theory By Amy C. and John M..
Graph and Link Mining.
Chapter 9: Graphs Basic Concepts
GRAPHS.
Presentation transcript:

The Structure of Networks with emphasis on information and social networks T-214-SINE Summer 2011 Chapter 2 Ýmir Vigfússon

Graph theory We will develop some basics of graph theory ◦ This provides a unifying language for network structure We begin with central definitions ◦ Then consider the implications and applications of these concepts

Graph theory A graph or a network is a way to specify relationships amongst a collection of items Def: A graph consists of ◦ Set of objects: nodes ◦ Pairs of objects: edges Def: Two nodes are neighbors if they are connected by an edge

Graph orientation Both graphs have 4 nodes (A,B,C,D) and 4 edges The left graph is undirected ◦ Edges have no orientation (default assumption) The right graph is directed ◦ Edges have an orientation, e.g. edge from B to C

Weighted graphs Edges may also carry additional information ◦ Signs (are we friends or enemies?) ◦ Tie strength (how good are we as friends?) ◦ Distance (how long is this road?) ◦ Delay (how long does the transmission take?) Def: In a weighted graph, every edge has an associated number called a weight. Def: In a signed graph, every edge has a + or a – sign associated with it.

Graph representations Abstract graph theory is interesting in itself But in network science, items typically represent real-world entities ◦ Some network abstractions are very commonly used Several examples (more later) ◦ Communication networks  Companies, telephone wires ◦ Social networks  People, friendship/contacts ◦ Information networks  Web sites, hyperlinks ◦ Biological networks  Species, predation (food web). Or metabolic pathways

ARPANET: Early Internet precursor December 1970 with13 nodes

Graph representation Only the connectivity matters ◦ Could capture distance as weights if needed

Transportation networks Graph terminology often derived from transportation metaphors ◦ E.g. “shortest path“, “flow“, “diameter“

A structural network The physics of rigidity theory is applied at every network node to design stable structures

An electrical network The physics of Kirchoff‘s laws is applied at every network node and closed paths to design circuits

Graph concepts “Graph theory is a terminological jungle in which every newcomer may plant a tree“ (Social scientist John Barnes) We will focus on the most central concepts ◦ Paths between nodes ◦ Cycles ◦ Connectivity ◦ Components (and giant component) ◦ Distance (and search)

Paths Things often travel along the edges of a graph ◦ Travel ◦ Information ◦ Physical quantities Def: A path is sequence of nodes with the property that each consecutive pair in the sequence is connected by an edge ◦ Can also be defined as a sequence of edges

Paths MIT – BBN – RAND – UCLA is a path

Paths You could also traverse the same node multiple times ◦ SRI – STAN – UCLA – SRI – UTAH – MIT This is called a non-simple path Paths in which nodes are not repeated are called simple paths

Cycles These are ring structures that begin and end in the same node ◦ LINC – CASE – CARN – HARV – BBN – MIT – LINC is a cycle Def: A cycle is a closed path with at least three edges In the 1970 ARPANET, every edge is on a cycle ◦ By design. Why?

Connectivity Can every node in a graph be reached from any other node through a path? ◦ If so, the graph is connected The 1970 ARPANET graph is connected In many cases, graphs may be disconnected ◦ Social networks ◦ Collaboration networks ◦ etc.

Connectivity Is this graph connected? What about now? ◦ A,B not connected to other nodes ◦ C,D,E not connected to other nodes

Components If a graph is not connected, it tends to break into pieces that themselves are connected Def: The connected components of an undirected graph are groups of nodes with the property that the groups are connected, and no two groups overlap More precisely: ◦ A connected component is a subset of nodes such that (1) every node has a path to every other node in the subset, and (2) the subset is not a part of a larger set with the property that every node can reach another

Components Three connected components ◦ {A,B}, {C,D,E}, {F,G,...,M}

Components: Analysis A first global way to look at graph structure ◦ For instance, we can understand what is holding a component together Real-world biology collaboration graph

Components: Analysis Analyzing graphs in terms of densely connected regions and the boundaries of regions ◦ For instance, only include edges with weights above a threshold, then gradually increase the threshold ◦ The graph will fragment into more and more components We will later see how this becomes an important type of analysis

Giant components Many graphs are not connected, but may include a very large connected components ◦ E.g. the financial graph from last lecture, or the hyperlink graph of the web Large complex networks often contain a giant component ◦ A component that holds a large percentage of all nodes Rare that two or more of these will exist in a graph. Why?

Romantic liasons in a high school Existence of giant component means higher risk of STDs (the object of study) Giant component

Distance Def: The distance between a pair of nodes is the edge length of the shortest path between them ◦ Just number of edges. Can be thought of as all edges having weight of 1 What‘s the distance between MIT and SDC?

Distance Q: Given a graph, how do we find distances between a given node and all other nodes systematically? ◦ Need to define an algorithm! How would you approach this problem?

Distance: Breadth-first search (BFS) From the given node (root) ◦ Find all nodes that are directly connected  These are labeled as “distance 1“ ◦ Find all nodes that are directly connected to nodes at distance 1  If these nodes are not at distance 1, we label them as “distance 2“ ◦... ◦ Find all nodes that are directly connected to nodes at distance j  If these nodes are not already of distance at most j+1, we label them as „distance j+1“

Distance: Breadth-first search (BFS)

Six degrees of separation Explained thoroughly in the video we saw First experiment done by Stanley Milgram in 1960s (research budget $680) ◦ 296 randomly chosen starters. Asked to send a letter to a target, by forwarding to someone they know personally and so on. Number of steps counted. Hypothesis: The number of steps to connect to anyone in a typical large-scale network is surprisingly small ◦ We can use BFS to check!

Six degrees of separation Milgram found a median hop number of 6 for successful chains – six degrees of separation ◦ This study has since been largely discredited

Six degrees of separation Modern experiment by Leskovec and Horvitz in 2008 Look at the 240 million user accounts of Microsoft Instant Messenger Complete snapshop (MS employees) Found a giant component with very small distances A random sample of 1000 users were tested ◦ Why do they look only at a sample?

Six degrees of separation Estimated average distance of 6.6, median of 7

Six degrees of geekiness Co-authorship graph centered on Paul Erdös

Network data sets Chapter 2 includes an overview of some massive data sets and networks Collaboration graphs ◦ Wikipedia, World of Warcraft, Citation graphs Who-talks-to-whom graphs ◦ Microsoft IM, Cell phone graphs Information linkage ◦ Hyperlinks Technological networks ◦ Power grids, communication links, Internet Natural and biological networks ◦ Food webs, neural interconnections, cell metabolism

Network data sets Leskovec‘s SNAP at Stanford has a repository of large-scale networks ◦ I also have a few more data sets that could be appropriate for the group project Keep thinking about whether there are some cool data sets that you might have access to...

Recap A graph consists of nodes and edges Graphs can be directed or undirected, weighted, signed or unweighted ◦ Paths between nodes (simple vs. non-simple) ◦ Cycles ◦ Connectivity ◦ Components (and the giant component) ◦ Distance (and BFS) Six degrees of separation can be checked with BFS