Web Intelligence Complex Networks I This is a lecture for week 6 of `Web Intelligence Example networks in this lecture come from a fabulous site of Mark.

Slides:



Advertisements
Similar presentations
Scale Free Networks.
Advertisements

Web Intelligence Text Mining, and web-related Applications
Analysis and Modeling of Social Networks Foudalis Ilias.
Web as Network: A Case Study Networked Life CIS 112 Spring 2010 Prof. Michael Kearns.
Project Ideas slides modified from Eileen Kraemer and David P. Feldman.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
Mining and Searching Massive Graphs (Networks)
Networks FIAS Summer School 6th August 2008 Complex Networks 1.
Network Statistics Gesine Reinert. Yeast protein interactions.
Spanning Trees.
Spanning Trees. 2 Spanning trees Suppose you have a connected undirected graph Connected: every node is reachable from every other node Undirected: edges.
Mining and Searching Massive Graphs (Networks) Introduction and Background Lecture 1.
Advanced Topics in Data Mining Special focus: Social Networks.
CSE 321 Discrete Structures Winter 2008 Lecture 25 Graph Theory.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
CS/ENGRD 2110 Object-Oriented Programming and Data Structures Fall 2014 Doug James Lecture 17: Graphs.
Chapter 9: Graphs Basic Concepts
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Minimum Spanning Tree in Graph - Week Problem: Laying Telephone Wire Central office.
C o n f i d e n t i a l HOME NEXT Subject Name: Data Structure Using C Unit Title: Graphs.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Graph Theory in 50 minutes. This Graph has 6 nodes (also called vertices) and 7 edges (also called links)
Intro to Graphs CSIT 402 Data Structures II. CSIT 402 Graph Introduction2 Graphs Graphs are composed of ›Nodes (vertices) Can be labeled ›Edges (arcs)
Graphs Chapter 10.
Lecture 13 Graphs. Introduction to Graphs Examples of Graphs – Airline Route Map What is the fastest way to get from Pittsburgh to St Louis? What is the.
Chapter 2 Graph Algorithms.
Distributed Computing Rik Sarkar. Distributed Computing Old style: Use a computer for computation.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
Graphs. 2 Graph definitions There are two kinds of graphs: directed graphs (sometimes called digraphs) and undirected graphs Birmingham Rugby London Cambridge.
GRAPHS THEROY. 2 –Graphs Graph basics and definitions Vertices/nodes, edges, adjacency, incidence Degree, in-degree, out-degree Subgraphs, unions, isomorphism.
Complex Networks Measures and deterministic models Philippe Giabbanelli.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Graphs A ‘Graph’ is a diagram that shows how things are connected together. It makes no attempt to draw actual paths or routes and scale is generally inconsequential.
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Copyright © Curt Hill Graphs Definitions and Implementations.
Graphs and Paths : Chapter 15 Saurav Karmakar
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
“Important” Vertices and the PageRank Algorithm Networked Life NETS 112 Fall 2014 Prof. Michael Kearns.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Scale-free and Hierarchical Structures in Complex Networks L. Barabasi, Z. Dezso, E. Ravasz, S.H. Yook and Z. Oltvai Presented by Arzucan Özgür.
Chapter Chapter Summary Graphs and Graph Models Graph Terminology and Special Types of Graphs Representing Graphs and Graph Isomorphism Connectivity.
Graph clustering to detect network modules
Lecture 23: Structure of Networks
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Graphs Lecture 19 CS2110 – Spring 2013.
Basic Concepts Graphs For more notes and topics visit:
Topics In Social Computing (67810)
Lecture 1: Introduction CS 765: Complex Networks
Applications of graph theory in complex systems research
Community detection in graphs
Lecture 23: Structure of Networks
Section 8.6 of Newman’s book: Clustering Coefficients
Graphs All tree structures are hierarchical. This means that each node can only have one parent node. Trees can be used to store data which has a definite.
Network Science: A Short Introduction i3 Workshop
Chapter 9: Graphs Basic Concepts
Project Ideas slides modified from Eileen Kraemer and David P. Feldman.
Analyzing Massive Graphs - ParT I
Chapter 9: Graphs Basic Concepts
Presentation transcript:

Web Intelligence Complex Networks I This is a lecture for week 6 of `Web Intelligence Example networks in this lecture come from a fabulous site of Mark Newman, U of Michigan:

This part of the course: WI whenwhatwhywhat else Sep 27 Complex Networks I The WWW is a huge complex network. Many other networks are overlaid upon it. Networks have important and interesting properties, re: speed of information spread, robustness, speed and quality of search, etc. Assignment 1 – worth 70% of CW Oct 4 Complex Networks II Assignment 2 – worth 15% of CW Oct 11 Web Search: How google works Search, obviously, is central to web intelligence Assignment 3 – worth 15% of CW Nov 8 Text mining and knowledge discovery from the WWW Towards inferring useful new knowledge automatically; also for better search, for non marked-up web Reading Nov 15 Web communities and cultural models Understanding how the web influences the formation and behaviour of groups, and the spread of information Reading

Introductory Points Graphs and networks are of central importance to us, because: The web is a large and complex network Major phenomena that underpin our existence, such as how information spreads, how diseases develop, how economies evolve, are best viewed mathematically as networks. Networks have structural properties and behaviour. When we analyse the structure of a network, we can reveal important clues about its behaviour. E.g. Predict how fast a virus, or rumour will spread on the web Assess which are the most authoritative web sites Predict how long it will take to search sections of the web Predict how robust to damage an area of the www is, or a cellular process is, etc.

This Week’s Material Basic Intro to graphs and networks, terminology, and so on. The interesting properties of real-world networks. Metrics and other structural properties that are currently used to analyse both the www and other networks. To support the understanding of metrics and properties, this week we cover basics of graphs and networks.

The very basics A graph is a set of two things: G = {V, E} V = a set of vertices (also called nodes) e.g. V = {A, B, C, D} E = a set of edges (also called arcs, or links) e.g. E = { {A,C}, {A,D}, {B,C}, {B, D} } in which each edge is a set of two vertices from V This graph is: AB CD

The very basics II An undirected edge between A and B: {A, B} (or {B, A}) A directed edge between A and B: (A, B) A loop at A: {A, A} or (A, A) A AB AB In an undirected graph, all edges are undirected. In a directed graph, all edges are directed.

The very basics III The degree of a node, in an undirected graph, is the number of edges attached to it. In this one, the degrees are: A: 2 B: 3 C: 3 D: 3 E: 0 F: 1 G: 2 What is the mean degree of this graph? AB CD F E G

The very basics IV Nodes in directed graphs have in-degrees and out-degrees. Here: Node: in,out as follows: A: 1, 2 B: 1, 2 C: 2, 1 D: 2, 2 E: 1, 1 F: 1,2 G: 0, 2 A directed graph without cycles is called a DAG. Is this a DAG? AB CD F E G

The very basics V This is an unlabelled graph. It is exactly the same as (isormorphic to) this one: This is a labelled graph. homepage teachingresearch graphs homepage teaching graphs research Since labels and links have meaning, this one is different:

Diversity of graphs: considering only loop-free graphs How many different 2-node, labelled undirected graphs are there? How many different 2-node, labelled directed graphs are there? How many different 3-node, labelled undirected graphs are there? Suppose there are G(k) possible undirected labelled graphs on k nodes. Whenever we add one extra node to an und. Lab. graph on k nodes: Any subset of the k existing nodes could link to it, and there are 2 k such subsets. So the number of possible und. lab. graphs on k+1 nodes is 2 k times what it is on k nodes.

Example numbers for undirected labelled graphs Size of graphNumber of possible graphs 5 nodes nodes35,184,372,088, nodes 1.6  nodes 1.3  nodesa lot.

More basics If there is a path in the graph from each node to every other, the Graph is connected, else it is unconnected. This one? AB CD F E G

More basics II Most graphs of interest and importance are far from complete – they tend to be called sparse. AB CD The complete (undirected) graph on n nodes is the graph that contains all n(n  1/)/2 possible edges. Is this one complete? Think about the following graphs: 1: Nodes = students in this university; Edge {A,B} exists if A and B have the same birthday. 2. Nodes = web pages: Edge (A,B) exists if A links to B. 3. Nodes = types of molecules in our bloodstream, Edge(A,B) exists if A interacts with B. 4. Nodes = all living humans. Edge{A,B} exists if A and B have ever shaken hands.

More Structural Properties Diameter: length of the longest path between any two nodes Number of components: in undirected graphs Degree distribution: An interesting and important fingerprint of a graph that we will see more of. Modularity: A graph is highly modular if it has several clusters of nodes with many links within the clusters, but few links between the clusters. Hierarchical modularity. A graph seems to be hierarchically modular if it is modular, as above, but the modules are themselves modular.

Some Networks One of these is a network of protein interactions in yeast. The other is a visualisation of an outbreak of TB. What do the nodes and edges represent? And … which is which?

Is this: spread of HIV infection (node = person / link = HIV transfer) or is it: books about politics (node = book / link = one mentions the other)

Notice how the book network is polarised

The internet

Assignment 1 Read: Exploring Complex Networks, by Steven Stroglatz, Nature 410, 268—276 Write: A 500-word `executive summary’ of most of this article. Leave out Box 1, and the section “Regular networks of coupled dynamical systems”, restart at “Complex network architectures”. AND Write: A 100-word account of what you assess to be the three main points conveyed by this article Write: A 200-word essay about the relevance of those points to the topic of your BSc or MSc (e.g. relevance to AI; relevance to IT(Business), etc..) Word limits in this assignment are important; over the limits means losing marks

Marking 30% of the marks: completeness and readability 30% of the marks: evidence of understanding the article, and generally making sense 30% of the marks: clarity of your arguments 10% of the marks: for making me say “Wow”

Next week Much more advanced, about: Degree distributions Cluster Co-efficients Modularity and hierarchy Random networks vs real networks Some basic graph algorithms Another article, much smaller, to read.