Informetric methods seminar Tutorial 2: Using Pajek for network properties Qi Yu.

Slides:



Advertisements
Similar presentations
Graphs - II Algorithms G. Miller V. Adamchik CS Spring 2014 Carnegie Mellon University.
Advertisements

Social network partition Presenter: Xiaofei Cao Partick Berg.
Lecture 5 Graph Theory. Graphs Graphs are the most useful model with computer science such as logical design, formal languages, communication network,
Network Pajek.
Edge-connectivity and super edge-connectivity of P 2 -path graphs Camino Balbuena, Daniela Ferrero Discrete Mathematics 269 (2003) 13 – 20.
Approximation Algorithms Chapter 14: Rounding Applied to Set Cover.
1 A Faster Approximation Algorithm For The Steiner Problem In Graphs Kurt Mehlhorn. Information Processing Letters, 27(3):125–128, 高等演算法二
CSC401 – Analysis of Algorithms Lecture Notes 14 Graph Biconnectivity
Some Graph Problems. LINIAL’S CONJECTURE Backgound: In a partially ordered set we have Dilworth’s Theorem; The largest size of an independent set (completely.
Knowledge Management Prof.dr.Nada Lavrač Topics: - Degree centrality/centralization - Closeness centrality/centralization - Betweenness centrality/centralization.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Approximation Algorithms Chapter 5: k-center. Overview n Main issue: Parametric pruning –Technique for approximation algorithms n 2-approx. algorithm.
Optimization of Pearl’s Method of Conditioning and Greedy-Like Approximation Algorithm for the Vertex Feedback Set Problem Authors: Ann Becker and Dan.
Author: Jie chen and Yousef Saad IEEE transactions of knowledge and data engineering.
TERMS, CONCEPTS and DATA TYPES IN GIS Orhan Gündüz.
Polynomial-Time Approximation Schemes for Geometric Intersection Graphs Authors: T. Erlebach, L. Jansen, and E. Seidel Presented by: Ping Luo 10/17/2005.
Introduction to Graphs
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Increasing graph connectivity from 1 to 2 Guy Kortsarz Joint work with Even and Nutov.
Centrality Measures These measure a nodes importance or prominence in the network. The more central a node is in a network the more significant it is to.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
Sunbelt XXIV, Portorož, Pajek Workshop Vladimir Batagelj Andrej Mrvar Wouter de Nooy.
Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution 3.0 License.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
7.1 and 7.2: Spanning Trees. A network is a graph that is connected –The network must be a sub-graph of the original graph (its edges must come from the.
Pajek – Program for Large Network Analysis Vladimir Batagelj and Andrej Mrvar.
Vertices and Edges Introduction to Graphs and Networks Mills College Spring 2012.
Digital Image Processing CCS331 Relationships of Pixel 1.
Based on slides by Y. Peng University of Maryland
Lecture 13: Network centrality Slides are modified from Lada Adamic.
Indian Institute of Technology Kharagpur PALLAB DASGUPTA Graph Theory: Introduction Pallab Dasgupta, Professor, Dept. of Computer Sc. and Engineering,
Chapter 3. Community Detection and Evaluation May 2013 Youn-Hee Han
Data Structures & Algorithms Graphs
A n = c 1 a n-1 + c2an-2 + … + c d a n-d d= degree and t= the number of training data (notes) The assumption is that the notes in the piece are generated.
1 12/2/2015 MATH 224 – Discrete Mathematics Formally a graph is just a collection of unordered or ordered pairs, where for example, if {a,b} G if a, b.
Slides are modified from Lada Adamic
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
Relation. Combining Relations Because relations from A to B are subsets of A x B, two relations from A to B can be combined in any way two sets can be.
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
Trees Thm 2.1. (Cayley 1889) There are nn-2 different labeled trees
Selected Topics in Data Networking Explore Social Networks: Center and Periphery.
Chapter 13 Backtracking Introduction The 3-coloring problem
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Chapter 9: Graphs.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Summation Notation, Percentiles and Measures of Central Tendency Overheads 3.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Visualization of Biological Information with Circular Drawings.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. Fast.
Tutorial 3: Pajek basics Qi YU.  Getting started  Data format in Pajek  Window tools in Pajek  Resources.
Importance Measures on Nodes Lecture 2 Srinivasan Parthasarathy 1.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Measures of Central Tendency (0-12) Objective: Calculate measures of central tendency, variation, and position of a set of data.
Selected Topics in Data Networking
Graph theory Definitions Trees, cycles, directed graphs.
Network analysis.
Biconnectivity SEA PVD ORD FCO SNA MIA 11/16/2018 2:31 AM
Enumerating Distances Using Spanners of Bounded Degree
Graphs.
ITEC 2620M Introduction to Data Structures
Biconnectivity SEA PVD ORD FCO SNA MIA 5/6/2019 5:08 PM Biconnectivity
Winter 2019 Lecture 11 Minimum Spanning Trees (Part II)
Biconnectivity SEA PVD ORD FCO SNA MIA 5/23/ :21 PM
Measuring Angles and Arcs
Autumn 2019 Lecture 11 Minimum Spanning Trees (Part II)
Presentation transcript:

Informetric methods seminar Tutorial 2: Using Pajek for network properties Qi Yu

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Two-mode network  One-mode network  each vertex can be related to each other vertex.  Two-mode network  vertices are divided into two sets and vertices can only be related to vertices in the other set.

Example  Suppose we have data as below:  P1: Au1, Au2, Au5  P2: Au2, Au4, Au5  P3: Au4  P4: Au1, Au5  P5: Au2, Au3  P6: Au3  P7: Au1, Au5  P8: Au1, Au2, Au4  P9: Au1, Au2, Au3, Au4, Au5  P10: Au1, Au2, Au5 *vertices "P1" 2 "P2" 3 "P3" 4 "P4" 5 "P5" 6 "P6" 7 "P7" 8 "P8" 9 "P9" 10 "P10" 11 "Au1" 12 "Au2" 13 "Au3" 14 "Au5" 15 "Au5" *edgeslist See two_mode.net

Transforming to valued networks  The network is transformed into an ordinary network, where the vertices are elements from the first subset, using  “Net>Transform>2-Mode to 1-Mode>Rows”.  If we want to get a network with elements from the second subset we use  “Net>Transform>2-Mode to 1-Mode>Columns”.  Network with or without loops can be generated:  “Net>Transform>2-Mode to 1-Mode>Include Loops”.  We store values of loops into vector using “ Net>Vector>Get Loops” and use later this vector to determine  We can generate network with multiple lines – for each common event a line between corresponding two women:  “Net>Transform>2-Mode to 1-Mode>Multiple Line”.

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Basic information about a network  Basic information can be obtained by “Info>Network>General” which is available in the main window of the program. We get  number of vertices  number of arcs, number of directed loops  number of edges, number of undirected loops  density of lines  Additionally we must answer the question:  Input 1 or 2 numbers: +/highest, -/lowest where we enter the number of lines with the highest/lowest value or interval of values that we want to output.  If we enter 10, 10 lines with the highest value will be displayed. If we enter -10, 10 lines with the lowest value will be displayed. If we enter 3 10, lines with the highest values from rank 3 to 10 will be displayed.

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Global and local views on network

 Local view is obtained by extracting sub-network induced by selected cluster of vertices.  Global view is obtained by shrinking vertices in the same cluster to new (compound) vertex. In this way relations among clusters of vertices are shown.  Combination of local and global view is contextual view: Relations among clusters of vertices and selected vertices are shown.

Example  Import and export in 1994 among 80 countries are given. They is given in 1000$. (See Country_Imports.net)See Country_Imports.net  Partition according to continents (see Country_Continent.clu)see Country_Continent.clu  1 – Africa, 2 – Asia, 3 – Europe, 4 – N. America, 5 – Oceania, 6 – S. America.  Operations>Extract from Network>Partition  Operations>Shrink Network>Partition

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Degrees  In Pajek degrees are computed using “Net>Partitions>Degree” and selecting Input, Output or All.  Vertices with the highest degree can be displayed using Info>Partition.  For smaller networks the result can be displayed by:  double clicking the partition window, or  selecting “File>Partition>Edit” or  selecting the corresponding icon.  Normalized degree can be found in the vector window.

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Centrality and Centralization  Degree centrality is available in “Net>Partitions>Degree”;  Closeness centrality can be found in “Net>Vector>Centrality>Closeness”.  Betweenness centrality can be found in “Net>Vector>Centrality>Betweenness”.  When computing centrality according to degree and closeness we must additionally select Input, Output or All.  In the report window the network centralization index is given.  List of the selected number of most central vertices can be obtained by “Info>vector”.

Centralization  Centralization expresses the extent to which a network has a center.  A network is more centralized if the vertices vary more with respect to their centrality. More variation in the centrality scores of vertices yields a more centralized the network.  Take degree centralization for example:  Degree centralization of a network is the variation in the degrees of vertices divided by the maximum degree variation which is possible in a network of the same size. (Variation is the summed (absolute) differences between the centrality scores of the vertices and the maximum centrality score among them) Star- and line-networks.

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Component  Strongly connected components are computed using “Net>Components>Strong”  Weakly connected components using “Net>Components>Weak”.  Result is represented by a partition  vertices that belong to the same component have the same number in the partition.  Example  See component.net

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Biconnected components  A cut-vertex is a vertex whose deletion increases the number of components in the network.  A bi-component is a component of minimum size 3 that does not contain a cut-vertex.  To compute bicomponents use: “Net>Components>Bi-components”  Biconnected components are stored to hierarchy.  Example  See bicomponent.net

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Cores  A subset of vertices is called a k-core if every vertex from the subset is connected to at least k vertices from the same subset.  Cores can be computed using “Net>Partitions>Core” and selecting Input, Output or All core.  Result is a partition: for every vertex its core number is given.  In most cases we are interested in the highest core(s) only. The corresponding subnetwork can be extracted using “Operations>Extract from Network>Partition” and typing the lower and upper limit for the core number.  Example  See k_core.net

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Slice  An m-slice is a maximal subnetwork containing the lines with a multiplicity equal to or greater than m and the vertices incident with these lines.  M-slice can be calculated in Pejek by following:  1. select “Use max instead of sum” in option “Net>Partitions>Valued Core”;  2. select “Net>Partitions>Valued Core>First Threshold and Step”  Result is a partition with class numbers corresponding to the highest m-slice each vertex belongs to.  Example  See metformin_nonloop.net

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Diameter  Diameter means the length of the longest shortest path in network.  Calcultation: “net>Paths between 2 vertices>diameter”  Full search is performed, so the operation may be slow for very large networks (number of vertices larger than 2000)  Result is – the length of the longest shortest path in network and corresponding two vertices  Then we can find all the longest shortest path between above two vertices from “net>Paths between 2 vertices> All Shortest”

Content  Two mode network  Basic information about a network  Global and local views on network  Degrees  Centrality and Centralization  Component  Biconnected components  Cores  Slice  Diameter  Clustering Coefficients

Clustering Coefficients