Analyzing Massive Graphs - ParT I

Slides:



Advertisements
Similar presentations
Network Overview Discovery and Exploration for Excel (NodeXl) Hands On Exercise Presented by: Samer Al-khateeb Class: Social Media Mining and Analytics.
Advertisements

Network biology Wang Jie Shanghai Institutes of Biological Sciences.
CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1.
Analysis and Modeling of Social Networks Foudalis Ilias.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Signaling pathways –SPIKE Network visualization.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Final Presentation Undergraduate Researchers: Graduate Student Mentor: Faculty Mentor: Jordan Cowart, Katie Allmeroth Krist Culmer Dr. Wenjun Zeng Investigating.
Chapter 11 Graphs and Trees This handout: Terminology of Graphs Eulerian Cycles.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Leveraging Big Data: Lecture 11 Instructors: Edith Cohen Amos Fiat Haim Kaplan Tova Milo.
Department of Computer Science, University of California, Irvine Site Visit for UC Irvine KD-D Project, April 21 st 2004 The Java Universal Network/Graph.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Graph Theory in 50 minutes. This Graph has 6 nodes (also called vertices) and 7 edges (also called links)
Introduction to Information Retrieval CS 5604: Information Storage and Retrieval ProjCINETViz by Maksudul Alam, S M Arifuzzaman, and Md Hasanuzzaman Bhuiyan.
Lecture 12: Network Visualization Slides are modified from Lada Adamic, Adam Perer, Ben Shneiderman, and Aleks Aris.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
BIological NetwOrk Manager Cytoscape plugin Andrei Zinovyev Institut Curie/INSERM/Ecole de Mines, UMR 900 “Computational Systems Biology of Cancer”
1/25 Visualizing Social Networks Ryan Yee. 2/25 Plan Introduction and terminology Vizster NodeTrix MatLink Applications to Multi-agent systems.
Web Intelligence Complex Networks I This is a lecture for week 6 of `Web Intelligence Example networks in this lecture come from a fabulous site of Mark.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Week 1 – Introduction to Graph Theory I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Tools for Visualizing Networks Dr. Frank McCown Intro to Web Science Harding University This work is licensed under a Creative Commons Attribution-NonCommercial-
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Informatics tools in network science
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Design and Analysis of Algorithms Introduction to graphs, representations of a graph Haidong Xue Summer 2012, at GSU.
S OCIAL N ETWORK A NALYSIS F OR D UMMIES Y ANNE B ROUX DH S UMMER S CHOOL L EUVEN, S EPTEMBER
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
Prepare Launch Spyder Open Spyder (C:/Anaconda2/Scripts/spyder) Or Open from launcher (C:/Anaconda2/Scripts/launcher) Download data
Graph clustering to detect network modules
Lecture 23: Structure of Networks
Centralities (Gephi and Python)
Cohesive Subgraph Computation over Large Graphs
Analyzing, Visualizing
Groups of vertices and Core-periphery structure
Central nodes (Python and Gephi).
Tutorial: Big Data Algorithms and Applications Under Hadoop
Biological networks CS 5263 Bioinformatics.
Visualizing Social Networks
Tutorial 12 Biological networks.
Visualizing Social Networks
Comparison of Social Networks by Likhitha Ravi
Gephi Gephi is a tool for exploring and understanding graphs. Like Photoshop (but for graphs), the user interacts with the representation, manipulate the.
Community detection in graphs
Lecture 23: Structure of Networks
Structural Properties of Networks: Introduction
Network Visualization
Visualizing Social Networks
Network Science: A Short Introduction i3 Workshop
Apache Spark & Complex Network
Why Social Graphs Are Different Communities Finding Triangles
Graphs Chapter 11 Objectives Upon completion you will be able to:
Analyzing, Visualizing
Overview of big data tools
Central Nodes (Python and Gephi).
Network Software Overview.
Gephi.
Graphs.
Lecture 23: Structure of Networks
Network biology An introduction to STRING and Cytoscape
Modelling and Searching Networks Lecture 2 – Complex Networks
Centralities Using Gephi and Python Prof. Ralucca Gera,
Algorithms Lecture # 27 Dr. Sohail Aslam.
Chapter 10 Graphs and Trees
Collecting, Analyzing, and Visualizing Data with Python Part I
Collecting, Analyzing, and Visualizing Data with Python Part II
Presentation transcript:

Analyzing Massive Graphs - ParT I Dr. Michael Fire

Graphs/Networks are Everywhere Some examples: Online Social Networks Computer Networks Financial Transactions Networks Protein Networks Neuron Networks Animal Social Networks

Networks/Graphs are Well-Studied Due to their cross-disciplinary usefulness today, there are: Numerous graph algorithms Many tools/frameworks to analyze graphs Many tools to visualize graphs igraph Networkx

Graphs - The Basics A graph consists of a set of vertices (nodes), and a set of links (edges). The links connect vertices. There are direct graphs, like Twitter, and undirected graphs, like Facebook.

Practical Network Analysis There are countless interesting things that we can learn from graphs: Graph Algorithms and Their Complexity Network Evolution Models Dynamic Networks Spreading Phenomena Temporal Networks Motifs In today’s lecture, we are going to learn how to use networks as data structures to analyze and infer insights from data

Network Analysis Tools Networkx - a Python Library for graph analysis iGraph - a library collection for creating and manipulating graphs and analyzing networks. It is written in C and also exists as Python and R packages. NodeXL - NodeXL Basic is an open-source template that makes it easy to explore network graphs NetworKit - NetworKit is an open-source tool suite for high-performance network analysis. Its aim is to provide tools for the analysis of large networks. SGraph - A scalable graph data structure. The SGraph provides flexible vertex and edge query functions, and seamless transformation to and from an SFrame object

Networkx Pros: Easy and fun to use Mature library Many implemented graph algorithms Vertices can be anything and edges can store attributes Easy to visualize graphs and to save them in many formats Cons: Doesn’t work well with large graphs Slow

iGraph Pros: Mature library Many implemented graph algorithms Pretty fast Cons: Little tricky to use Doesn’t work well with very large graphs

SGraph Pros: Can work with very large graphs (billions of links) Part of the TuriCreate Eco-system (works great with SFrame) Cons: Relatively little functionality

Network Visualization Tools Cytoscape is an open source bioinformatics software platform for visualizing molecular interaction networks. Cytoscape can be used to visualize and analyze network graphs of any kind. Yeast Protein–protein/Protein–DNA interaction network visualized by Cytoscape.

Network Visualization Tools Gephi is an open-source network analysis and visualization software package written in Java

Network Visualization Tools D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. We can use D3.js to dynamically visualize networks

Some Terminology There are many interesting vertex/links/networks properties that are widely used in the context of analyzing networks. Here is a short list of terms we will need for the following examples: Degree - the number of edges incident to a vertex Clustering Coefficient - a measure of the degree to which nodes in a graph tend to cluster together. Betweenness centrality - a centrality measure which quantifies the number of times a vertex/edge acts as a bridge along the shortest path between two other nodes/edges. Modularity - measures the strength of division of a graph into modules (also called groups, clusters or communities) Strongly Connected Component - a maximal group of vertices that are mutually reachable Weakly Connected Component - a maximal group of vertices that are mutually reachable by violating the edge directions. Common Friends - Given two vertices u and v, the common friends of u and v is the size of the common set of friends that both u and v possess.

Some Useful Algorithms Community Detection Shortest Path Closeness Centrality PageRank k-Core Decomposition

Let’s move to running code using Jupyter Notebook

Recommended Read: Network Science, Class 1 Introduction, by Albert László Barabási Top 30 Social Network Analysis and Visualization Tools by Devendra Desale k-Core Decomposition: A tool for the visualization of Large Scale Networks by Alvarez-Hamelin, José Ignacio, et al.