Network Visualization

Slides:



Advertisements
Similar presentations
Sarah Reonomy OSCON 2014 ANALYZING DATA WITH PYTHON.
Advertisements

Visualization of Graph Data CS 4390/5390 Data Visualization Shirley Moore, Instructor October 6,
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
Computer Science 112 Fundamentals of Programming II Introduction to Graphs.
Understand Computer Fundamentals Unit Objective: 2.01B Computer Software.
CSE Introduction to Computing Concepts. Outline  What is an application program?  What is Excel?  Creating a Simple Workbook  Writing Formulas.
4 Chapter Four Introduction to HTML. 4 Chapter Objectives Learn basic HTML commands Discover how to display graphic image objects in Web pages Create.
EXCEL Intro to Microsoft Excel. Objectives for the Week Content ObjectivesLanguage Objectives I can create and manipulate charts, graphs, and reports.
Creating A Worksheet and Embedded Chart Chapter 1.
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
Python for: Data Science. Python  Python is an open source scripting language.  Developed by Guido Van Rossum in late 1980s  Named after Monty Python.
Visualizing Large Dynamic Digraphs Michael Burch.
Network Information: Manipulation, Sharing, and Visualization Dr. Greg Bernstein Grotto Networking
DATA MINING Pandas. Python Data Analysis Library A library for data analysis of (mostly) tabular data Gives capabilities similar to Excel and SQL but.
Practical Kinetics Exercise 0: Getting Started Objectives: 1.Install Python and IPython Notebook 2.print “Hello World!”
Python & NetworkX Youn-Hee Han
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Design and Analysis of Algorithms Introduction to graphs, representations of a graph Haidong Xue Summer 2012, at GSU.
Tutorial 1 Getting Started with Adobe Dreamweaver CS5.
Chapter 7 Computer-Aided Design and Drafting in Architecture.
From digital to craft: How to make a data matrix with SNS data There are many applications that allow its users to visualize different networks directly.
Subject Four Graphs Data Structures. What is a graph? A data structure that consists of a set of nodes (vertices) and a set of edges that relate the nodes.
B121 Chapter 6 Effective Ways of Displaying Information.
How to Get Started With Python
NoSQL: Graph Databases
Neo4j: GRAPH DATABASE 27 March, 2017
Deep Neural Networks: A Hands on Challenge
NoSQL: Graph Databases
Python for data analysis Prakhar Amlathe Utah State University
PyTimber & CO M. Betz, R. De Maria, M. Fitterer, C. Hernalsteens, T. Levens Install: $ pip install pytimber Sources:
CSC391/691 Intro to OpenCV Dr. Rongzhong Li Fall 2016
Getting Started with Plot.ly
Tutorial: Big Data Algorithms and Applications Under Hadoop
Basic Concepts Graphs For more notes and topics visit:
Mark V. Janikas Marjean Pobuda
TU170 Learning online and computing with confidence
DATA MINING Python.
Matrix Representation of Graph
Prepared by Kimberly Sayre and Jinbo Bi
System And Application Software
Visual Perception.
MANAGING DATA RESOURCES
IST256 : Applications Programming for Information Systems
Multi-Dimensional Data Visualization
Brief Intro to Python for Statistics
Unit# 6: ICT Applications
Presentation, layout and labeling
Graphs All tree structures are hierarchical. This means that each node can only have one parent node. Trees can be used to store data which has a definite.
Chapter 9: Graphs Basic Concepts
CSCI N207 Data Analysis Using Spreadsheet
Not long ago, trained specialists were required to perform many of the operations you can now do with a microcomputer. Competent end users need to understand.
NASA/ Johnson Space Center
How to make your paper professional and elegant
Statistics for the Social Sciences
Statistics for the Social Sciences
Tutorial 7 – Integrating Access With the Web and With Other Programs
Simulation And Modeling
Graphs G = (V, E) V are the vertices; E are the edges.
Python and REST Kevin Hibma.
Python for Data Analysis
Introduction To MATLAB
Collecting, Analyzing, and Visualizing Data with Python Part I
Analyzing Massive Graphs - ParT I
Chapter 9: Graphs Basic Concepts
For Friday Read chapter 9, sections 2-3 No homework
Presentation transcript:

Network Visualization

Network Data (Graph) A very hot topic Social networks Network analysis is not a new area! It is very old.

Compared with Hierarchy Data Relationship is more complicated. Between any two vertex With direction With weight But same goals Presentation of vertex and edges Interaction

Classification Node-link Matrix Hybrid

Node-Link Main challenge A good layout of vertex to reduce visual complex. A popular technique: Force-directed graph drawing Each pair of nodes are connected by a “spring”.

Character co-occurrence in Les Misérables http://hci.stanford.edu/jheer/files/zoo/

Variations: Arc Diagram http://hci.stanford.edu/jheer/files/zoo/

Variations: Radial Network

Node-Link Layout Pros Cons Direct and easy to understand Results are sensitive to initial positions of vertex High computational complex Visual complexity for large networks Not stable

Edge Bundling

Matrix

Basic Idea N x N matrix Can embed direction, weight inforamtion N: the number of nodes The intersection of row and column: relationship Can embed direction, weight inforamtion

http://hci.stanford.edu/jheer/files/zoo

How to Read a Matrix Representation

How to Follow a Path

Hybrid Approaches NodeTrix Demo http://www.youtube.com/watch?v=7G3MxyOcHKQ

Emphasizing “Social” in Social Network Visualization Considering the attribute information of nodes and edges

http://hcil2.cs.umd.edu/video/2006/substrates.mpg

http://zhang.ist.psu.edu/demo/SocialNetSense/TreeNetViz.mov

Network Data Format Various formats are used by different software tools. Can be simple Or complicated source, target a,b a,c b,c . d,e Vertices 1 “a” 2 “b” 3 “c” 4 “d” 5 “e” Arcs 1 2 1 3 2 3 . 4 5 source, target, weight a,b,1 a,c,1 b,c,1 . d,e,1

. . ] }

Work on Your Data Excel is your best friend (probably). Easy to create cells with certain patterns Picked up by Excel or defined by users Analytical tools to generate necessary data

Format data for JSON: Links What you have Your goal

Format data for JSON: Nodes All nodes are there: links are defined by nodes. What we need is a list of unique nodes. Excel: Pivot Table tool

Python

Why Do We Need Python? Python is very powerful in processing data. Various libraries available for people to use directly No need to write codes for basic tools Data analytics, natural language processing, graphics, … Programming is relative easy. Compared with Java, C++/C Visualization requires better structured data. Python can prepare for such data.

A Few Points about Python Easy to install and use Available for all major OSes. Lots of resources on the Internet Tutorials, codes, books, … Pay attention to versions 2.7 and 3.5 different significantly. Having Python on your own computer Strongly recommended

Python in Our Classroom/VLab Python, IPython, python GUI Various problems Jupyter Notebook User friendly

Exercise: Using Python to Understand Cars Data Goals: Conduct basic analysis of car data cars.csv used for parallel coordinates Copy the cars.csv from your web space to your Documents folder (under the This PC class) Start Jupyter Notebook Open pythonExercise1.pdf under the InClassExercisesResources\Week7_Python folder Type in each command You must type in every command!

Library Packages Involved in This Exercise Numpy: the fundamental package for scientific computing pandas: PANel DAta System matplotlib: a plotting library for high quality figures

pandas Basics DataFrame Series Column is retrieved by column name. A 2-dimensional, tabular structure (like a spreadsheet) Series A 1-dimensional, list structure Each column in a dataframe is a series Column is retrieved by column name. Row is retrieved by index number

More Resources Reading Cheat sheets Chapter 5 -8 PythonCheatSheets under InClassExercisesResources