COT 6398: Network Science Spring 2016 Mainak Chatterjee

Slides:



Advertisements
Similar presentations
Complex Network Theory
Advertisements

Scale Free Networks.
Advanced Topics in Data Mining Special focus: Social Networks.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
The Structure of Networks with emphasis on information and social networks RU T-214-SINE Summer 2011 Ýmir Vigfússon.
University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks Taehyong Kim Computer Science and Engineering State University.
Exp. vs. Scale-Free Poisson distribution Exponential Network Power-law distribution Scale-free Network.
Mining and Searching Massive Graphs (Networks)
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
TC2-Computer Literacy Mr. Sencer February 4, 2010.
Research of Network Science Prof. Cheng-Shang Chang ( 張正尚教授 ) Institute of Communications Engineering National Tsing Hua University Hsinchu Taiwan
Network Motifs: simple Building Blocks of Complex Networks R. Milo et. al. Science 298, 824 (2002) Y. Lahini.
Agilent: The Company, The Myth, The Lengend. Agilent: Agilent Technologies Inc. (NYSE: A) is a world-wide, diverse technology company focused on expansion.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
BIO337 Systems Biology/Bioinformatics (course # 50524) Spring 2014 Tues/Thurs 11 – 12:30 PM BUR 212 Edward Marcotte/Univ. of Texas/BIO337/Spring 2014.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Graph Theory in 50 minutes. This Graph has 6 nodes (also called vertices) and 7 edges (also called links)
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 9.1 Chapter 9 : Social Networks What is a social.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Information Networks Introduction to networks Lecture 1.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Statistical Mechanics of Complex Networks: Economy, Biology and Computer Networks Albert Diaz-Guilera Universitat de Barcelona.
Network Analysis Diffusion Networks. Social Network Philosophy Social structure is visible in an anthill Movements & contacts one sees are not random.
Developing a Good Research Question (for Project #4)
Helping scientists collaborate BioCAD. ©2003 All Rights Reserved.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Stefano Boccaletti Complex networks in science and society *Istituto Nazionale di Ottica Applicata - Largo E. Fermi, Florence, ITALY *CNR-Istituto.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Introduction to Science Informatics Lecture 1. What Is Science? a dependence on external verification; an expectation of reproducible results; a focus.
Lecture 1: Introduction CS 790g: Complex Networks Slides are modified from Statistical physics of complex networks by Sergei Maslov and Complex Adaptive.
and Complex Adaptive Systems by Eileen Kraemer
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Web Intelligence Complex Networks I This is a lecture for week 6 of `Web Intelligence Example networks in this lecture come from a fabulous site of Mark.
KSE631: Content Networking Uichin Lee Feb. 07, 2011.
Complex Network Theory – An Introduction Niloy Ganguly.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Complex Network Theory – An Introduction Niloy Ganguly.
A connected simple graph is Eulerian iff every graph vertex has even degree. The numbers of Eulerian graphs with, 2,... nodes are 1, 1, 2, 3, 7, 16, 54,
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
+ Big Data, Network Analysis Week How is date being used Predict Presidential Election - Nate Silver –
Introduction to biological molecular networks
Lecture 2: Introduction CS 765: Complex Networks Slides are adapted from Constantine Dovrolis, Eileen Kraemer, Peter Dodds, and Sergei Maslov.
Lecture 1: Introduction CS 790g: Complex Networks Slides are modified from Statistical physics of complex networks by Sergei Maslov and Complex Adaptive.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Elements of Network Science: CptS / EE Assefaw Gebremedhin Washington State University School of Electrical Engineering and Computer Science.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
1 Dr. Michael D. Featherstone Introduction to e-Commerce Network Theory 101.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
BCH339N Systems Biology/Bioinformatics (course # 54040) Spring 2016 Tues/Thurs 11 – 12:30 PM BUR 212.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Lecture II Introduction to complex networks Santo Fortunato.
Department of Computer and IT Engineering University of Kurdistan Social Network Analysis Introduction By: Dr. Alireza Abdollahpouri.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
CNT 5805: Network Science Fall 2016 Mainak Chatterjee
Structures of Networks
Lecture 1: Introduction CS 765: Complex Networks
E-Commerce Theories & Practices
Genomes and Their Evolution
Research Scopes in Complex Network
Presentation transcript:

COT 6398: Network Science Spring 2016 Mainak Chatterjee

The slides that I will be using for this class have been copied from open sources from the Web. I do not claim any intellectual property for the following material. I thank all other instructors for making the slides available and making my life easy. Disclaimers

Some real networks – Social Networks Networks of acquaintances Collaboration networks – actor networks, co-authorship networks, director networks Phone-call networks, networks, IM networks, Bluetooth networks Sexual networks Home page/blog networks – Information Networks Citation network (directed acyclic) The Web (directed) Peer-to-Peer networks Software graphs – Distribution Networks The Internet (router level, AS level) Power Grids, Telephone Networks Airline networks Transportation Networks (roads, railways, pedestrian traffic)

Some real networks…. – Biological Networks Protein-Protein Interaction Networks Gene regulation networks Gene co-expression networks Metabolic pathways The Food Web Neural Networks – Economic Networks Bank networks Credit card – Natural Networks River networks Leaf networks Epidemic

How do networks “look” like? Let’s look at some pictures… These will give you some idea of what we are dealing with. A word of caution – When was the data collected – How was the data collected (methodology) – Credibility of the experiments Nevertheless, the pictures tell a story

Terrorist Network

Actors (

Links among blogs (2004 presidential) election)

Product recommendations

Facebook The “Social Graph” behind Facebook

Technological networks Networks built for distribution of commodity – The Internet router level, AS level – Power Grids – Airline networks – Telephone networks – Transportation Networks roads, railways, pedestrian traffic

PoP-level Internet2 network

The Internet at AS level

Internet as measured by Hal Burch and Bill Cheswick's Internet Mapping Project.Internet Mapping Project

Routers

Power networks

transportation networks: airlines Source: Northwest Airlines WorldTraveler Magazine

transportation networks: railway maps Source: TRTA, March Tokyo rail map

Biological networks Biological systems represented as networks – Protein-Protein Interaction Networks – Gene regulation networks – Gene co-expression networks – Metabolic pathways – The Food Web – Neural Networks

Citric acid cycle Metabolites participate in chemical reactions metabolic networks

Biochemical pathways (Roche) Source: Roche Applied Science,

Gene regulatory networks GRN is a collection of DNA segments in a cell which interact with other indirectly (through their RNA and protein expression products) and with other substances in the cell, thereby governing the expressions levels of Messenger RNA (mRMA) and proteins. humans have 30,000 genes the complexity is in the interaction of genes can we predict what result of the inhibition of one gene will be? Source:

protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions Bio map by L-A Barabasi _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Protein binding networks Baker’s yeast S. cerevisiae (only nuclear proteins shown) Nematode worm C. elegans

Transcription regulatory networks Bacterium: E. coli Single-celled eukaryote: S. cerevisiae

The Protein Network of Drosophila (a small fly) CuraGen Corporation Science, 2003

KEGG database: Metabolic networks

C. elegans neurons

Network of Interacting Pathways (NIP) A.Mazurie D.Bonchev G.A. Buck, organisms

Freshwater food web by Neo Martinez and Richard Williams

Protein interaction networks

Ecological networks

Brain networks

Structural brain connectivity

Economic networks

World-trade networks

exchanges in a company

Phone calls in a country

Socio-epidemic networks

Epidemic networks

River networks

Leaf networks

Nodes: Links: Companies Investment Pharma Research Labs Public Biotechnology Collaborations Financial R&D Network Science: Introduction 2012 Business ties in bio-tech industry

Network Science: Introduction 2012 What message did the pictures convey? Enough of pictures

Network Science: Introduction 2012 We are surrounded by complex systems, from the society (a collection of 7 billion individuals, to communications systems, to neurons in the brain. All these systems work together in a seamless fashion. These systems, random looking at first, upon close inspection display endless signatures of order and self-organization whose quantification, understanding, prediction and eventually control is the major intellectual challenge. Bottomline….it’s the complexity

THE ROLE OF NETWORKS Behind each complex system there is a network (a wiring diagram), that defines the interactions between the component. We will never understand complex system unless we map out and understand the networks behind them. Network Science: Introduction 2012

“Complex” networks Complex networks are large (in node number) Complex networks are sparse (low edge to node ratio) Complex networks are usually dynamic and evolving Complex networks can be social, economic, natural, informational, abstract,... Isn’t this graph theory? – Yes, but emphasis is on data and mechanistic explanations...

Networks in complex systems Complex systems – Large number of components interacting with each other – All components and/or interactions are different from each other – Paradigms: 10 4 types of proteins in an organism, 10 6 routers in the Internet 10 9 web pages in the WWW neurons in a human brain The simplest property: – who interacts with whom? can be visualized as a network Complex networks are just a backbone for complex dynamical systems

What is Network Science? The study of complex systems that can be represented as (typically dynamic) networks –Society, Economy –Various biological networks (e.g., metabolism) –Brain –WWW, Internet, Transportation nets –Natural networks (global climate system) –Many others Relies on: –Network data (strong empirical basis) –Network models –Network algorithms –Statistics of network data

Main components of Network Science Structural properties of complex networks –E.g., scale-free, small-world Dynamics of networks Dynamics on networks Dynamics of & on networks (co-evolutionary networks) How does structure affect dynamics? And how do dynamics affect structure?

The roots of Network Science Graph Theory Statistical Mechanics Nonlinear Dynamics Games and Learning Data mining (“graph mining”) and machine learning Algorithms Complexity theory

Applications of Network Science Social networks and social media Economic networks Biology Ecology Network medicine Climate science Brain Science and Neuroscience Web Internet and computer networks.. many others

Why study Network Science now? Network Science: Introduction 2012 The list of chemical reactions that take place in a cell were discovered over a 150 year period by biochemists and biologists. In the 1990s they were collected in central databases, offering the first chance to assemble the networks behind a cell. The list of actors that play in each movie were traditionally scattered in books and encyclopedias. With the advent of the Internet, these disparate data were assembled into a central database by imdb.com, mainly to feed the curiosity of movie aficionados. The database offered the first chance for network scientists to explore the structure of the affiliation network behind Hollywood. The detailed list of authors of millions of research papers were traditionally scattered in the table of content of thousands of journals, but recently the Web of Science, Google Scholar, and other sites assembled them into comprehensive databases, easing the search for scientific information. These databases turned into the first science collaboration maps.

Data Availability: Universality: The (urgent) need to understand complexity: THE EMERGENCE OF NETWORK SCIENCE Movie Actor Network, 1998; World Wide Web, C elegans neural wiring diagram 1990 Citation Network, 1998 Metabolic Network, 2000; The architecture of networks emerging in various domains of science, nature, and technology are more similar to each other than one would have expected. During the past decade some of the most important advances towards understanding complexity were provided in context of network theory. Network Science: Introduction 2012

Interdisciplinary: Cell biologists/Computer Scientists/others needs the wiring diagram behind their system, extracting information from incomplete and noisy data sets. Empirical, Data driven: Focus is on data and its utility. Not just mathematical models, but tools to test real data. The value addition will be judged by the insights it offers. Quantitative and Mathematical Nature: Graph theory; organizing principles from statistical physics, control and information theory, statistics and data mining. Computational Intensive: Size of networks giving rise to Big Data. Needs algorithms, database management, data mining, analytics, software tools. THE CHARACTERISTICS OF NETWORK SCIENCE Network Science: Introduction 2012

Impact of Network Science

Network Science: Introduction January 10, 2011 Network Science: Introduction 2012

The 2003 blackout is a typical example of a cascading failure. When a network acts as a transportation system, a local failure shifts loads or responsibilities to other nodes. If the extra load is negligible, the rest of the system can seamlessly absorb it, and the failure remains effectively unnoticed. If the extra load is too much for the neighboring nodes to carry, they will either tip or again redistribute the load to their neighbors. Either way, we are faced with a cascading event, the magnitude and reach of which depend on the centrality and capacity of the nodes that have been removed in the first round. Case in point is electricity: As it cannot be stored, when a line goes down, its power must be shifted to other lines. Most of the time the neighboring lines have no difficulty carrying the extra load. If they do, however, they will also tip and redistribute their increased load to their neighbors. Cascading failures are common in most complex networks. They take place on the Internet, when traffic is rerouted to bypass malfunctioning routers, occasionally creating denial of service attacks on routers that do not have the capacity to handle extra traffic. We witnessed one in 1997, when the International Monetary Fund pressured the central banks of several Pacific nations to limit their credit. And they are behind the financial meltdown, when the US credit crisis paralyzed the economy of the globe, leaving behind scores of failed banks, corporations and even bankrupt states, like Greece. Example: Cascading Failures

Network Science: Introduction 2012 American effort to dry up the money supply of terrorist organizations is aimed at crippling terrorist networks. Doctors and researchers hope to induce cascading failures to kill cancer cells. Cascading can also help

How come we see relevant ads when we browse? All rely on networks – connections that we have with products/ people/things/etc. ECONOMIC IMPACT – Business exploiting user data Network Science: Introduction 2012

In September 2010 the National Institutes of Health awarded $40 million to researchers at Harvard, Washington University in St. Louis, the University of Minnesota and UCLA, to develop the technologies that could systematically map out brain circuits. The Human Connectome Project (HCP) with the ambitious goal to construct a map of the complete structural and functional neural connections in vivo within and across individuals. BRAIN RESEARCH Network Science: Introduction 2012

If you were to understand the spread of diseases, can you do it without networks? If you were to understand the WWW structure, searchability, etc, hopeless without invoking the Web’s topology. If you want to understand human diseases, it is hopeless without considering the wiring diagram of the cell. MOST IMPORTANT Networks Really Matter Network Science: Introduction 2012

Course Logistics Second time Network Science is being offered at UCF –First time in Fall 2014 Relatively new: Very few universities offer Network Science –Course website lists a few

Course Website The course website: Linked from my “Teaching” page Instructor: Mainak Chatterjee Office: HEC 305 Office hours: Tue/Thu 3:00 – 4:30 PM or by appointments TA: None (Do NOT use the webcourse

Books A.Barabasi, Network Science M.E.J. Newman, Networks: An Introduction, Oxford University Press, Not available online. D. Easley and J. Kleinberg, Networks, Crowds and Markets, Cambridge Univ Press,

Topics to be covered…this is what I intend to teach Graph definitions, paths, components, degree distribution, clustering, degree correlations, centrality metrics, small-world property, scale-free property, heavy-tailed degree distributions, network motifs, Poisson networks, Watts-Strogatz model, preferential attachment and its variants, applications in communications and social networks, community identification and detection algorithms, percolation, vulnerabilities, resilience to random and targeted attacks, epidemics, immunization strategies, influence identification, games on networks, strategic network formation, evolution due to cooperation and non-cooperation on social networks.

Grading 3 Assignments (3 X 15) = 45 Points These will be programming assignments and analytical questions. 1 Mid-term exam = 30 points Sometime in Mid April Topic review = 25 points Requires 2 presentations (2 and 10 minutes)

Helps if you know Graph Theory –Basic definitions and properties Probability theory –Distribution functions (Binomial, Poisson) Linear Algebra –Eigenvalues and eigenvectors A bit of differential equations Programming knowledge (any language)

Project Review You will read at least 3-4 papers or book chapters on a particular topic. You will make a 2-min presentation where you let the class know the topic that you would be reviewing and the set of papers that you would be reading. Final Presentation: What is the topic/problem that you have reviewed? How has it been studied? What are the technical approaches? You will have to present the details of these approaches so make sure you understand them well. What kind of data sets were used? Was data collected via experiments? If so, what was the methodology? Again, you will have to provide details. What insights do the analysis/results provide? If you were to investigate this problem, how would you go about doing so? Can this line of research be extended beyond the current understanding? If so, how?