Biological networks Bing Zhang Department of Biomedical Informatics Vanderbilt University

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

The multi-layered organization of information in living systems
Biological networks Bing Zhang Department of Biomedical Informatics Vanderbilt University
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae Article by Peter Uetz, et.al. Presented by Kerstin Obando.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence.
Protein domains vs. structure domains - an example.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Global topological properties of biological networks.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Signaling pathways –SPIKE Network visualization.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Evidence for dynamically organized modularity in the yeast protein- protein interaction network Han, et al
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Biological networks Construction and Analysis. Recap Gene regulatory networks –Transcription Factors: special proteins that function as “keys” to the.
Biological networks: Types and origin
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Epistasis Analysis Using Microarrays Chris Workman.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Protein Interactions and Disease Audry Kang 7/15/2013.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
A highly abbreviated introduction to proteomics
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Protein protein interactions
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Network Analysis and Application Yao Fu
Hyun Seok Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute, Yonsei University College of Medicine Lecture 13. Network Analysis MES
Biological Pathways & Networks
(D) Crosslinking Interacting proteins can be identified by crosslinking. A labeled crosslinker is added to protein X in vitro and the cell lysate is added.
Interactions and more interactions
Protein-protein interactions Courtesy of Sarah Teichmann & Jose B. Pereira-Leal MRC Laboratory of Molecular Biology, Cambridge, UK EMBL-EBI.
Finish up array applications Move on to proteomics Protein microarrays.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Proteome and interactome Bioinformatics.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Lecture 8. Topics in Biological Networks (Basics) The Chinese University of Hong Kong CSCI5050 Bioinformatics and Computational Biology.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
GO based data analysis Iowa State Workshop 11 June 2009.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Protein-protein Interactions
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
CSCI2950-C Lecture 12 Networks
Biological networks CS 5263 Bioinformatics.
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Protein Complex Discovery
Protein Complex Discovery
Presentation transcript:

Biological networks Bing Zhang Department of Biomedical Informatics Vanderbilt University

Protein-protein interaction (PPI) Definition  Physical association of two or more protein molecules Examples  Receptor-ligand interactions  Kinase-substrate interactions  Transcription factor-co-activator interactions  Multiprotein complex, e.g. multimeric enzymes BCHM352, Spring 2011 Cramer et al. Science 292:1863, 2001 RNA polymerase II, 12 subunits 2

Significance of protein interaction Most proteins mediate their function through interacting with other proteins  To form molecular machines  To participate in various regulatory processes Distortions of protein interactions can cause diseases BCHM352, Spring

Method  Bait strain: a protein of interest, bait (B), fused to a DNA-binding domain (DBD)  Prey strains: ORFs fused to a transcriptional activation domain (AD)  Mate the bait strain to prey strains and plate diploid cells on selective media (e.g. without Histidine)  If bait and prey interact in the diploid cell, they reconstitute a transcription factor, which activates a reporter gene whose expression allows the diploid cell to grow on selective media  Pick colonies, isolate DNA, and sequence to identify the ORF interacting with the bait Pros  High-throughput  Can detect transient interactions Cons  False positives  Non-physiological (done in the yeast nucleus)  Can’t detect multiprotein complexes Uetz P. Curr Opin Chem Biol. 6:57, 2002 Yeast two-hybrid 4

BCHM352, Spring 2011 Tandem affinity purification Method  TAP tag: Protein A, Calmodulin binding domain, TEV protease cleavage site  Bait protein gene is fused with the DNA sequences encoding TAP tag  Tagged bait is expressed in cells and forms native complexes  Complexes purified by TAP method  Components of each complex are identified through gel separation followed by MS/MS Pros  High-throughput  Physiological setting  Can detect large stable protein complexes Cons  High false positives  Can’t detect transient interactions  Can’t detect interactions not present under the given condition  Tagging may disturb complex formation  Binary interaction relationship is not clear Chepelev et al. Biotechnol & Biotechnol 22:1,

BCHM352, Spring 2011 Large scale protein interaction identification Experimental  Yeast two-hybrid  Tandem affinity purification Computational  Gene fusion  Ortholog interaction  Phylogenetic profiling  Microarray gene co-expression Valencia et al. Curr. Opin. Struct. Biol, 12:368,

BCHM352, Spring 2011 Protein interaction data in the public domain Database of Interacting Proteins (DIP) The Molecular INTeraction database (MINT) The Biomolecular Interaction Network Database (BIND) The General Repository for Interaction Datasets (BioGRID) Human Protein Reference Database (HPRD) Online Predicted Human Interaction Database (OPHID) The Munich Information Center for Protein Sequences (MIPS) 7

HPRD BCHM352, Spring

Protein interaction networks Saccharomyces cerevisiae Jeong et al. Nature, 411:41, 2001 Drosophila melanogaster Giot et al. Science, 302:1727, 2003 Caenorhabditis elegans Li et al. Science, 303:540, 2004 Homo sapiens Rual et al. Nature, 437:1173,

Gene regulatory networks Experimental  Chromatin immunoprecipitation (ChIP) ChIP-chip ChIP-seq Computational  Promoter sequence analysis  Reverse engineering from microarray gene expression data Public databases  Transfac (  MSigDB (  hPDI ( ) BCHM352, Spring Shen-orr et al. Nat Genet, 31:64, 2002

KEGG metabolic network BCHM352, Spring

Network visualization tools Cytoscape  BCHM352, Spring Gehlenborg et al. Nature Methods, 7:S56, 2010

BCHM352, Spring Graph representation of networks Cramer et al. Science 292:1863, 2001 edge node Graph: a graph is a set of objects called nodes or vertices connected by links called edges. In mathematics and computer science, a graph is the basic object of study in graph theory. RNA polymerase II

BCHM352, Spring Undirected graph vs directed graph Protein interaction network Nodes: protein Edges: physical interaction Undirected Transcriptional regulatory network Nodes: transcription factors and genes Edges: transcriptional regulation Directed TF->target gene Metabolic network Nodes: metabolites Edges: enzymes Directed Substrate->Product Krogan et al. Nature 440:637, 2006 Ravasz et al. Science 297:1551, 2002 Lee et al. Science 298:799, 2002 Fhl1 RPL2B

Degree, path, shortest path Degree: the number of edges adjacent to a node. A simple measure of the node centrality. Path: a sequence of nodes such that from each of its nodes there is an edge to the next node in the sequence. Shortest path: a path between two nodes such that the sum of the distance of its constituent edges is minimized. BCHM352, Spring YDL176W Degree: 3 Fhl1 Out degree: 4 In degree: 0

Obama vs Lady Gaga: who is more influential? BCHM352, Spring Obama 7,035,548701,301 Gaga 8,873,525144,263 Eminem 3,509,4690 Twitter followers (in degree) Twitter following (out degree)

BCHM352, Spring Albert et al., Nature, 406:378, 2000 Random network 130 nodes, 215 edges Homogeneous: most nodes have approximately the same number of links Five red nodes with the highest number of links reach 27% of the nodes Scale-free network 130 nodes, 215 edges Heterogeneous: the majority of the nodes have one or two links but a few nodes have a large number of links Five red nodes with the highest degrees reach 60% of the nodes (hubs) Network properties (I): hubs

BCHM352, Spring Scale-free biological networks Jeong et al, Nature, 407:651, 2000Noort et al, EMBO Reports,5:280, 2004 Stelzl et al. Cell, 122:957, 2005 Metabolic network C. elegans Protein interaction network H. sapiens Gene co-expression network S. cerevisiae

BCHM352, Spring Network properties (II): small world network Stanly Milgram’s small world experiment  Social network  Average path length between two person Small world network: a graph in which most nodes can be reached from every other by a small number of steps. Biological interpretation: Efficiency in transfer of biological information Six degrees of separation Omaha Boston Wichita "If you do not know the target person on a personal basis, do not try to contact him directly. Instead, mail this folder to a personal acquaintance who is more likely than you to know the target person."

BCHM352, Spring Network properties (III): motifs Network motifs: Patterns that occur in the real network significantly more often than in randomized networks. Three-node patterns Milo et al., Science, 298:824, 2002 Feed-forward loop Feedback loop

BCHM352, Spring Network properties (IV): modularity Modularity refers to a group of physically or functionally linked molecules (nodes) that work together to achieve a relatively distinct function. Examples  Transcriptional module: a set of co- regulated genes sharing a common function  Protein complex: assembly of proteins that build up some cellular machinery, commonly spans a dense sub-network of proteins in a protein interaction network  Signaling pathway: a chain of interacting proteins propagating a signal in the cell Protein interaction modules Palla et al, Nature, 435:841, 2005 Gene co-expression modules Shi et al, BMC Syst Biol, 4:74, 2010

Network distance vs functional similarity Proteins that lie closer to one another in a protein interaction network are more likely to have similar function and involve in similar biological process. Sharan et al. Mol Syst Biol, 3:88, BCHM352, Spring 2011

Network-based disease gene prioritization Kohler et al. Am J Hum Genet. 82:949, BCHM352, Spring 2011 For a specific disease, candidate genes can be ranked based on their proximity to known disease genes.

Summary Biological networks  Protein-protein interaction network; Gene regulatory network; Metabolic network Graph representation of networks  Graph, node, edge, undirected graph, directed graph, degree, path, shortest path Network properties  Hubs and scale-free degree distribution  Small-world  Motifs  Modularity Network-based applications  Disease gene prioritization BCHM352, Spring