Computational Construction of Intra-Cellular Networks Tolga Can Associate Professor Department of Computer Engineering Middle East Technical University.

Slides:



Advertisements
Similar presentations
Biological pathway and systems analysis An introduction.
Advertisements

MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Doug Raiford Lesson 13 5/10/20151Gene networks and pathways.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Prediction of Therapeutic microRNA based on the Human Metabolic Network Ming Wu, Christina Chan Bioinformatics Advance Access Published January 7, 2014.
The STRING database Michael Kuhn EMBL Heidelberg.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
An Exploratory Method to Reconstruct Pathways Cory Tobin.
Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Speaker: Zhu YANG 6 th step, 2006.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Introduction to BioInformatics GCB/CIS535
Systems Biology Biological Sequence Analysis
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Pathway databases Goto S, Bono H, Ogata H, Fujibuchi W, Nishioka T, Sato K, Kanehisa M. (1997) Organizing and computing metabolic pathway data in terms.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Seminar in Bioinformatics (236818) Ron Y. Pinter Fall 2007/08.
Affinity chromatography/mass spec Bait protein GST Page 252.
Protein Classification A comparison of function inference techniques.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
Ch10. Intermolecular Interactions and Biological Pathways
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery l Cellular Overview.
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Biological Pathways & Networks
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Reconstruction of Transcriptional Regulatory Networks
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Central dogma: the story of life RNA DNA Protein.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
GO based data analysis Iowa State Workshop 11 June 2009.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
 Signal Transduction transmits signals from outside to the inside of the cell  Integer Linear Programming model is used to unravel STN.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
BCB 570 Spring Signal Transduction Julie Dickerson Electrical and Computer Engineering.
Protein-protein Interactions
Compiling Information and Inferring Useful Knowledge for Systems Biology by Text Mining the Literature Anália Lourenço IBB – Institute for Biotechnology.
System Structures Identification
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CSCI2950-C Lecture 13 Network Motifs; Network Integration
SEG5010 Presentation Zhou Lanjun.
Presentation transcript:

Computational Construction of Intra-Cellular Networks Tolga Can Associate Professor Department of Computer Engineering Middle East Technical University Ankara, Turkey

Getting to Atlanta

METU

Overview of the Tutorial (1) Introduction to Intra-cellular networks –Protein-protein interaction networks –Signal transduction networks –Transcriptional regulation networks a.k.a gene regulatory networks (GRNs) –Metabolic networks

Overview of the Tutorial (2) Computational methods to construct networks –SiPAN: simultaneous prediction and alignment of PPI networks by Alkan and Erten (March 2015, Bioinformatics) –lpNet: a linear programming approach to reconstruct signal transduction networks by Matos et al (May 2015, Bioinformatics) –Reconstructing genome-scale metabolic models with merlin by Dias et al. (April 2015, Nucleic Acids Research)

Networks are inter-linked

from the KEGG PATHWAY Database and can be complex

Protein-protein interaction networks Can be stable or transient physical interactions

Stable interactions in protein complexes E.g., ATPase

Transient interactions MAPK Signaling Pathway

Transient interactions Examples: –protein kinases add a phosphate group to a target protein –Transport proteins such as nuclear pore importins can carry other proteins These interactions form the dynamic part of PPI networks A PPI network downloaded from a database may contain mixed stable and transient interactions

Signal Transduction Networks PIP3 signalling module in B lymphocytes Unravelling the signal-transduction network in B lymphocytes, Sambrano, Nature, December 2002

Sources for interaction data Interaction databases: –BioGRID ,952 physical interactions between 19,906 human genes –IntAct by EBI (curated from literature) 531,946 interactions between 89,310 interactors extracted from 13,807 publications –STRING 10 (functional associations) Covers 9,643,763 proteins from 2,031 organisms Experimental techniques Focused low-throughout studies –Should be mined from free-text research literature

Functional associations E.g. The String Database The network around the BRCA1 gene in human. The snapshot is from the STRING Database at string.embl.de

Experimental techniques Yeast Two-hybrid Tagged Fusion Proteins Coimmunoprecipitation APMS – Affinity Purification-Mass Spectrometry –A tool for the characterization of protein complexes ( Bauer and Kuster, Eur. J. Biochem. 270, (2003) ) Biacore Atomic Force Microscopy (AFM) Fluorescence Resonace Energy Trasfer (FRET) X-ray Diffraction

Gene regulatory networks Interactions between transcription factors and their target proteins Post translational regulation by other factors such as microRNAs lead to hierarchical networks of diverse components (TFs, miRNAs, RNA binding proteins (RBPs))

shallow network, few long cascades. compact in-degree (promoter size limitation) The gene regulatory network of E. coli Shen-Orr et. al. Nature Genetics 2002 modular

Blue nodes x y z FFL Network motifs

Metabolic pathways Network of biochemical reactions in a cell –Reactions, metabolites, reaction dynamics Data sources –KEGG (Kyoto Encyclopedia of Genes and Genomes) –BioCyc, EcoCyc, MetaCyc – focus on particular species

Metabolic pathways Overview of the basic metabolic pathways of D. radiodurans How radiation kills cells: Survival of Deinococcus radiodurans and Shewanella oneidensis under oxidative stress, by Ghosal et al, FEMS Microbiology Reviews, 2005

Genome-scale metabolic networks May take days to construct We will discuss the detailed workflow of a metabolic network construction tool: merlin –1867 reactions, 1467 metabolites in the K. lactis metabolic model

Computational methods to construct networks SiPAN: simultaneous prediction and alignment of PPI networks by Alkan and Erten (March 2015, Bioinformatics) lpNet: a linear programming approach to reconstruct signal transduction networks by Matos et al (May 2015, Bioinformatics) Reconstructing genome-scale metabolic models with merlin by Dias et al. (April 2015, Nucleic Acids Research)

SiPAN overview Protein-protein interactions can be inferred by transferring interactions from a similar organism: interologs –We need to align networks of two different organisms for identification of interologs –However, network alignment methods assume error-free networks Propose an EM like strategy to iteratively refine the networks and converge to a better alignment and networks

SiPAN overview SPINAL RWS

SiPAN overview on an example

The algorithm

Non-conservation –Given a pair mappings (u,u’) and (v,v’) in an alignment (u,v in G 1 and u’,v’ in G 2 ), if the edge (u,v) exists and (u’,v’) does not exist (or vice verso), this is called a non-conservation and it can be resolved by either inserting the missing edge or deleting the existing edge. The objective of the algorithm is to resolve non-conservations that are significant.

Candidate set Candidate sets C 1 and C 2 –The set of non-conserved edges in G 1 and G 2, respectively

Breakpoint The candidate sets are sorted separately with respect to interaction confidence scores (as computed by RWS) –Increasing order with respect to edge confidence scores A breakpoint on a candidate set is an index on the sorted list of candidates such that the resolved deletions have smaller indices and the resolved insertions have higher indices than this index.

Indel If an edge-pair in both candidate sets is still non-conserved after committing both insertions/deletions in the two candidate sets such an edge-pair is called an indel and should be resolved by giving a higher priority to the operation on one of the candidate sets.

Resolving indels Indels are resolved from from higher to lower priority –Small weight  higher priority Weight of an indel is –w(u,v) x w(u’,v’) –Let in be the index of (u,v) and in’ be the index of (u’,v’) in their corresponding candidate sets –w(u,v)=in/|C 1 | and w(u’,v’)=(|C 2 |-in’)/|C 2 | or –w(u,v)=(|C 1 |-in)/|C 1 | and w(u’,v’)=in’/|C 2 |

Resolving indels

Steps of SiPAN on an example

Inference of Signaling Networks

HPN-DREAM breast cancer network inference challenge The goal of the breast cancer network inference challenge is to quickly and effectively advance our ability to infer causal signaling networks and predict protein phosphorylation dynamics in cancer. Dataset –extensive training data from experiments on four breast cancer cell lines stimulated with various ligands. The data comprise protein abundance time- courses under inhibitor perturbations.

In silico challenge Infer the causal edges in a 20 node network given a dataset containing the 20 nodes’ observations across 10 time points and 4 perturbation experiments (one of these being the control)

In silico challenge

Experimental challenge Infer 32 causal networks, one for each combination of cell line and stimulus –4 cell lines –8 different stimuli. Each of the 32 datasets contains 45 nodes’ observations across 7 time points and 4 inhibition experiments (one of these being the control).

Experimental challenge

lpNet Network inference based on linear programming Infer interactions based on a combination of perturbation/non-perturbation and steady-state/time-series data The signaling network to be inferred is modeled by a weighted graph G –Nodes represent proteins –A weighted edge w ij represents an interaction >0 activation, <0 inhibition

Activity of a node Computed by the following model

The linear programming model

Results lpNet ranked 3 rd in the in silico challenge and 29 th in the experimental challenge among 60 participating teams. lpNet is robust against noise lpNet is faster than DDEPN –lpNet takes on average 15 min to infer a network with 10 nodes, 10 time points and 2 perturbations, while DDEPN takes, on average, 101 min –(computations done on an Intel Xeon 3 GHz, 26MB L2 cache, 32GB RAM, 64 bit Linux OS).

Inference of Genome-Scale Metabolic Models

merlin A tool for reconstructing genome-scale metabolic models

Traditional GSMM reconstruction process

merlin architecture

merlin: homology data curation interface

merlin: Reactions viewer

Conclusions Several tools, methods exist for construction of genome-scale intra-cellular networks Challenge: –Integrate different types of genome-scale networks together in a single cell model to simulate all processes in silico.