Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.


Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis
Jonsson et al., BMC Bioinformatics, 2006
Team 1
Author: Kirill Osipov
Presenter: Ferhat Ay

Background
• Metastasis is an event associated with a poor prognosis in cancer patients
• Metastasising cancer cells
  ◦ break away from the primary tumor
  ◦ acquire increased motility and invasiveness
  ◦ make cancer more difficult to treat
• Networks of protein-protein interactions (PPI) drive the processes that give rise to metastasising cancer cells

PPI network analysis methods
1. Experimental, biochemical analysis
  ◦ Microarray gene expression data
    ▪ provides correlation between gene expression levels
    ▪ lacks information on the protein interaction mechanism
  ◦ Genome survey
    ▪ High throughput but error-prone
    ▪ In-depth but time-consuming and expensive
2. Computational analysis
  ◦ Complements biochemical analysis
  ◦ Can predict protein-protein interactions
  ◦ Aggregates data from many sources

Computational PPI analysis
• A protein interaction network is composed of predictions of individual PPIs
  ◦ a prediction score is assigned to each interaction
  ◦ higher score => higher prediction confidence
• Computational protein network construction
  1. Collect data on
    ▪ gene expression
    ▪ gene ontologies
    ▪ phenotypic profiling
    ▪ functional similarities
  2. Apply Bayesian regression to deduce PPIs
  3. Identify communities of protein interaction
    ▪ e.g. using the clique percolation method
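The community-identification step can be illustrated with a minimal sketch of clique percolation, assuming the usual definition: a community is the union of k-cliques that can be reached from one another through adjacent k-cliques, where two k-cliques are adjacent if they share k-1 proteins. The function name, the protein labels A to H and the brute-force clique enumeration are illustrative assumptions suited to toy graphs only; this is not the implementation used in the paper.

```python
from itertools import combinations


def k_clique_communities(edges, k=3):
    """Toy clique percolation: merge k-cliques that share k-1 nodes."""
    # Build an adjacency map from the undirected edge list.
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    # Enumerate every k-clique by brute force (acceptable for toy graphs only).
    cliques = [
        frozenset(c)
        for c in combinations(sorted(adj), k)
        if all(b in adj[a] for a, b in combinations(c, 2))
    ]

    # Union-find over cliques: adjacent cliques end up in the same set.
    parent = list(range(len(cliques)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(cliques)), 2):
        if len(cliques[i] & cliques[j]) == k - 1:
            parent[find(i)] = find(j)

    # A community is the union of the nodes of all cliques in one set.
    communities = {}
    for i, clique in enumerate(cliques):
        communities.setdefault(find(i), set()).update(clique)
    return sorted(sorted(members) for members in communities.values())


# Hypothetical interaction network: two dense groups joined by D-E,
# plus a dangling protein H attached only to G.
edges = [
    ("A", "B"), ("A", "C"), ("B", "C"), ("B", "D"), ("C", "D"),
    ("D", "E"),
    ("E", "F"), ("E", "G"), ("F", "G"),
    ("G", "H"),
]
communities = k_clique_communities(edges, k=3)
print(communities)
```

In this toy network the triangles A-B-C and B-C-D share two proteins and merge into one community, E-F-G forms a second one, and H belongs to no triangle, so it is left out of every community.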

Paper (Jonsson et al.) Overview
• Created a protein interaction network ("interactome") for the rat
  ◦ used computational and homology* approaches
  ◦ developed a scoring function based on
    ▪ homology sequence similarity
    ▪ amount of experimental data for every PPI in the network
  ◦ implemented an automated solution for building a network of interacting proteins
• Demonstrated the utility of the interactions predicted by the interactome
  ◦ Mapped predicted interactions to tumor expression data
  ◦ Confirmed that the interactome predicts protein networks involved in cancer processes
*Homology approaches study data from organisms sharing an ancestry. They assume that protein modules that interact in one organism may be considered to interact in a related organism.
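The slides name the two ingredients of the scoring function (sequence similarity to homologs and the amount of experimental data) but do not give the formula. The sketch below is therefore purely hypothetical: it assumes similarities normalised to the range 0 to 1 and simply sums similarity-weighted experiment counts. The `Evidence` class, its field names and the weighting are illustrative assumptions, not the scoring function of Jonsson et al.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """One homologous protein pair observed to interact (hypothetical record)."""
    sim_a: float        # similarity of protein A to its homolog, assumed in [0, 1]
    sim_b: float        # similarity of protein B to its homolog, assumed in [0, 1]
    n_experiments: int  # number of experiments reporting the homologous interaction


def interaction_score(evidence):
    """Illustrative score: more similar homologs and more experiments score higher."""
    return sum(e.sim_a * e.sim_b * e.n_experiments for e in evidence)


# Hypothetical evidence for a single predicted interaction.
well_supported = [Evidence(1.0, 0.5, 2), Evidence(0.5, 0.5, 4)]
weakly_supported = [Evidence(0.5, 0.5, 1)]

print(interaction_score(well_supported))    # higher score
print(interaction_score(weakly_supported))  # lower score
```

Whatever the exact weighting, the property the slides rely on is monotonicity: a prediction backed by closer homologs or by more experiments should never score lower.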

Homology-based data selection
[Figure: the homology data used for scoring is bounded by a red box]

Scoring function verification
• Checked scores against highly reliable X-ray crystallographic evidence
  ◦ Confirmed that highly reliable interactions identified by X-ray crystallography correlate with high scores from the scoring function
• Identified communities of protein interaction in cellular processes
  ◦ Confirmed high scores for intra-community interactions, i.e. interactions between proteins within the same cellular process
  ◦ Confirmed low scores for inter-community interactions, i.e. those between proteins that are not believed to interact
• Validated high scores for protein interactions within the same cellular compartment and low scores for interactions across separate cellular compartments
  ◦ Used localization data from the Gene Ontology Consortium
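The intra- versus inter-group check can be sketched as a comparison of mean scores. The example below assumes each protein carries one group label (a cellular process or a cellular compartment) and that scores are stored per protein pair. The protein names, labels and score values are made up for illustration and are not data from the paper.

```python
from statistics import mean


def mean_scores_by_group(scored_edges, group_of):
    """Return (mean intra-group score, mean inter-group score).

    scored_edges maps (protein_a, protein_b) -> score.
    group_of maps protein -> group label (process or compartment).
    A mean is None when no edge falls into that category.
    """
    intra, inter = [], []
    for (a, b), score in scored_edges.items():
        if group_of[a] == group_of[b]:
            intra.append(score)
        else:
            inter.append(score)
    return (mean(intra) if intra else None,
            mean(inter) if inter else None)


# Hypothetical scores and group labels.
scored_edges = {
    ("P1", "P2"): 8.0,
    ("P2", "P3"): 6.0,
    ("P3", "P4"): 1.0,
    ("P1", "P4"): 2.0,
}
group_of = {"P1": "signalling", "P2": "signalling",
            "P3": "signalling", "P4": "adhesion"}

intra_mean, inter_mean = mean_scores_by_group(scored_edges, group_of)
print(intra_mean, inter_mean)
```

A scoring function that behaves as described in the slide should give a clearly higher intra-group mean than inter-group mean, both for process-based and for compartment-based grouping.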

Analysis Overview
• Collected data from a microarray analysis of cell lines with different metastatic potential
• Constructed networks from the direct interactions of the originating proteins, then added 2nd-order interactions relative to the originating proteins
  ◦ 10,628 interactions
• Used the clique percolation method to identify communities of protein interactions
  ◦ 37 protein communities
  ◦ 313 proteins
  ◦ 1,094 interactions
• The majority of communities are associated with processes involved in cancer metastasis
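The network construction step can be sketched as a neighbourhood expansion around the originating (seed) proteins: first their direct interaction partners, then the partners of those partners. The function name and the toy interaction list are hypothetical. Keeping every interaction between two selected proteins is also an assumption, since the slides do not say which edges are retained.

```python
def expand_network(seeds, interactions, order=2):
    """Select proteins within `order` interaction steps of the seed proteins."""
    # Build an adjacency map from the undirected interaction list.
    adj = {}
    for a, b in interactions:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)

    selected = set(seeds)
    frontier = set(seeds)
    for _ in range(order):
        # Partners of the current frontier that have not been selected yet.
        frontier = {n for p in frontier for n in adj.get(p, ())} - selected
        selected |= frontier

    # Assumption: keep every interaction whose two proteins were both selected.
    edges = {(a, b) for a, b in interactions if a in selected and b in selected}
    return selected, edges


# Hypothetical interactome: S is the seed, C is three steps away, X-Y is unrelated.
interactions = [("S", "A"), ("A", "B"), ("B", "C"), ("S", "D"), ("X", "Y")]
proteins, network_edges = expand_network({"S"}, interactions, order=2)
print(sorted(proteins))
print(sorted(network_edges))
```

With `order=2` the seed S pulls in A and D directly and B at second order, while C and the unrelated pair X-Y stay outside the network.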

Protein communities identified by cluster analysis

Example: Intracellular signaling cascade community (zoomed-in view)

Summary
• The approach proposed by Jonsson et al. departs from earlier methods of using expression data in community networks
• The authors generated the networks first, mapped the expression data on top of the networks, and then performed clustering
• The approach made it possible to bypass obstacles involved in traditional microarray analysis, e.g. clustering gene expression patterns
• The approach focused on metastasis-related interactions by using the clique method to highlight hubs of highly interconnected protein communities
• The clique method considers the complex parts of the network, but simple linear pathways are not included in the analysis