Threshold selection in gene co- expression networks using spectral graph theory techniques Andy D Perkins*,Michael A Langston BMC Bioinformatics 1.

Slides:



Advertisements
Similar presentations
Microarray Data Analysis Day 2
Advertisements

Pathways analysis Iowa State Workshop 11 June 2009.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
University at BuffaloThe State University of New York Interactive Exploration of Coherent Patterns in Time-series Gene Expression Data Daxin Jiang Jian.
A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles Authors: Chia-Hao Chin 1,4,
Bi-correlation clustering algorithm for determining a set of co- regulated genes BIOINFORMATICS vol. 25 no Anindya Bhattacharya and Rajat K. De.
Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol
Basis State Prediction of Cell-Cycle Transcription Factors in Saccharomyces cerevisiae Dr. Matteo Pellegrini Dr. Shawn Cokus Sherri Rose UCLA Molecular,
Bioinformatics Spring Jianping Zhou Extraction of functional information from large-scale gene expression data.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Yeast Dataset Analysis Hongli Li Final Project Computer Science Department UMASS Lowell.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
 2 Outline  Review of major computational approaches to facilitate biological interpretation of  high-throughput microarray  and RNA-Seq experiments.
Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks Jacob Scott, Trey Ideker, Richard M. Karp, Roded Sharan RECOMB 2005.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
Lecture7 Topic1: Graph spectral analysis/Graph spectral clustering and its application to metabolic networks Topic 2: Concept of Line Graphs Topic 3: Introduction.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
TF Infer A Tool for Probabilistic Inference of Transcription Factor Activities H.M. Shahzad Asif Institute of Adaptive and Neural Computation School of.
TF Infer A Tool for Probabilistic Inference of Transcription Factor Activities H.M. Shahzad Asif Machine Learning Group Department of Computer Science.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
Gene Set Enrichment Analysis (GSEA)
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Graph and Topological Structure Mining on Scientific Articles Fan Wang, Ruoming Jin, Gagan Agrawal and Helen Piontkivska The Ohio State University The.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Networks and Interactions Boo Virk v1.0.
Kristen Horstmann, Tessa Morris, and Lucia Ramirez Loyola Marymount University March 24, 2015 BIOL398-04: Biomathematical Modeling Lee, T. I., Rinaldi,
Microarrays to Functional Genomics: Generation of Transcriptional Networks from Microarray experiments Joshua Stender December 3, 2002 Department of Biochemistry.
Paper prepared for presentation at the 16 th ICABR Conference – 128 th EAAE Seminar “The Political Economy of the Bioeconomy: Biotechnology and Biofuel”
A COMPREHENSIVE GENE REGULATORY NETWORK FOR THE DIAUXIC SHIFT IN SACCHAROMYCES CEREVISIAE GEISTLINGER, L., CSABA, G., DIRMEIER, S., KÜFFNER, R., AND ZIMMER,
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Clustering Metabolic Networks Using Minimum Cut Trees Ryan Kellogg 1, Allison Heath 2, Lydia Kavraki 2,3 1 Carnegie Mellon University, Department of Electrical.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Central dogma: the story of life RNA DNA Protein.
Statistical Testing with Genes Saurabh Sinha CS 466.
Clustering Algorithms to make sense of Microarray data: Systems Analyses in Biology Doug Welsh and Brian Davis BioQuest Workshop Beloit Wisconsin, June.
1 ArrayTrack Demonstration National Center for Toxicological Research U.S. Food and Drug Administration 3900 NCTR Road, Jefferson, AR
Bioinformatics lectures at Rice University Li Zhang Lecture 11: Networks and integrative genomic analysis-3 Genomic data
A Report on CAMDA’01 Biointelligence Lab School of Computer Science and Engineering Seoul National University Kyu-Baek Hwang and Jeong-Ho Chang.
Identification of co-expression networks by comparison of a multitude of different functional states of genome activity Marc Bonin 1, Stephan Flemming.
GO based data analysis Iowa State Workshop 11 June 2009.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Graph spectral analysis/
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Clustering Microarray Data based on Density and Shared Nearest Neighbor Measure CATA’06, March 23-25, 2006 Seattle, WA, USA Ranapratap Syamala, Taufik.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Emily Pachunka ● Spring 2017
Journal club Jun , Zhen.
Finding Dense and Connected Subgraphs in Dual Networks
Department of Genetics • Stanford University School of Medicine
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Presented by Meeyoung Park
GPX: Interactive Exploration of Time-series Microarray Data
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Interactome Networks and Human Disease
Presentation transcript:

Threshold selection in gene co- expression networks using spectral graph theory techniques Andy D Perkins*,Michael A Langston BMC Bioinformatics 1

Outline Introduce ― How to construct a gene co-expression network? ― Steps and our criterion Method Result & Analysis 2

Introduce In gene co-expression networks, nodes represent gene transcripts. Two genes are connected by an edge if their expression values are highly correlated. Definition of “high” correlation is somewhat tricky ― One can use statistical significance… ― But we propose a criterion for picking threshold parameter: spectral graph theory. 3

Introduce 4

Methods Microarray data sets ― Homo sapiens ― Saccharomyces cerevisiae: baker’s yeast 5

Methods Network construction ― Construct a complete graph ― Compute Pearson correlation coefficient between each nodes. ― A high-pass filter between 0.70 to 0.95 threshold Network representation ― Laplacian of the graph G 6

Methods Eigenvalue and eigenvector computation ― Aim to solve the eigenvalue problem defined above. ― resulting eigenvalues and associated eigenvectors, ― The eigenvector associated with λ 1 was exacted and sorted in increasing order. 7

Exmaple 8 λ 1 =0.7216V1=V1= Result:

Methods Cluster detection ― Using a sliding window technique ― Significant difference m + s/2, m:median ; s:standard deviation ― If less than 10 nodes, discard 9

Methods Paraclique extraction [17.] ― The base maximum clique size is 3. 10

Methods Functional comparisons ― To analyze some resulting paracliques in yeast and human, respectively. ― Use Saccharomyces Genome Database GO Slim Viewer and Ingenuity Pathways Analysis software 11

Results and discussion A nearly-disconnected components. [10.] 12 Result: λ1λ1 The ability to find the nearly-disconnected pieces allows us to identify those nodes sharing a well connected,or dense, cluster.

Results and discussion Spectral properties & Algebraic connectivity ― the multiplicity of the zero eigenvalue is equal to the number of connected components in the graph. ― When analyzing only the spectrum of the largest component, the smallest nonzero eigenvalue (λ 1 ): algebraic connectivity 13

Results and discussion Spectral properties & Algebraic connectivity 14 Algebraic connectivity yeast human

Results and discussion Spectral clustering :potential threshold ― Resulting in a likely nearly-disconnected component

Results and discussion Comparison with other results ― Traditional methods 16

Results and discussion Comparison with other results ― Previous studies (1) [19.] ― Based on RMT approach to determine correlation threshold ― result = 0.77,corresponds approximately to

Results and discussion Comparison with other results ― Previous studies (2) [14.] ― We select g=3, to enumerate paraclique. 18

Results and discussion Functional comparisons : SGD & IPA ― yeast t=0.78 Three largest paracliques size 21, 17, of the 21 genes had unknown molecular function; t=0.55 Three largest paracliques size 93, 53, 37. Many more of these gene have unknown molecular function(40,13,17). 19 Largely the same categories appeared within the three largest paracliques in both groups.

Results and discussion Functional comparisons : SGD & IPA ― human t= st paraclique related to cellular organization, gene expression, genetic disorder, drug metabolism, and cell signaling;2 nd protein synthesis; these were related to reproductive systems development and disease, respectively. t=0.65The networks seem to be annotated with a larger range of functions.ex : 2 nd matched 13 networks ranging from cellular assembly and organization, genetic disorder, to inflammatory disease, and many others. 20

Conclusion Here presented a systematic threshold selection method that make use of spectral graph theory. The results in agreement with previous study. At higher threshold ― Fewer of these genes fail to be categorized based upon the gene ontology. ― Fewer networks were identified as being enriched in the paracliques, making interpretation of the results easier. 21

Reference [10.] Ding CHQ, He X and Zha H: A spectral method to separate disconnected and nearly-disconnected web graph components.Proceedings of the Seventh ACM International Conference on Knowledge Discovery and Data Mining: 26–29 August 2001; San Francisco [14.] Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB,Brown PO, Botstein D and Futcher B: Comprehensive identification of cell cycle-regulated genes of the yeast Saccaromyces cerevisiae by microarray hybridization. Molecular Biology of the Cell 1998, 9 [17.] Chesler EJ and Langston MA: Combinatorial genetic regulatory network analysis tools for high throughput transcriptomic data. RECOMB Satellite Workshop on Systems Biology and Regulatory Genomics: 2–4 December 2005; San Diego