Differential analysis of Eigengene Networks: Finding And Analyzing Shared Modules Across Multiple Microarray Datasets Peter Langfelder and Steve Horvath.

Slides:



Advertisements
Similar presentations
Gene Correlation Networks
Advertisements

Functional Organization of the Transcriptome in Human Brain Michael C. Oldham Laboratory of Daniel H. Geschwind, UCLA BIOCOMP ‘08, Las Vegas, NV July 15,
Andy Yip, Steve Horvath Depts Human Genetics and Biostatistics, University of California, Los Angeles The Generalized Topological.
Automatic Identification of ROIs (Regions of interest) in fMRI data.
Bi-correlation clustering algorithm for determining a set of co- regulated genes BIOINFORMATICS vol. 25 no Anindya Bhattacharya and Rajat K. De.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
MOPAC: Motif-finding by Preprocessing and Agglomerative Clustering from Microarrays Thomas R. Ioerger 1 Ganesh Rajagopalan 1 Debby Siegele 2 1 Department.
Steve Horvath University of California, Los Angeles
Steve Horvath, Andy Yip Depts Human Genetics and Biostatistics, University of California, Los Angeles The Generalized Topological.
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Microarray analysis Algorithms in Computational Biology Spring 2006 Written by Itai Sharon.
Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.
Generating Robust and Consensus Clusters from Gene Expression Data Allan Tucker a, Stephen Swift a, Xiaohui Liu a, Nigel Martin b, Christine Orengo c,
Is Forkhead Box N1 (FOXN1) significant in both men and women diagnosed with Chronic Fatigue Syndrome? Charlyn Suarez.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Cluster Analysis Hierarchical and k-means. Expression data Expression data are typically analyzed in matrix form with each row representing a gene and.
Is my network module preserved and reproducible? PloS Comp Biol. 7(1): e Steve Horvath Peter Langfelder University of California, Los Angeles.
Consensus eigengene networks: Studying relationships between gene co-expression modules across networks Peter Langfelder Dept. of Human Genetics, UC Los.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Gene Set Enrichment Analysis Petri Törönen petri(DOT)toronen(AT)helsinki.fi.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Ai Li and Steve Horvath Depts Human Genetics and Biostatistics, University of California, Los Angeles Generalizations of.
An Overview of Weighted Gene Co-Expression Network Analysis
“An Extension of Weighted Gene Co-Expression Network Analysis to Include Signed Interactions” Michael Mason Department of Statistics, UCLA.
A Geometric Interpretation of Gene Co-Expression Network Analysis Steve Horvath, Jun Dong.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Clustering of DNA Microarray Data Michael Slifker CIS 526.
Signed weighted gene co-expression network analysis of transcriptional regulation in murine embryonic stem cells ES cell culture Self- renewing Ecto- derm.
Hierarchical Clustering and Dynamic Branch Cutting
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
Steve Horvath Co-authors: Zhang Y, Langfelder P, Kahn RS, Boks MPM, van Eijk K, van den Berg LH, Ophoff RA Aging effects on DNA methylation modules in.
Expression Modules Brian S. Yandell (with slides from Steve Horvath, UCLA, and Mark Keller, UW-Madison)
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
More About Clustering Naomi Altman Nov '06. Assessing Clusters Some things we might like to do: 1.Understand the within cluster similarity and between.
Understanding Network Concepts in Modules Dong J, Horvath S (2007) BMC Systems Biology 2007, 1:24.
Computing Co-Expression Relationships Wen-Dar Lin.
Gene expression & Clustering. Determining gene function Sequence comparison tells us if a gene is similar to another gene, e.g., in a new species –Dynamic.
Computational Biology Clustering Parts taken from Introduction to Data Mining by Tan, Steinbach, Kumar Lecture Slides Week 9.
Equivalent Opposite PTPRC low  CD19 low FAM60A low  NUAK1 high XIST high  RPS4Y1 low COL3A1 high  SPARC high Boolean analysis of large gene-expression.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Consensus modules: modules present across multiple data sets Peter Langfelder and Steve Horvath Eigengene networks for studying the relationships between.
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 5.
Clustering Microarray Data based on Density and Shared Nearest Neighbor Measure CATA’06, March 23-25, 2006 Seattle, WA, USA Ranapratap Syamala, Taufik.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Ping Wang, Mar Method Paper. Ping Wang, Mar Outline Methods –Multiple QTL model identification procedure –Adjacency Measurement –Clustering.
SUPPORTING INFORMATION Figure S1. Cotton whitefly infestation within a sealed chamber.
Steve Horvath University of California, Los Angeles Module preservation statistics.
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 3.
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
Graph clustering to detect network modules
Multi-level predictive analytics and motif discovery across large dynamic spatiotemporal networks and in complex sociotechnical systems: An organizational.
 Identification of optimal parameter ranges in building and assessing correlation networks built from gene expression. Qianran Li Supervisor:Dr. Kathryn.
Topological overlap matrix (TOM) plots of weighted, gene coexpression networks constructed from one mouse studies (A–F) and four human studies including.
(A) Hierarchical clustering was performed to identify groups of patients with similar RNASeq expression of 20 genes associated with reduced survivability.
Human-Specific Transcriptional Networks in the Brain
Volume 3, Issue 1, Pages (July 2016)
ADAGE model example. ADAGE model example. For one sample in the expression compendium (one column in the figure with red or green colors, representing.
(A) Refined profile principal component analysis loading weights plot was used to derive insight into possible association of biomarkers. (A) Refined profile.
Michal Levin, Tamar Hashimshony, Florian Wagner, Itai Yanai 
Volume 37, Issue 6, Pages (December 2012)
Functional classification and visualization of differentially expressed genes. Functional classification and visualization of differentially expressed.
Inferring Cellular Processes from Coexpressing Genes
Derivation of ImSig. Derivation of ImSig. A, An example of a correlation network generated from a tissue data set where nodes represent unique genes and.
M-Wnt and E-Wnt cells cluster tightly with claudin-low and basal-like breast tumors, respectively, by microarray analysis. M-Wnt and E-Wnt cells cluster.
Presentation transcript:

Differential analysis of Eigengene Networks: Finding And Analyzing Shared Modules Across Multiple Microarray Datasets Peter Langfelder and Steve Horvath Dept. of Human Genetics UCLA

Road map Overview of Weighted Gene Co-expression Network analysis, module detection Differential analysis of networks at the level of modules  Consensus modules  Eigengenes and eigengene networks  Consensus eigengene networks Applications: (1) Human and chimp brains, (2) 4 tissues of female mice

Overview of Weighted Gene Co-expression Network Analysis

Constructing a weighted co-expression network Get microarray gene expression data Do preliminary filtering Measure concordance of gene expression profiles by Pearson correlation Correlation matrix is raised to a power (a.k.a. soft thresholding)  weighted network

Network module detection Modules: groups (clusters) of densely interconnected genes Pairwise measure of inter-connectedness: Topological overlap measures direct connection + shared neighbours No shared neighbours: low TOM many shared neighbours: high TOM Clustering procedure: Average linkage hierarchical clustering

Example of module detection via hierarchical clustering Expression data from human brains, 18 samples. Example of module detection via hierarchical clustering

Why are modules so important? Functional: expected to group together genes responsible for individual pathways, processes etc., hence biologically well- motivated Useful from a systems-biological point of view: bridge from individual genes to a systems-level view of the organism For certain applications, modules are the natural building blocks of the description, e.g., study of co-regulation relationships among pathways

Goal of our work: Differential analysis of networks (commonalities and differences) at the level of modules

Step 1: Find consensus modules Consensus modules: modules present in each set Rationale: Find common functions/processes Set 1 Set 2 Set modules Consensus modules

Consensus module eigengenes Step 2: Represent each module by one representative: Module Eigengene Pick one representative for each module in each set: We take Module Eigengene: 1 st Principal component of module expression matrix

Step 3: form a network of module eigengenes in each set Module relationship = Cor(ME[i], ME[j]) (ME:Module eigengene) Understand differences in regulation under different conditions Modules become basic building blocks of networks: ME networks Set 1 Set 2

Summary of the methodology Individual sets Consensus modules Consensus eigengenes Consensus eigengene networks

Methods

Consensus modules: Definition In each individual set: Modules = groups of densely interconnected genes Consensus: agreement of all sets Consensus modules: groups of genes that are densely interconnected in each set

Consensus modules: Detection Detection of modules in individual sets: Measure of gene-gene similarity (TOM) + clustering Detection of Consensus modules: Define a consensus gene-gene similarity measure and use clustering

Consensus similarity measure Set 1 Set 2

Consensus similarity measure Set 1 Set 2 Min

Applications

Human and chimpanzee brain expression data Construct gene expression networks in both sets, find modules Construct consensus modules Characterize each module by brain region where it is most differentially expressed Represent each module by its eigengene Characterize relationships among modules by correlation of respective eigengenes (heatmap or dendrogram)

Detection of Consensus modules

Biological information? Assign modules to brain regions with highest (positive) differential expression Red means the module genes are over-expressed in the brain region; green means under- expression

What else is new? Preservation of modules across the primate brains and their relationships to brain regions was described by Oldham et al. (2006). Challenge: Oldham et al. (2006) did not study the relationships between the modules. Solution: study module relationships using eigengene networks

Consensus eigengene networks Heatmap comparisons of module relationships

Eigengene network visualization (II) Module dendrograms show clusters of similar modules

Consensus modules across 4 mouse tissues Consensus analysis of expression data from liver, brain, muscle, adipose tissues, BXH mouse cross ~130 samples for each tissue; 3600 genes in each network Performed Functional Enrichment Analysis

Detection of consensus modules 11 modules in total

Functional Enrichment Analysis

Conclusions Differential analysis at the level of modules Consensus modules (modules present in all sets): study common pathways Eigengene networks (comprised of module eigengenes): study commonalities and differences in regulation Applications: Consensus eigengene networks are robust and encode biologically meaningful information