Human growth is associated with distinct patterns of gene expression in evolutionarily conserved networks Adam Stevens, Daniel Hanson, Andrew Whatmore,

Slides:



Advertisements
Similar presentations
Gene expression profiling identifies molecular subtypes of gliomas
Advertisements

Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
EnrichNet: network-based gene set enrichment analysis Presenter: Lu Liu.
Gene Set Enrichment Analysis (GSEA)
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
Supplemental Figures. Supplemental Figure 1. Top two canonical pathways clustering the potential predictive biomarkers. The Ingenuity IPA tool was used.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Microarray data analysis David A. McClellan, Ph.D. Introduction to Bioinformatics Brigham Young University Dept. Integrative Biology.
Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets Node Patient 1Patient 2Patient 3 Primary Mets.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
Getting the story – biological model based on microarray data Once the differentially expressed genes are identified (sometimes hundreds of them), we need.
E14.5E16.5E18.5 Normalized mRNA level Get1 Nfix Smarcd3 A Supplementary Figure 1 (A) The microarray expression levels of bladder terminal differentiation.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Supplementary Figure 1 List of top 50 most abundantly expressed lncRNAs and mRNAs in HP and PFC from RNAseq. Expression abundance of lncRNAs and mRNAs.
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
Volume 19, Issue 5, Pages (May 2017)
DNA Methylation Regulates Gene Expression in Intracranial Aneurysms
System Structures Identification
Genomic Investigation of Lupus in the Skin
M. Fu, G. Huang, Z. Zhang, J. Liu, Z. Zhang, Z. Huang, B. Yu, F. Meng 
Volume 44, Issue 1, Pages (January 2016)
Volume 2, Issue 2, Pages (February 2016)
Building and Analyzing Genome-Wide Gene Disruption Networks
Ingenuity Knowledge Base
Low Dimensionality in Gene Expression Data Enables the Accurate Extraction of Transcriptional Programs from Shallow Sequencing  Graham Heimberg, Rajat.
Volume 21, Issue 13, Pages (December 2017)
Figure S1: Gene importance plot derived from Variable/ Feature selection using machine learning on the training dataset. MeanDecreaseGini is the measure.
Fig. 4. BET bromodomain inhibition suppresses transactivation of shared transcriptional networks across HF models. BET bromodomain inhibition suppresses.
Volume 138, Issue 4, Pages (August 2009)
Genomic Investigation of Lupus in the Skin
Molecular Classification of Crohn's Disease and Ulcerative Colitis Patients Using Transcriptional Profiles in Peripheral Blood Mononuclear Cells  Michael.
Volume 11, Issue 2, Pages (August 2012)
Volume 13, Issue 9, Pages (December 2015)
Volume 9, Issue 3, Pages (September 2017)
Volume 62, Issue 2, Pages (April 2016)
Transcriptional Landscape of Cardiomyocyte Maturation
Understanding Tissue-Specific Gene Regulation
Integrated protein co‐expression and differential expression across the ALS‐FTD disease continuum Integrated protein co‐expression and differential expression.
Volume 33, Issue 4, Pages e6 (April 2018)
Human-Specific Transcriptional Networks in the Brain
Volume 3, Issue 1, Pages (July 2016)
System approaches to analyze the effects of Zc3h10 knockdown
Volume 7, Issue 3, Pages (September 2016)
Volume 6, Issue 4, Pages (April 2010)
Novel functional roles of uncharacterized genes as functional regulators of cellular cholesterol levels. Novel functional roles of uncharacterized genes.
Volume 23, Issue 3, Pages (March 2016)
Neurogenins induce a network of transcription factors that mediate iNGN neurogenesisA network of transcription factors involved in iNGN neurogenesis was.
Molecular Therapy - Nucleic Acids
The Omics Dashboard.
Volume 132, Issue 6, Pages (March 2008)
Volume 4, Issue 3, Pages e3 (March 2017)
Predicting Gene Expression from Sequence
Volume 122, Issue 6, Pages (September 2005)
Volume 19, Issue 5, Pages (May 2017)
Gene Density, Transcription, and Insulators Contribute to the Partition of the Drosophila Genome into Physical Domains  Chunhui Hou, Li Li, Zhaohui S.
Brandon Ho, Anastasia Baryshnikova, Grant W. Brown  Cell Systems 
Maria S. Robles, Sean J. Humphrey, Matthias Mann  Cell Metabolism 
Identification of aging-related genes and affected biological processes. Identification of aging-related genes and affected biological processes. (A) Experimental.
Statistical chart of significantly differentially expressed genes
Enter the Matrix: Factorization Uncovers Knowledge from Omics
Lab-Specific Gene Expression Signatures in Pluripotent Stem Cells
The CREBBP-modulated network is enriched in signaling pathways upregulated in the light zone (LZ). The CREBBP-modulated network is enriched in signaling.
Volume 4, Issue 2, Pages e4 (February 2017)
Volume 10, Issue 1, Pages (January 2017)
The Zfhx3-Mediated Axis Regulates Sleep and Interval Timing in Mice
Volume 3, Issue 6, Pages e3 (December 2016)
Distinct subtypes of CAFs are detected in human PDAC
Volume 25, Issue 5, Pages e4 (May 2017)
Volume 27, Issue 7, Pages e4 (May 2019)
Presentation transcript:

Human growth is associated with distinct patterns of gene expression in evolutionarily conserved networks Adam Stevens, Daniel Hanson, Andrew Whatmore, Benoit Destenaves, Pierre Chatelain, Peter Clayton Supplementary Information: Introduction In this Supplementary Information we make available additional data that were discussed in the main manuscript. Table of Contents Supplemental Figures Figure S1. Generation of the main data set. Figure S2. Age related differences in gene ontology. Figure S3. Age related differences in expression of genes within canonical pathways. Figure S4. Identification of transcription factors that are expected to be activated or inhibited, given the observed gene expression changes in the three clusters of age related genes. Figure S5. Analysis of network topology. Figure S6. Analysis of protein connectedness (degree) in the human interactome as a measure of protein function within genes within age-related expression clusters from temporal lobe human brain tissue (GSE37721, Sterner et al 2012).

Figure S1. Generation of the main data set. Homogeneity of multiple lymphoid control datasets was demonstrated using: A) Principal component analysis (PCA), axes are the first three “components” marking the amount of variance each explains (%). B) Multiple dimensional scaling (MDS) using Isomapping procedure (Tenenbaum et al, 2000), axes represent a three dimensional contraction of multiple components (%). Homogeneity was assessed using cross-validation (X-validation) where one sample is sequentially removed and its effect on the distribution observed. To assess the effect of different age and gender distributions within the different studies used to form the main data set sliding-window MDS using isomapping was performed over the age range in groups of four; examples are shown C) age groups <2, ≤4, ≤6 & ≤8 years of age& D) age groups ≤8, ≤10, ≤ 12 & ≤14 years of age. Pink = GSE9006, green = TABM666, blue = GSE26440 & yellow = GSE Similar age-related clusters were shown in all data sets (ANOVA, p<0.05, gender as co-variate): E) GSE 9066, 540 probe–sets, F) TABM666, 4579 probe-sets, G) GSE26440, 603 probe-sets, H) GSE11504, 1828 probe–sets. Horizontal axis = age in years of sample. A) B) E) F) G) H) GSE9006 (n=24) TABM666 (n=16) GSE26440 (n=22) GSE11504 (n=25) Age groups <2, ≤4, ≤6 & ≤8 ISOMAP C) Full data set for X-validation PCA Full data set for X-validation ISOMAP Age groups ≤8, ≤10, ≤ 12 & ≤14 ISOMAP D)

Figure S2. Age related differences in gene ontology. Forrest plot of biological process gene ontology ANOVA comparing different age groups ranked by false discovery rate modified p-value (q), dark green = down-regulated genes (q<0.05), dark red = up-regulated genes (q<0.05); and by unmodified p-value (p), light green = down-regulated genes (p<0.05), pink = up- regulated genes (p<0.05). 0-6 v v v Age Group Comparisons (age in years)

A. Metabolic Pathways B. Signalling Pathways Figure S3. Age related differences in expression of genes within canonical pathways. Biological pathways were associated with the three clusters of age related genes as identified from the KEGG database (Webgestalt); ≤6yrs [Infancy, Early Childhood]; >6 to ≤17yrs [Late Childhood, Puberty] and >17yrs [Adult, Final Height] (hypergeometric test, q<0.2). A. Metabolic pathways. B. Signalling Pathways. Thiamine Glycine, Serine and Threonine Riboflavin Cysteine and Methionine Fructose and Mannose Pyrimidine Arachidonic acid Purine Porphyrin Nitrogen Infancy Adult Puberty Neurotrophin Calcium VEGF TLR TCR Chemokine Jak-STAT TGFB p53 Adipocytokine Infancy AdultPuberty PPAR BCR MAPK Histidine Fatty acid

Infancy Puberty Adult Figure S4. Identification of transcription factors that are expected to be activated or inhibited, given the observed gene expression changes in the three clusters of age related genes; ≤6yrs [Infancy, Early Childhood] ; >6 to ≤17yrs [Late Childhood, Puberty] and >17yrs [Adult, Final Height]. If the predicted transcription factor is also present in the dataset then the direction of the fold change in gene expression is shown (  = up-regulated,  = down-regulated). This analysis is based on expected causal effects between transcription factors and targets; the expected causal effects are derived from the literature compiled in the Ingenuity® Knowledge Base. The analysis examines the known targets of each transcription factor in the dataset, compares the targets’ direction of change to expectations derived from the literature, then issues a prediction for each transcription factor based on the direction of change. The direction of change is the gene expression in the experimental samples relative to a control. The z-score predicts the activation state of the transcription factor, using the gene expression patterns of the transcription factor and its downstream genes. An absolute z-score of ≥ 2 is considered significant. A transcription factor is predicted to be activated if the z-score is ≥ 2, inhibited if the z-score ≤ ‑ 2. The p-value of overlap is calculated by the Fisher’s Exact Test and indicates the statistical significance of genes in the dataset that are downstream of the transcription factor.

Figure S5. Analysis of network topology. A. A schematic representation of network “Hubs” (H), “Bottlenecks” (B) and “Hub- Bottlenecks” (HB); all network features associated with essential biological function (Yu et al, 2007 & Sun et al 2010). B. A flow diagram showing how differential gene expression data is used to generate an inferred protein:protein interaction (PPI) network derived from a model of the human interactome (Biogrid ), the top 10% “hubs” and “bottlenecks” are then used to generate a minimal essential network and gene expression associated essential pathways are defined. A) B) H H BHB H ModulePathwaysFDR 1Biological pathway 1<0.05 1Biological pathway 2<0.05 2Biological pathway 3<0.05 2Biological pathway 4<0.05 3Biological pathway 5<0.05 3Biological pathway6<0.05 4Biological pathway 7<0.05 Interactome Model of Gene Expression Data Minimal Essential Network Network Topology Differential Gene Expression PathwaysRatio Biological pathway 1<1 Biological pathway 2<1 Biological pathway 3<1 Biological pathway 4<1 Biological pathway 5<1 Biological pathway6<1 Biological pathway 7<1 Pathway Ontology From Differential Gene Expression Pathway Ontology From Minimal Essential Network = Gene Expression Associated Essential Pathways H = hub = highly connected protein B = Bottleneck = a network that limits flow of information HB = both a hub and a bottleneck “Hubs & Bottlenecks”

Figure S6. Analysis of protein connectedness (degree) in the human interactome as a measure of protein function within genes within age-related expression clusters from temporal lobe human brain tissue (GSE37721, Sterner et al 2012). Growth phase related gene expression clusters derived from human temporal lobe brain tissue were grouped using the same binning as in the main data set, ≤6 years of age [infancy, early childhood group (n=7)]; >6 to ≤17 years of age [late childhood, puberty group (n=17)]; and >17 to <30 years of age [adulthood (n=6)], protein connectedness was measured from a model of the human interactome (Biogrid build ) and plotted against the frequency of proteins of specific degree. Age/growth phase related gene clusters as follows; Infancy, blue marker n= 232; Childhood/Puberty, red marker n= 176; Adult/Final height, green marker n= 165. Adult v. Infancy group p<0.05, Infancy group v Puberty group p<0.15, Wilcoxon test. Protein Connectedness (Degree) Protein Connectedness Frequency Interactome Protein Connectivity in Temporal Lobe Brain Tissue Infancy Childhood/Puberty Adult