2003 Inferring Connection Maps from AfCS Experimental Data and Legacy Data.


[Diagram: Alliance for Cellular Signaling. Components (parts list); interactions and networks; computational models; context-specific.]

Data Analysis: Our experiments measure genes, proteins and key metabolites. What are the underlying biological relationships among these entities? The cell functions as an integrated system involving all these players. How can we analyze our data to reveal this interconnectedness?

Reconstructing Networks

Signal Transduction in a Cell (from Downward, Nature, August 2001)

Significance analysis of microarrays (SAM) (R. Tibshirani, G. Chu 2002)
Objective: The replicated expression values for each gene at the 4 hr time point (untreated vs. ligand) are used to determine whether the gene is statistically significantly up- or down-regulated. The t-statistics for all the genes are ordered and recorded. The labels are then permuted and the t-statistics are calculated again. After many iterations, the accumulated t-statistics are averaged for each gene. Finally, for a given false positive rate (called the “False Discovery Rate” or FDR), the significant genes are selected. For each gene, the adjusted “t-statistic” is defined as follows:
$$ d = \frac{\mu_{\text{treated}} - \mu_{\text{untreated}}}{\sigma + \text{adjustment factor}} $$
where μ is the mean of the replicates and σ is the standard deviation for the gene.
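The permutation procedure described on this slide can be sketched in a few lines of Python. This is a simplified illustration, not the SAM software itself: the pooled standard error used for σ, the default adjustment factor `s0 = 0.1`, the symmetric cutoff `delta`, and all function names are assumptions made for the example, and the step that converts a cutoff into an FDR estimate is omitted.

```python
import random
from statistics import mean

def adjusted_t(treated, untreated, s0=0.1):
    """Adjusted t-like statistic for one gene from two lists of replicates."""
    n1, n2 = len(treated), len(untreated)
    m1, m2 = mean(treated), mean(untreated)
    # Pooled sum of squares around each group mean (assumed form of sigma).
    ss = sum((v - m1) ** 2 for v in treated) + sum((v - m2) ** 2 for v in untreated)
    sigma = ((1 / n1 + 1 / n2) * ss / (n1 + n2 - 2)) ** 0.5
    return (m1 - m2) / (sigma + s0)

def expected_ordered_stats(genes, n_treated, s0=0.1, n_perm=200, seed=0):
    """Average the sorted statistics over random relabelings of the columns."""
    rng = random.Random(seed)
    n_cols = len(genes[0])
    totals = [0.0] * len(genes)
    for _ in range(n_perm):
        cols = list(range(n_cols))
        rng.shuffle(cols)
        a, b = cols[:n_treated], cols[n_treated:]
        stats = sorted(
            adjusted_t([g[i] for i in a], [g[i] for i in b], s0) for g in genes
        )
        totals = [t + s for t, s in zip(totals, stats)]
    return [t / n_perm for t in totals]

def call_significant(genes, n_treated, delta, s0=0.1, n_perm=200, seed=0):
    """Indices of genes whose observed statistic differs from the
    permutation expectation at the same rank by more than delta."""
    observed = [adjusted_t(g[:n_treated], g[n_treated:], s0) for g in genes]
    order = sorted(range(len(genes)), key=lambda i: observed[i])
    expected = expected_ordered_stats(genes, n_treated, s0, n_perm, seed)
    return sorted(
        i for rank, i in enumerate(order)
        if abs(observed[i] - expected[rank]) > delta
    )
```

Each row of `genes` holds the treated replicates first, followed by the untreated ones. Raising `delta` makes the call more conservative, which in the full method corresponds to a lower FDR.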

[Figure: two-way dendrogram using significantly expressed genes (4 hr), 2670 unique genes. Annotations: “mitogenic” ligands; FDR = 1%, FDR = 35%, FDR = 18%, FDR = 1%-3%.]

Concordance of significantly up- (+) or down- (-) regulated genes, mitogenic ligands (FDR = 1%)
[Figure: mosaic plot of “up-regulated” and “down-regulated” matches between ligand pairs, with a discordance matrix.]
Example: CD40L had 756 down-regulated and 1082 up-regulated genes. Of these, 337 (down) and 578 (up) were regulated in the same direction in AIG.
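The concordance counts between two ligands can be tallied with a small helper. This is a minimal sketch, assuming each ligand's significant genes are stored as a dictionary mapping a gene name to +1 (up) or -1 (down); the function name and the gene names are hypothetical.

```python
def concordance(calls_a, calls_b):
    """Count genes called in the same or opposite direction for two ligands.

    calls_a, calls_b: dict mapping gene -> +1 (up) or -1 (down).
    Genes absent from a dict were not called significant for that ligand.
    """
    shared = calls_a.keys() & calls_b.keys()
    up = sum(1 for g in shared if calls_a[g] == 1 and calls_b[g] == 1)
    down = sum(1 for g in shared if calls_a[g] == -1 and calls_b[g] == -1)
    discordant = sum(1 for g in shared if calls_a[g] != calls_b[g])
    return {"up": up, "down": down, "discordant": discordant}

# Hypothetical calls for two ligands.
ligand_1 = {"gene1": 1, "gene2": -1, "gene3": 1, "gene4": -1}
ligand_2 = {"gene1": 1, "gene2": 1, "gene3": 1, "gene5": -1}
print(concordance(ligand_1, ligand_2))
```

The "up" and "down" entries correspond to the matches shown in the mosaic plot, and the "discordant" entry to one cell of the discordance matrix.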

Beyond Clustering: How can we obtain biological information from array data at the level of individual genes and correlations in expression between genes? Can we use the correlations to build a connection network that reflects correlations in expression? Is there biological significance to this?

Two-way hierarchical clustering of ligands and phosphoprotein levels (mean ratio vs. control). Note: the ligands that elicit an ERK response (chemokines + AIG, CD40L) clustered together. A correspondence plot (below) also showed this grouping.

Similarity measures between genes under different conditions with respect to expression levels for:
… groups of genes → clustering methods
… pairs of genes → correlation methods
$$ \text{Covariance} = \sum_{k=1}^{N} \{el(x(k)) - x_{mean}\}\{el(y(k)) - y_{mean}\} = r_{xy} $$
$$ \text{Correlation} = \frac{r_{xy}}{\sigma_x \sigma_y} $$
where el(x(k)) is the expression level of gene x under condition k, x_mean is the mean expression level of gene x over the N different conditions, and σ_x is the standard deviation for gene x. For the correlation to lie between -1 and 1, the covariance and the standard deviations must use the same normalization (for example, a factor of 1/N).
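As a worked illustration of the pairwise measure, the correlation between two expression profiles can be computed directly from these definitions. The sketch below assumes a 1/N normalization in both the covariance and the standard deviations; the function name and the example values are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Correlation between two expression profiles over the same N conditions."""
    n = len(x)
    x_mean, y_mean = sum(x) / n, sum(y) / n
    # Covariance and standard deviations share the 1/N normalization.
    cov = sum((a - x_mean) * (b - y_mean) for a, b in zip(x, y)) / n
    sd_x = sqrt(sum((a - x_mean) ** 2 for a in x) / n)
    sd_y = sqrt(sum((b - y_mean) ** 2 for b in y) / n)
    return cov / (sd_x * sd_y)

# Hypothetical expression levels of two genes under five conditions.
gene_x = [1.0, 2.0, 3.0, 4.0, 5.0]
gene_y = [2.1, 3.9, 6.2, 8.0, 9.8]
print(round(pearson(gene_x, gene_y), 3))
```

A profile with no variation across conditions has a zero standard deviation, so such genes would need to be filtered out before calling this function.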

The transcription factor encoded by fos is stabilized by ERK and continues to affect other IE genes such as jun (from Nature Cell Biology, August 2002)

Schematic interpretation of ERK signal duration for the IE gene product of fos
[Figure: cross-correlation matrices. Panels: transcription response from “non-ERK” ligands; transcription response from “ERK” response ligands.]

Microarray analysis model using gene expression profiles
[Diagram labels: DNA; Gene A, Gene B, Gene C, Gene D; mRNA; Protein; P (phosphorylation).]
Signal transduction is most likely regulated at the protein level, but the downstream signal at the transcriptional level is the resultant output of the upstream (outside the nucleus) signal input. The complexity of signal information processing increases at the transcription level, but some information flows back upstream and oscillates in an input/output fashion.

Beyond Clustering: Mechanisms for inducing high correlation between genes in their expression profiles
– A direct interaction
– An indirect interaction (the regulatory information of the gene A product is transferred through the expression of some other genes to induce the expression of gene B)
– Regulation by a common gene (the expression of genes A and B is regulated by a common gene)

Mitogen-Activated Protein Kinase Pathways Mediated by ERK, JNK, and p38 Protein Kinases. G. L. Johnson and R. Lapadat, Science 2002 December 6; 298: (in Review)

Transcriptional effects downstream from proteins recruited in MAPK cascades (Hazzalin et al., Nature Cell Biology, 2002)

“Marginal” correlation
“Marginal” global correlation (for ligand j):
$$ \text{difference in correlation} = r^2_{xy,\ \text{all}} - r^2_{xy,\ \text{all except ligand } j} $$
Red indicates positive influence on the gene upon removing ligand j.
Green indicates negative influence on the gene upon removing ligand j.
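The leave-one-ligand-out difference can be computed as follows. This is a minimal sketch, assuming each gene's expression is stored as a list with one value per ligand; the function names and the example values are hypothetical.

```python
def r_squared(x, y):
    """Squared correlation between two profiles of equal length."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy * sxy / (sxx * syy)

def marginal_correlation(x, y):
    """For each ligand j: r^2 over all ligands minus r^2 with ligand j left out."""
    r2_all = r_squared(x, y)
    return [
        r2_all - r_squared(x[:j] + x[j + 1:], y[:j] + y[j + 1:])
        for j in range(len(x))
    ]

# Hypothetical profiles of two genes across five ligands; the last ligand
# dominates the overall correlation.
gene_x = [0.0, 1.0, 2.0, 3.0, 10.0]
gene_y = [0.0, 1.0, 0.0, 1.0, 10.0]
for j, value in enumerate(marginal_correlation(gene_x, gene_y)):
    print(j, round(value, 3))
```

A large positive value for ligand j means that the global correlation between the two genes depends heavily on that ligand.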

“Marginal” correlation: IE genes downstream from MAPK; ligands, n = 33. Idea: the marginal correlation indicates the “leverage” of a particular ligand on the global correlation coefficient for a gene.

Marginal Correlations between Genes
– Provides a “biologically” driven approach to discriminating ligand responses at the gene and gene-product level.
– Serves as a pathway-driven hypothesis generation method for QRT-PCR.
– Suggests ideal double-ligand experiments to explore major signaling pathways that lead to downstream gene expression changes.

“Marginal” correlation signatures: IE genes downstream from MAPK; ligands, n = 33. Δ correlation coefficient: green = negative, red = positive. Panel: mitogenic ligands.

“Marginal” correlation signatures: IE genes downstream from MAPK; ligands, n = 33. Δ correlation coefficient: green = negative, red = positive. Panel: chemokines. No obvious pattern, so consider data reduction.

[Figure panels: mitogenic; chemokines]

For the case of ligand 2MA… cAMP responsive element modulator

Marginal Correlations averaged over Pathway-Specific Genes (title repeated on four consecutive figure slides)
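One way to carry out this kind of data reduction is to average the per-gene marginal correlations within each pathway gene set, ligand by ligand. The sketch below is an assumption about how such an average could be formed, not the procedure used for the figures; the function name, the data layout, the pathway label, and the numeric values are hypothetical.

```python
def pathway_average(marginal, pathways):
    """Average per-gene marginal correlations within each pathway gene set.

    marginal: dict gene -> {ligand: marginal correlation}
    pathways: dict pathway name -> list of gene names
    Genes without marginal-correlation data are skipped.
    """
    result = {}
    for name, genes in pathways.items():
        present = [g for g in genes if g in marginal]
        ligands = sorted({lig for g in present for lig in marginal[g]})
        averages = {}
        for lig in ligands:
            values = [marginal[g][lig] for g in present if lig in marginal[g]]
            averages[lig] = sum(values) / len(values)
        result[name] = averages
    return result

# Hypothetical marginal correlations for two IE genes and two ligands.
marginal = {
    "fos": {"AIG": 0.2, "LPS": -0.1},
    "jun": {"AIG": 0.4, "LPS": 0.1},
}
pathways = {"ERK": ["fos", "jun", "junB"]}
print(pathway_average(marginal, pathways))
```

Averaging over a gene set trades gene-level detail for a single signature per pathway and ligand, which is easier to compare across ligands.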

Transcription factor binding sites immediately upstream from the “immediate-early” genes fos and jun (Hazzalin et al., Nature Cell Biology, 2002). Legend: marked items = expression measured indirectly in the AfCS ligand experiment.

Difference in IE gene cross-correlations from ligands that involve the ERK pathway. Partial correlations (critical level p = ). Panel: ligands that stimulate ERK. Note: junB expression was not detected.

Difference in IE gene cross-correlations from ligands that involve the ERK pathway. Partial correlations (critical level p = ). Diagram labels: ERK, CREM, a, h, k. Possible interpretation of a gene regulatory network.

Genes Correlated by Gene Expression from Legacy Data extracted from Pathway Assist (Stratagene Database)
J Biol Chem 1998 Nov 20;273(47): The transcription factors ID{-3321=Elk-1} and ID{-11291=Serum Response Factor} are necessary for GH-stimulated transcription of ID{-3796=c-fos} through the Serum Response Element (SRE).
Proc Natl Acad Sci U S A 1991 Jun 15;88(12): Furthermore, expression of antisense ID{2352=CREM} enhances ID{-3796=c-fos} basal and cAMP-induced transcription.
Neurol Res 2000 Mar;22(2): In the non-trauma patients 36% expressed ID{-3796=c-fos} and 73% expressed ID{-6204=c-jun} mRNA, with all patients studied expressing ID{-3796=c-Fos} and ID{-6204=c-Jun} proteins.
Mol Cell Biol 1991 Jan;11(1): We observe that the expression of endogenous ID{-6204=c-jun} and ID{-6205=jun B} genes is induced by E1A, which directly transactivates the promoters of ID{-3796=c-fos}, ID{-6204=c-jun}, and ID{-6205=jun B}.

Connections at the Protein Level from Legacy Data extracted using Pathway Assist (Stratagene)

Full view of two-way dendrogram
Two-way dendrogram from the AfCS ligand screen using the probes (genes) relating to the “immediate-early” genes (with additional genes that encode MAPK proteins involved in the cascade).
Summary: The transcription profiles of these selected genes distinguished the “mitogenic” ligands (AIG, CPG, CD40L, IL-4, IL-10, LPS) from the “non-mitogenic” ligands at the 2 hr / 4 hr time period. Since the upstream MAPK-ERK pathway is involved in cell proliferation, this would be expected under ideal experimental conditions. That a distinct two-way “bicluster” (mitogenic ligands clustered with the IE genes from MAPK-ERK) emerges as a first-pass result of the microarray experiment is highly encouraging. This “semi-supervised” approach indicates that our expression data are biologically informative.

Kohn’s Mammalian Cell Cycle Map (with AfCS genes) (title repeated on three consecutive figure slides)

[Figure panels: non-mitogenic ligand response gene correlations; mitogenic ligand response gene correlations]

MYC Box and related genes

MYC Connection Map: genetic regulatory module generated by partial correlations (critical value = $10^{-6}$)
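A connection map of this kind can be approximated with first-order partial correlations, which help separate direct links from correlations induced by a shared or intermediate gene. The sketch below is a simplification and not the method used for the slide: it conditions on one other gene at a time rather than on all remaining genes, and it uses Fisher's z transform for the p-value. All function names and the example profiles are hypothetical.

```python
from math import atanh, erfc, sqrt

def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def partial_corr(x, y, z):
    """Correlation between x and y after removing the linear effect of z."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / sqrt((1 - rxz ** 2) * (1 - ryz ** 2))

def p_value(r, n, k=1):
    """Two-sided p-value from Fisher's z transform with k conditioning genes."""
    r = max(-0.999999, min(0.999999, r))  # keep atanh finite
    z_stat = atanh(r) * sqrt(n - k - 3)
    return erfc(abs(z_stat) / sqrt(2))

def connection_map(profiles, critical=1e-6):
    """Keep edge (a, b) only if it stays significant given each third gene.

    profiles: dict gene -> list of expression values (at least three genes).
    """
    genes = sorted(profiles)
    n = len(profiles[genes[0]])
    edges = []
    for i, a in enumerate(genes):
        for b in genes[i + 1:]:
            others = [c for c in genes if c not in (a, b)]
            worst = max(
                p_value(partial_corr(profiles[a], profiles[b], profiles[c]), n)
                for c in others
            )
            if worst < critical:
                edges.append((a, b))
    return edges
```

In a chain where gene B drives both A and C, the pair (A, C) is strongly correlated but its partial correlation given B is near zero, so the edge A-C is dropped while A-B and B-C are kept.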

Literature-derived expression-based connection maps for all AfCS proteins
[Figure annotation: AfCS proteins with no known connections]
