‘Omics’ - Analysis of high dimensional Data

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
A Factor Graph Model for Minimal Gene Set Enrichment Analysis Diana Uskat Computational Biology - Gene Center Munich.
Gene Ontology (GO) term “enrichment status” for targets of pollen– and sporophyte- enriched kn-miRNA. A, term enrichment levels along with the GO term.
A novel approach to analysis of primary HTS data Compound Set Enrichment Thibault VarinAnsgar Schuffenhauer Gubler, H., Parker, C., Zhang, JH., Raman,
1 Statistics Achim Tresch UoC / MPIPZ Cologne treschgroup.de/OmicsModule1415.html
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Achim Tresch Computational Biology ‘Omics’ - Analysis of high dimensional Data.
Introduction to Microarry Data Analysis - II BMI 730
Model-based clustering of gene expression data Ka Yee Yeung 1,Chris Fraley 2, Alejandro Murua 3, Adrian E. Raftery 2, and Walter L. Ruzzo 1 1 Department.
Data mining with the Gene Ontology Josep Lluís Mosquera April 2005 Grup de Recerca en Estadística i Bioinformàtica GOing into Biological Meaning.
Simultaneous Identification of Multiple Driver Pathways in Cancer Mark D. M. Leiserson, et.al.
Using Gene Ontology Models and Tests Mark Reimers, NCI.
Differentially expressed genes
Clustering (Part II) 10/07/09. Outline Affinity propagation Quality evaluation.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Biological Interpretation of Microarray Data Helen Lockstone DTC Bioinformatics Course 9 th February 2010.
ICA-based Clustering of Genes from Microarray Expression Data Su-In Lee 1, Serafim Batzoglou 2 1 Department.
Gene Set Enrichment Analysis Petri Törönen petri(DOT)toronen(AT)helsinki.fi.
1Module 2: Analyzing Gene Lists Canadian Bioinformatics Workshops
GO::TermFinder Gavin Sherlock Department of Genetics Stanford University
Getting the story – biological model based on microarray data Once the differentially expressed genes are identified (sometimes hundreds of them), we need.
GenMAPP and MAPPFinder for Systems Biology Education Kam Dahlquist Vassar College June 12-20, 2004 BioQUEST Summer Workshop Beloit College.
Probabilistic Models that uncover the hidden Information Flow in Signalling Networks.
Jesse Gillis 1 and Paul Pavlidis 2 1. Department of Psychiatry and Centre for High-Throughput Biology University of British Columbia, Vancouver, BC Canada.
Probabilistic Models that uncover the hidden Information Flow in Signalling Networks Achim Tresch.
Gene Expression Data Analysis Lab Session CAD course Jian Li
1 Gene Ontology Javier Cabrera. 2 Outline Goal: How to identify biological processes or biochemical pathways that are changed by treatment.Goal: How to.
Summarizing Differential Expression Using Mann-Whitney U-tests.
An Overview of Clustering Methods Michael D. Kane, Ph.D.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Cluster validation Integration ICES Bioinformatics.
Mining publicly available microarray data Frances Turner
Getting the story – biological model based on microarray data Once the differentially expressed genes are identified (sometimes hundreds of them), we need.
GO enrichment and GOrilla
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring T.R. Golub et al., Science 286, 531 (1999)
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer Max Leiserson *, Hsin-Ta Wu *, Fabio Vandin, Benjamin.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
6/11/20161 Graph models and efficient exact algorithms in studying cancer signaling pathways Songjian Lu, Lujia Chen, Chunhui Cai Department of Biomedical.
1 A Discussion of False Discovery Rate and the Identification of Differentially Expressed Gene Categories in Microarray Studies Ames, Iowa August 8, 2007.
David Amar, Tom Hait, and Ron Shamir
The Omics Dashboard Suzanne Paley Pathway Tools Workshop 2018
Pathweavers Elizabeth McClellan Ribble, Ph.D.
Behaviorally dependent allele-specific expression.
Physics-based simulation for visual computing applications
Genome-Wide Analysis of PLT2 Binding to Target Genes.
Phenotype validation: a novel heat shock function for the copper-responsive regulator CopR. Phenotype validation: a novel heat shock function for the copper-responsive.
ADAGE model example. ADAGE model example. For one sample in the expression compendium (one column in the figure with red or green colors, representing.
JNK signature in breast cancer cells is linked to ECM, stem cell, and wound healing gene networks and is enriched in basal‐like breast cancer JNK signature.
System approaches to analyze the effects of Zc3h10 knockdown
Meta-analysis of the Listeriomics transcriptomic data sets.
Genes affected concordantly within the ileum and meta-analysis data sets formed biofunctional clusters of overlapping pathways. Genes affected concordantly.
Categorization of the top 200 N
Functional enrichment of differentially expressed genes.
ChEA, ChIP Enrichment Analysis.
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
The Omics Dashboard.
Gene ontology (GO) term enrichment for significantly downregulated genes. Gene ontology (GO) term enrichment for significantly downregulated genes. Visualization.
Reanalysis of an existing experiment using ADAGE
Derivation of ImSig. Derivation of ImSig. A, An example of a correlation network generated from a tissue data set where nodes represent unique genes and.
The log-transformed reporter Z scores of the metabolic pathways showing significant differences between the PCOS and control groups. The log-transformed.
Transcriptional and genomic targets of EN1 in TNBC cells.
CREBBP loss-of-function results in gene expression repression signature. CREBBP loss-of-function results in gene expression repression signature. A–D,
Gene expression profiles of T cells.
Transcriptome profiling of PD-L1 antibody–treated macrophages showed inflammatory phenotype, increased survival and proliferation, and decreased apoptosis.
Characteristic gene expression patterns distinguish LCH cells from other immune cells present in LCH lesions. Characteristic gene expression patterns distinguish.
Presentation transcript:

‘Omics’ - Analysis of high dimensional Data Achim Tresch Computational Biology

Topics Hypergeometric test [Khatri and Draghici 2005] Kolmogorov-Smirnov test [Subramanian et al. 2005]

Gene Set Enrichment

Fisher‘s exact test, once more

Fisher‘s exact test, once more

Gene Ontology Example 559

(macromolecule biosynthesis) Gene Ontology Example (immune response) (macromolecule biosynthesis)

Kolmogorov-Smirnov Test < 10-10 Move 1/K up when you see a gene from group a Move 1/(N-K) down when you see a gene not in group a

Topics

GO scoring: general problem

GO Independence Assumption GO sets light yellow

GO Independence Assumption light yellow

The elim method

Top 10 significant nodes (boxes) obtained with the elim method

The weight method

The weight method

The weight method (x) (x)}

Top 10 significant nodes (boxes) obtained with the elim method The weight method Top 10 significant nodes (boxes) obtained with the elim method

Algorithms Summary

Topics

Significant GO terms in the ALL dataset Top scoring GO term Significant GO terms in the ALL dataset

Advantages & Disadvantages for ALL

Prostate cancer progression

Prostate cancer progression

Prostate cancer progression

Influence of the p-values adjustment

Simulation Study Introduce noise

Simulation Study

Simulation Study

Quality of GO scoring methods 10% noise level 40% noise level

Summary

Adrian Alexa MPI Saarbrücken Acknowledgements Adrian Alexa MPI Saarbrücken