Gene Set Enrichment Analysis. GSEA: Key Features Ranks all genes on array based on their differential expression Identifies gene sets whose member genes.

Slides:



Advertisements
Similar presentations
Figure S1: (A) Table shows number of genes that passed Welchs t-test at different q-values (FDR corrected p-values) and Fold Change cut-offs. (B) Immune.
Advertisements

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Gene Set Enrichment Analysis (GSEA)
Author: Jim C. Huang etc. Lecturer: Dong Yue Director: Dr. Yufei Huang.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Biological Interpretation of Microarray Data Helen Lockstone DTC Bioinformatics Course 9 th February 2010.
Significance Tests P-values and Q-values. Outline Statistical significance in multiple testing Statistical significance in multiple testing Empirical.
 2 Outline  Review of major computational approaches to facilitate biological interpretation of  high-throughput microarray  and RNA-Seq experiments.
EnrichNet: network-based gene set enrichment analysis Presenter: Lu Liu.
Gene Set Enrichment Analysis (GSEA)
Jesse Gillis 1 and Paul Pavlidis 2 1. Department of Psychiatry and Centre for High-Throughput Biology University of British Columbia, Vancouver, BC Canada.
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
GSEA Overview -- Workflow GSEA is a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
BIOS6660 shRNAseq Gene Set Enrichment Analysis Tzu L Phang PhD Robert Stearman PhD April 16, 2014.
SUPPLEMENTAL FIGURES AND TABLES. Supplementary Table 1: List of new and improved features in GSEA-P version 2 Java software. Examples and screenshots.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
The Broad Institute of MIT and Harvard Differential Analysis.
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Estimating the False Discovery Rate in Genome-wide Studies BMI/CS 576 Colin Dewey Fall 2008.
Canadian Bioinformatics Workshops
David Amar, Tom Hait, and Ron Shamir
Module 2: Analyzing gene lists: over-representation analysis
A Multi-stage Approach to Detect Gene-gene Interactions Associated with Multiple Correlated Phenotypes Zhou Xiangdong,Keith Chan, Danhong Zhu Department.
Raw data VS. Residual value
Identification of synovial fluid microRNA signature in knee osteoarthritis: differentiating early- and late-stage knee osteoarthritis  Y.-H. Li, G. Tavallaee,
Significance analysis of microarrays (SAM)
Volume 2, Issue 2, Pages (February 2016)
Genesets and Enrichment
A bioinformatic analysis of microRNAs role in osteoarthritis
Statistical analysis of the influence of network regulators on topology clusters in the oleate network. Statistical analysis of the influence of network.
Volume 18, Issue 12, Pages (March 2017)
P53 Mediates Vast Gene Expression Changes That Contribute to Poor Chemotherapeutic Response in a Mouse Model of Breast Cancer  Crystal Tonnessen-Murray,
RNA-seq analysis of iWAT and eWAT.
Volume 33, Issue 1, Pages (July 2010)
Volume 11, Issue 2, Pages (August 2012)
Identification of synovial fluid microRNA signature in knee osteoarthritis: differentiating early- and late-stage knee osteoarthritis  Y.-H. Li, G. Tavallaee,
Volume 14, Issue 12, Pages (March 2016)
Activin reprograms macrophages toward a TAM‐like phenotype
Deletion of Lats1 is phenotypically distinct from Lats2 deletion in PyMT tumors. Deletion of Lats1 is phenotypically distinct from Lats2 deletion in PyMT.
Volume 22, Issue 3, Pages (January 2018)
Exercise-driven metabolic pathways in healthy cartilage
Genes affected concordantly within the ileum and meta-analysis data sets formed biofunctional clusters of overlapping pathways. Genes affected concordantly.
Molecular Mechanisms Regulating the Defects in Fragile X Syndrome Neurons Derived from Human Pluripotent Stem Cells  Tomer Halevy, Christian Czech, Nissim.
Volume 22, Issue 3, Pages (January 2018)
Molecular Therapy - Nucleic Acids
Volume 17, Issue 19, Pages (October 2007)
Metabolite predictability across metabolic categories (A) and disease states (B) in the vaginal microbiome. Metabolite predictability across metabolic.
AZA treatment induces a distinct gene-expression pattern in stromal cells. AZA treatment induces a distinct gene-expression pattern in stromal cells. (A-C)
Enrichment of microbial functions with elevated triclosan.
Expression profiles of 5,493 transcripts grouped by k-means clustering
Statistical chart of significantly differentially expressed genes
Reanalysis of an existing experiment using ADAGE
Stage-specific expression modules of preimplantation development.
Extended analysis of differential expression datasets.
The CREBBP-modulated network is enriched in signaling pathways upregulated in the light zone (LZ). The CREBBP-modulated network is enriched in signaling.
Volume 2, Issue 3, Pages (March 2016)
Fig. 2 Inflammatory pathways with STAT1 as a key regulator are enriched in ICB responsive tumors in mouse models and patients. Inflammatory pathways with.
Loss of RNA-Binding Protein Sfpq Causes Long-Gene Transcriptopathy in Skeletal Muscle and Severe Muscle Mass Reduction with Metabolic Myopathy  Motoyasu.
Distinct molecular and clinical correlates of H3F3A mutation subgroups
Transcriptional and genomic targets of EN1 in TNBC cells.
CREBBP loss-of-function results in gene expression repression signature. CREBBP loss-of-function results in gene expression repression signature. A–D,
Gene expression profiles of T cells.
Identification of putative TET1 targets in TNBC
Distinct subtypes of CAFs are detected in human PDAC
Exponent practice.
Presentation transcript:

Gene Set Enrichment Analysis

GSEA: Key Features Ranks all genes on array based on their differential expression Identifies gene sets whose member genes are clustered either towards top or bottom of the ranked list (i.e. up- or down regulated) Enrichment score calculated for each category Permutation test to identify significantly enriched categories Extensive gene sets provided via MolSig DB – GO, chromosome location, KEGG pathways, transcription factor or microRNA target genes

GSEA Each gene category tested by traversing ranked list Enrichment score starts at 0, weighted increment when a member gene encountered, weighted decrement otherwise Enrichment score – point where most different from zero Most significantly up-regulated genes Unchanged genes Most significantly down-regulated genes Disease Control

Enrichment Score (ES) is calculated by evaluating the fractions of genes in S (‘‘hits’’) weighted by their correlation and the fractions of genes not in S (‘‘misses’’) present up to a given position i in the ranked gene list, L, where N genes are ordered according to the correlation, 4 Enrichment Score

GSEA algorithm

Null distribution of enrichment scores Actual ES GSEA: Permutation Test Randomise data (groups), rank genes again and repeat test 1000 times Null distribution of 1000 ES for geneset FDR q-value computed – corrected for gene set size and testing multiple gene sets