Babelomics Functional interpretation of genome-scale experiments Barcelona, 28 November de 2007 Ignacio Medina David Montaner

Slides:



Advertisements
Similar presentations
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
Advertisements

Pathways analysis Iowa State Workshop 11 June 2009.
Gene Set Enrichment Analysis (GSEA)
Introduce GeneSpring GX12 Yun Lian GeneSpring Layout.
Lecture Outline Introduction Data mining sources: –GO, InterPro, KEGG, UniProt Tools to do the data mining: –FatiGO –FatiWISE.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Gene Ontology John Pinney
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Biological Interpretation of Microarray Data Helen Lockstone DTC Bioinformatics Course 9 th February 2010.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
An update on ongoing projects within Biorange SP Biorange Project Meeting Leiden, September 15 Tim Hulsen.
Pathway and network analysis Functional interpretation of gene lists
CACAO - Penn State Gene Function and Gene Ontology January 2011
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
 2 Outline  Review of major computational approaches to facilitate biological interpretation of  high-throughput microarray  and RNA-Seq experiments.
Lecture 4: Gene Annotation & Gene Ontology June 11, 2015.
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
Daniel Rico, PhD. Daniel Rico, PhD. ::: Introduction to Functional Analysis Course on Functional Analysis Bioinformatics Unit.
Using The Gene Ontology: Gene Product Annotation.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Ranking-Aware Integration and Explorative Search of Distributed Bio-Data Dipartimento di Elettronica e Informazione NETTAB 2012 Integrated Bio-Search November.
BIF Group Project Group (A)rabidopsis: David Nieuwenhuijse Matthew Price Qianqian Zhang Thijs Slijkhuis Species: C. Elegans Project: Advanced.
CANDID: A candidate gene identification tool Janna Hutz March 19, 2007.
Ontologies, data standards and controlled vocabularies.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
Copyright OpenHelix. No use or reproduction without express written consent1.
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
Course on Functional Analysis
1 of 38 Data Mining in Ensembl with BioMart. 2 of 38 Simple Text-based Search Engine.
EBI is an Outstation of the European Molecular Biology Laboratory. GOA: Looking after GO annotations Emily Dimmer Gene Ontology Annotation (GOA) Database.
UBio Training Courses Micro-RNA web tools Gonzalo
Data Mining in Ensembl with BioMart Nov,
GO-based tools for functional modeling TAMU GO Workshop 17 May 2010.
From genes to functional blocks in the study of biological systems Fátima Al-Shahrour, Joaquín Dopazo National Institute of Bioinformatics, Functional.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
An overview of Bioinformatics. Cell and Central Dogma.
A curated database of biological pathways.
Cool BaRC Web Tools Prat Thiru. BaRC Web Tools We have.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
GO enrichment and GOrilla
Copyright OpenHelix. No use or reproduction without express written consent1.
Microarray data analysis using GEPAS and Babelomics Department of Bioinformatics, Centro de Investigación Príncipe Felipe, and Functional genomics node,
Gene models and proteomes for Saccharomyces cerevisiae (Sc), Schizosaccharomyces pombe (Sp), Arabidopsis thaliana (At), Oryza sativa (Os), Drosophila melanogaster.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Protein databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen and from CSC bio-opas
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
2/3/2005 Gene Ontology (GO) The Gene Ontology (GO) project is a collaborative effort to address the need for consistent descriptions.
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Canadian Bioinformatics Workshops
Gene Annotation & Gene Ontology May 24, Gene lists from RNAseq analysis What do you do with a list of 100s of genes that contain only the following.
Canadian Bioinformatics Workshops
This publication represents the views of the Authors, not the EC. The EC is not liable for any use that may be made of the information. EADGENE and SABRE.
Annotating with GO: an overview
Regulatory Genomics Lab
GO : the Gene Ontology & Functional enrichment analysis
::: Schedule. Biological (Functional) Databases
A bioinformatic analysis of microRNAs role in osteoarthritis
GSEA-Pro Tutorial Gene Set Enrichment Analysis for Prokaryotes
Regulatory Genomics Lab
Identification of acetylated peptides and proteins by LC-MS/MS.
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Babelomics Functional interpretation of genome-scale experiments Barcelona, 28 November de 2007 Ignacio Medina David Montaner Bioinformatics Department CENTRO DE INVESTIGACION PRINCIPE FELIPE (VALENCIA)

Babelomics: A systems biology web resource for the functional interpretation of genome-scale experiments.

Genome-scale experiment output 1007_s_at 1053_at 117_at 121_at 1255_g_at 1294_at 1316_at _at 1405_i_at 1431_at 1438_at 1487_at 1494_f_at 1598_g_at _at 1729_at 1773_at 177_at _s_at _at _at _at _g_at _at _at _at _i_at _at _at _at _f_at _g_at _at _at Functional Interpretation

ENSEMBL Ensembl ID HGNC symbol EMBL acc UniProt/Swiss-Prot UniProtKB/TrEMBL Ensembl IDs RefSeq EntrezGene Affymetrix Agilent PDB Protein Id IPI…. Arabidopsis thaliana Homo sapiens Mus musculus Rattus norvegicus Drosophila melanogaster Caenorhabditis elegans Saccharmoyces cerevisae GO KEGG Interpro Transcription Factors Gene expression Cisred Bioentities Literature Gallus gallus Babelomics imported databases

Babelomics tools FatiGO: Finds differential distributions of Gene Ontology terms between two groups of genes. FatiGOplus: an extension of FatiGO for InterPro motifs, pathways and SwissProt KW, transcription factors (TF), gene expression in tissues, bioentities from scientific literature, cis-regulatory elements CisRed. Tissues Mining Tool: compares reference values of gene expression in tissues to your results. MARMITE Finds differential distributions of bioentities extracted from PubMed between two groups of genes. FatiScan: detect significant functions with Gene Ontology, InterPro motifs, Swissprot KW and KEGG pathways in lists of genes ordered according to differents characteristics. MarmiteScan: Use chemical and disease-related information to detect related blocks of genes in a gene list with associated values. GSEA: Detects blocks of functionally related genes with significant coordinate over- or under-expression using the Gene Set Enrichment Analysis. 1007_s_at 1053_at 117_at 121_at 1255_g_at 1294_at 1316_at _at 1405_i_at 1431_at 1438_at 1487_at 1494_f_at 1598_g_at _at 1729_at 1773_at 177_at _s_at _at _at _at _g_at _at _at _at _i_at7.7

Gene List1 Gene List2 Organism Biological process Molecular function Cellular component KEGG pathways Biocarta Pathways (new) Interpro motifs Swissprot keywords Bioentities from literature (Marmite) Gene Expression (TMT) Transcription Factor binding sites Cis-regulatory elements (CisReD) miRNAs (new) FatiGO Text files with a column of identifiers your project name

Testing the distribution of functional terms among two groups of genes (remember, we have to test hundreds of GOs) Biosynthesis 60%Biosynthesis 20% Sporulation 20% Group AGroup B Genes in group A have significantly to do with biosynthesis, but not with sporulation. Are this two groups of genes carrying out different biological roles? 84 No biosynthesis 26 Biosynthesis BA

FatiGO Results Gene group1 is enriched in this functional block Gene group2 is enriched in this functional block percentages p-values corrected p-values

Organism Gene List ordered according the experimental value FatiScan Gene112.4 Gene211.5 Gene310.3 Gene410.2 Gene59.9 Gene69.3 Gene78.2 Gne88.1 Gene107.7 gene Biological process Molecular function Cellular component KEGG pathways Interpro motifs Keywords Swissprot Transcription Factor Cis-regulatory elements

Index ranking genes according to some biological aspect under study. Database that stores gene class membership information. Fa tiScan searches o ver the whole ordered list, trying to find runs of functionally related genes. List of genes + - Annotation label A Annotation label B Annotation label C B A C Testing along the ordered list Block of genes enriched in the annotation A Annotation C is homogeneously distributed along the list Block of genes enriched in the annotation B

% Genes with the specific GO annotation for each partition Fatiscan results List of genes + - B A C

% Genes with the specific GO annotation for each partition GO over- represented among genes over-expressed in A GO over- represented among genes over- expressed in B A B Expression level - + Functional interpretation

FatiScan Example TumorControl + t - t t ~ Tumor mean expression – Control mean expression All genes in the array Proliferation Is more associated with the genes on the top of the list Is more associated with the genes that show higher expression in Tumors