Transcriptomics Patrick Kemmeren European Bioinformatics Institute Genomics Lab, UMC Utrecht.

Presentation transcript:

Transcriptomics Patrick Kemmeren European Bioinformatics Institute Genomics Lab, UMC Utrecht

Transcriptomics? What are microarrays? mRNA → cDNA → hybridise to microarray

Experiment (figure): for each sample, Sample → RNA extract → labelled nucleic acid → hybridisation → array (array design, protocol). Normalization and integration across the arrays give the gene expression data matrix (genes × samples).

Microarray data and annotation: gene expression matrix (genes × samples) holding the gene expression levels, together with sample annotation and gene annotation.
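The matrix-plus-annotation layout on this slide can be sketched as a small data structure. This is only an illustration: the class name `ExpressionMatrix`, its methods, and the gene and sample identifiers are hypothetical and are not taken from ArrayExpress or Expression Profiler.

```python
from dataclasses import dataclass, field


@dataclass
class ExpressionMatrix:
    """Genes x samples matrix with per-gene and per-sample annotation."""
    genes: list[str]
    samples: list[str]
    values: list[list[float]]  # values[i][j]: level of gene i in sample j
    gene_annotation: dict[str, dict[str, str]] = field(default_factory=dict)
    sample_annotation: dict[str, dict[str, str]] = field(default_factory=dict)

    def __post_init__(self):
        # Rows must line up with genes, columns with samples.
        if len(self.values) != len(self.genes):
            raise ValueError("expected one row per gene")
        if any(len(row) != len(self.samples) for row in self.values):
            raise ValueError("expected one column per sample")

    def level(self, gene, sample):
        """Expression level of one gene in one sample."""
        return self.values[self.genes.index(gene)][self.samples.index(sample)]

    def samples_where(self, key, value):
        """Samples whose annotation has the given key/value pair."""
        return [s for s in self.samples
                if self.sample_annotation.get(s, {}).get(key) == value]


# Hypothetical toy data: two genes measured in two samples.
matrix = ExpressionMatrix(
    genes=["geneA", "geneB"],
    samples=["s1", "s2"],
    values=[[1.0, 2.0], [3.0, 4.0]],
    sample_annotation={"s1": {"treatment": "control"},
                       "s2": {"treatment": "heat"}},
)
```

Keeping the annotation next to the values is what makes sample-centric and gene-centric queries possible later on.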

Traditions of data sharing in Life Sciences
Data used in publications should be made available so that:
– the experiments can be reproduced and the conclusions can be verified
– others can build on each other's results
In genome sequencing this has evolved into submissions to the public sequence databases DDBJ/EMBL/GenBank; most journals require such submissions.

Sharing microarray data – which data? (figure) Array scans → Spots → Quantitations → Genes × Samples (A, B, C, D)

MGED standards - MIAME

MGED – MIAME: 6 parts of a microarray experiment (figure)
– Sample: sample source, sample treatments, extraction protocol, labeling protocol
– Array: array design information, location of each element, description of each element
– Hybridisation: hybridization protocol
– Image: scanning protocol, software specifications
– Quantification matrix: analysis protocol, software specifications
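As a rough illustration of how a submission tool might check an experiment description against the items named on this slide, here is a minimal sketch. The section and field names are hypothetical, chosen to mirror the slide labels; they are not the official MIAME checklist or any MIAMExpress schema.

```python
# Hypothetical grouping of the slide's labels; not the official MIAME checklist.
MIAME_ITEMS = {
    "sample": ["source", "treatments", "extraction_protocol", "labeling_protocol"],
    "array": ["design_information", "element_locations", "element_descriptions"],
    "hybridisation": ["protocol"],
    "image": ["scanning_protocol", "software"],
    "quantification": ["analysis_protocol", "software"],
}


def missing_items(record):
    """Return 'section.field' names that are absent or empty in the record."""
    missing = []
    for section, fields in MIAME_ITEMS.items():
        provided = record.get(section, {})
        for name in fields:
            if not provided.get(name):
                missing.append(f"{section}.{name}")
    return missing


# Hypothetical, deliberately incomplete experiment description.
record = {
    "sample": {"source": "yeast culture", "treatments": "heat shock",
               "extraction_protocol": "P-1", "labeling_protocol": "P-2"},
    "array": {"design_information": "A-1", "element_locations": "layout file",
              "element_descriptions": "reporter list"},
    "hybridisation": {"protocol": "P-3"},
    "image": {"scanning_protocol": "P-4"},
}
gaps = missing_items(record)
```

A curator or a submission form could use such a list of gaps to ask the submitter for the missing details.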

Microarray experiment (figure): Samples → Extracts → Labelled Extracts → Hybridizations. Colours relate to labels; shapes relate to array designs. Experiment name: Rustici et al., S. pombe cell-cycle mutant data (2004)

ArrayExpress overview (diagram)
User functionality: submissions, submission support, curation, data upload, data download, visualisation, retrieval of raw & processed data for analysis, gene-, sample- and experiment-centric queries
Database architecture: MIAMExpress, external application, MAGE-ML (XML), ArrayExpress Repository, AE Data Warehouse

MIAMExpress
– Submission and annotation tool
– Potential local data annotation tool
– Based on MIAME concepts
– Accepts protocol, array and experiment submissions
– User accounts allow re-use of protocols and arrays
– Works with your own or commercial arrays

MIAMExpress schema

ArrayExpress A public repository for microarray data at the EBI

Data in ArrayExpress

Submissions (chart): by pipelines vs. online (MIAMExpress) submissions

ArrayExpress data by organism. Total: ~7000 hybridisations

Gene-centric Query Prototype New!

Gene-centric Query Prototype New! - Driven by a BioMart backend

Expression Profiler An online microarray data analysis platform

What can you do with the data?

...view as a heatmap... (Expression Profiler Data Viewer Component)

What can you do with the data? ...cluster the data... (Expression Profiler Hierarchical Clustering Component)
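To give a feel for what hierarchical clustering does with expression profiles, here is a minimal agglomerative sketch. It assumes Euclidean distance and average linkage, which are common choices but not necessarily the defaults of the Expression Profiler component; the function names and the toy profiles are made up for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two expression profiles."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average_linkage(profiles, n_clusters):
    """Merge the two closest clusters until n_clusters remain.

    Returns the clusters as sorted lists of row indices.
    """
    clusters = [[i] for i in range(len(profiles))]

    def dist(c1, c2):
        # Average pairwise distance between members of the two clusters.
        total = sum(euclidean(profiles[i], profiles[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > n_clusters:
        a, b = min(
            ((i, j) for i in range(len(clusters))
             for j in range(i + 1, len(clusters))),
            key=lambda pair: dist(clusters[pair[0]], clusters[pair[1]]),
        )
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return sorted(sorted(c) for c in clusters)


# Hypothetical profiles: rows 0-1 and rows 2-3 behave alike.
profiles = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.0, 5.2]]
groups = average_linkage(profiles, 2)
```

This brute-force version recomputes every distance at each step, which is fine for a handful of genes but far too slow for a real expression matrix.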

What can you do with the data? ...look at GeneOntology enrichment of a selected cluster... (Expression Profiler GO Annotation Component)
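Enrichment of a GO term in a cluster is commonly scored with a hypergeometric tail probability. The sketch below assumes that test; the slide does not say which statistic the GO Annotation Component uses, and the function and parameter names are hypothetical.

```python
from math import comb


def enrichment_p_value(cluster_size, cluster_hits, population_size, population_hits):
    """P(at least cluster_hits annotated genes) when cluster_size genes are
    drawn without replacement from a population in which population_hits
    genes carry the GO term."""
    max_hits = min(cluster_size, population_hits)
    total = comb(population_size, cluster_size)
    tail = sum(
        comb(population_hits, k)
        * comb(population_size - population_hits, cluster_size - k)
        for k in range(cluster_hits, max_hits + 1)
    )
    return tail / total


# Toy numbers: 3 of 10 genes carry the term, and all 3 fall in a cluster of 3.
p = enrichment_p_value(cluster_size=3, cluster_hits=3,
                       population_size=10, population_hits=3)
```

In practice many GO terms are tested per cluster, so the raw p-values would need a multiple-testing correction before interpretation.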

What can you do with the data? ...check out how clusterings compare... (Expression Profiler Clustering Comparison Component)
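One simple way to put a number on how well two clusterings agree is the Rand index: the fraction of gene pairs that both clusterings treat the same way (together in both, or apart in both). This is a stand-in chosen for brevity, not the method implemented in the Clustering Comparison Component, and the function name is hypothetical.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Fraction of item pairs on which two clusterings agree."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both clusterings must label the same items")
    pairs = list(combinations(range(len(labels_a)), 2))
    if not pairs:
        raise ValueError("need at least two items to compare")
    agree = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agree / len(pairs)


# Same partition under different label names: full agreement.
same = rand_index([0, 0, 1, 1], [1, 1, 0, 0])
# Partitions that cut across each other: agreement on 2 of 6 pairs.
crossed = rand_index([0, 0, 1, 1], [0, 1, 0, 1])
```

The index only looks at pairs, so it is insensitive to how the clusters are named, which is what you want when comparing the output of two different algorithms.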

What can you do with the data? ...integrate several data types together... (Expression Profiler Three-way Similarity Analysis)

Available Components
– Data Selection
– Data Transformation
– Missing Value Imputation
– Hierarchical Clustering & K-groups Clustering
– Clustering Comparison
– Signature Algorithm
– Sequence Homology
– SPEXS: Promoter Discovery
– Visual Pattern Matching
– Ordination (COA, PCA)
– Between Group Analysis
– Three-way Similarity Analysis
– GO Annotation
Uses:
– ArrayExpress suite of tools
– Standalone tool
– Locally installed (UJI, UMC Utrecht)
– Teaching tool
– Pipelines, workflows, high-throughput analysis
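Of the components listed above, missing value imputation is the easiest to illustrate. The sketch below fills each gap with the mean of the observed values in the same row (gene); this is only one simple strategy and is assumed here for illustration, not taken from Expression Profiler. Missing values are represented as `None`, and a row with no observed values is filled with 0.0, which is an arbitrary choice.

```python
def impute_row_mean(matrix):
    """Replace None entries with the mean of the observed values in the row."""
    imputed = []
    for row in matrix:
        observed = [v for v in row if v is not None]
        # Arbitrary fallback of 0.0 when a gene has no measurements at all.
        fill = sum(observed) / len(observed) if observed else 0.0
        imputed.append([fill if v is None else v for v in row])
    return imputed


# Hypothetical matrix: one gap in the first gene, nothing measured for the second.
filled = impute_row_mean([[1.0, None, 3.0], [None, None, None]])
```

Clustering and ordination methods generally need a complete matrix, which is why a step like this usually comes before them in a pipeline.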

Acknowledgements

Original EP Development: Jaak Vilo (Tartu), Patrick Kemmeren (Utrecht), Misha Kapushesky
EP:NG Framework Development: Patrick Kemmeren (Utrecht), Misha Kapushesky, Caroline Johnston (UCL)
Visualization Components: Misha Kapushesky, Steffen Durinck (Leuven), Phil Hyoun Lee
Clustering Comparison: Aurora Torrente, Christine Körner (Leipzig)
PCA/COA/BGA: Aedín Culhane (Cork)
Signature Algorithm: Jan Ihmels (Tel-Aviv)
Gene Ordering: Karlis Freivalds (Riga)
Normalisation: Caroline Johnston (UCL)
Web Services: Antonio Estruch (UJI)

EBI Microarray Informatics Team
Alvis Brazma, Head of Microarray Informatics Group
Ahmet Oezcimen, Scientist (Oracle DBA)
Anastasia Samsonova, PhD student
Anjan Sharma, Scientist (Software Developer)
Anna Farne, Scientist (Curation)
Aurora Torrente, PhD Student
Bhuwan Tiwari, Trainee
Catherine Leroy, Summer Student
Ele Holloway, Scientist (Curation)
Gabriella Rustici, Scientist (Postdoc)
Gaurab Mukherjee, Scientist (Curation)
Gonzalo Garcia Lara, Scientist (Web Designer/Programmer)
Helen Parkinson, Scientist (Curation Coordinator)
Jaak Vilo, Consultant
Lev Soinov, Scientist (Postdoc Wellcome Trust)
Misha Kapushesky, Scientist (Scientific Application Programmer)
Mohammadreza Shojatalab, Scientist (Database Programmer)
Niran Abeygunawardena, Scientist (Web Designer/Programmer)
Patrick Kemmeren, Consultant
Per Lilja, Scientist (Database Programmer)
Philippe Rocca-Serra, Scientist (Nutrigenomics Proj. Coordinator)
Pierre Marguerite, Summer Student
Richard Coulson, Scientist (Biosapiens Project)
Sergio Contrino, Scientist (Database Programmer)
Steffen Durinck, Student
Susanna-Assunta Sansone, Scientist (Toxicogenomics Proj. Coordinator)
Tim Rayner, Scientist (Curation)
Ugis Sarkans, Scientist (Database Development Coordinator)