ArrayExpress Ugis Sarkans EMBL - EBI www.ebi.ac.uk/arrayexpress.

Presentation transcript:

ArrayExpress Ugis Sarkans EMBL - EBI

Outline
- why the domain model is not simple
- ArrayExpress object model
- ArrayExpress implementation status
- future developments

Underlying principles
- must be able to accommodate the needs of a technology that is under constant development
- must be able to manage data in the absence of standard measurement units and of standards for reliability information
- gene expression data have meaning only in the context of the experimental conditions
  – controlled vocabularies and ontologies are needed for unambiguous sample annotation
- MIAME-compliant

ArrayExpress - conceptual overview

Simple version of AE object model - ArrayExpressBasic

Motivation for 2 object models
- many spots - one gene
- raw data - cleaned-up data - ratios - normalizations - higher-level analysis
- how detailed a sample description is needed?
- for data mining we need ways to unify several datasets:
  – array features across different array platforms
  – samples from different experiments
  – various raw and derived measurements
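The "many spots - one gene" point can be illustrated with a minimal Python sketch. Everything in it is hypothetical: the spot and gene identifiers, the `per_gene_mean` helper, and the choice of averaging are assumptions made for illustration, not the ArrayExpress method.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, List, Tuple

# Hypothetical input: (spot id, gene id, measured value).
# Several spots on an array may report on the same gene.
spots: List[Tuple[str, str, float]] = [
    ("spot_a", "geneX", 2.0),
    ("spot_b", "geneX", 4.0),
    ("spot_c", "geneY", 1.5),
]

def per_gene_mean(rows: List[Tuple[str, str, float]]) -> Dict[str, float]:
    """Collapse spot-level values to one value per gene.

    Averaging is only one possible rule (an assumption here);
    a real pipeline might use a median or a quality-weighted value.
    """
    by_gene: Dict[str, List[float]] = defaultdict(list)
    for _spot_id, gene_id, value in rows:
        by_gene[gene_id].append(value)
    return {gene_id: mean(values) for gene_id, values in by_gene.items()}

gene_values = per_gene_mean(spots)
```

A mapping of this kind is what lets array features from different platforms be compared at the gene level.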

ArrayExpressComplete

Scope of ArrayExpress object models
- useable for a public repository as well as a laboratory database (e.g., as a part of LIMS)
- implementation of “intermediate” models possible
- mapping to RDBMS tables - not necessarily straightforward
- models and documentation available at

ArrayExpress - features
- able to import MAML format
- can deal with both raw and processed data
- independence of:
  – experimental platforms
  – image analysis methods
  – data normalization methods
- object model-based query mechanism
- will support upcoming OMG standard for expression data

Key constructs in the AE object model
- structured sample descriptions
- notion of ExpressionValueSet
- several dimensions for ExpressionValues
- Transformations working on ExpressionValueSets and their dimensions
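One way to picture an ExpressionValueSet with several dimensions, and a Transformation acting on it, is the Python sketch below. The class layout, the dimension names ("feature", "channel"), and the `log_ratio` function are assumptions for illustration only; the slides do not give the actual ArrayExpress class definitions.

```python
from dataclasses import dataclass
from math import log2
from typing import Dict, Tuple

@dataclass(frozen=True)
class ExpressionValueSet:
    """Hypothetical stand-in: values addressed by one coordinate per dimension."""
    dimensions: Tuple[str, ...]
    values: Dict[Tuple[str, ...], float]

def log_ratio(evs: ExpressionValueSet) -> ExpressionValueSet:
    """Transformation that removes the 'channel' dimension.

    For every remaining coordinate it computes log2(red / green),
    so the result has one dimension fewer than the input.
    """
    ch = evs.dimensions.index("channel")
    kept = tuple(d for d in evs.dimensions if d != "channel")
    grouped: Dict[Tuple[str, ...], Dict[str, float]] = {}
    for coord, value in evs.values.items():
        key = coord[:ch] + coord[ch + 1:]
        grouped.setdefault(key, {})[coord[ch]] = value
    ratios = {key: log2(c["red"] / c["green"]) for key, c in grouped.items()}
    return ExpressionValueSet(kept, ratios)

raw = ExpressionValueSet(
    dimensions=("feature", "channel"),
    values={
        ("spot1", "red"): 800.0, ("spot1", "green"): 200.0,
        ("spot2", "red"): 100.0, ("spot2", "green"): 400.0,
    },
)
derived = log_ratio(raw)
```

Because the transformation returns another value set, further transformations (for example a normalization) could be chained on the result in the same way.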

Structured representation of sample and treatment relations
[Diagram. Nodes: Sample source; A new state of sample source; Primary sample 1; Primary sample 2; Derived sample 1; Derived sample 2; Extract 1; Extract 2; Labeled extract 1; Labeled extract 2; Hybridization. Step labels: treatment, extraction, labeling.]
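A structured sample description of this kind can be sketched as a chain of materials, each recording what it was derived from and by which step. The `Material` class, the `lineage` helper, and the exact order of the chain are assumptions; the transcript preserves only the diagram's labels, not its arrows.

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class Material:
    """Hypothetical node: a sample, extract, or labeled extract."""
    name: str
    derived_from: Optional["Material"] = None
    action: Optional[str] = None  # step that produced this material

def lineage(material: Optional[Material]) -> List[str]:
    """Walk back to the original sample source and return the names in order."""
    names: List[str] = []
    while material is not None:
        names.append(material.name)
        material = material.derived_from
    return list(reversed(names))

# Assumed ordering, for illustration only.
source = Material("Sample source")
primary = Material("Primary sample 1", source, "treatment")
derived_sample = Material("Derived sample 1", primary, "treatment")
extract = Material("Extract 1", derived_sample, "extraction")
labeled = Material("Labeled extract 1", extract, "labeling")

path = lineage(labeled)
```

Storing the producing step on each node keeps the treatment history queryable, which is the point of annotating samples in a structured rather than free-text form.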

Microarray expression value representation
[Diagram labels: expression value types; primary images; composite images; e.g., green/red ratios; primary spots; composite spots; primary measurements; derived values.]

Current status
- object model - stable, supports current MIAME
- physical database schema
- MAML data loader
- populated with one dataset from EMBL
- currently accessible through SQL

In development
- data loader - changes following MAML evolution
- annotation & MAML export tool
- Web interface to ArrayExpress
  – programmatic interface will follow

Proposed architecture
[Diagram labels: data submission & curation database; data warehouse; application server; Web server; image server?; ArrayExpress curation pipeline; MAML data.]

Future developments
- will support upcoming OMG standard for gene expression data (XML, queries)
- diagrammatic interface to sample description submodel
- integration with other databases
- analytical tools running on top of ArrayExpress
- data curation pipeline development

Acknowledgements
– MGED - MIAME, MAML
– Incyte - Genomic Knowledge Platform
– OMG gene expression data proposal submitters - Rosetta & NetGenics