1 MAGE-OM and ArrayExpress database model Ugis Sarkans, EBI.

Presentation transcript:

1 MAGE-OM and ArrayExpress database model Ugis Sarkans, EBI

2 Outline
what is MAGE-OM
what is ArrayExpress
what language is used for modeling
MAGE-OM structure
ArrayExpress status and future
MAGE future developments

3 MAGE-OM
MicroArray Gene Expression Object Model
– also: MAGE-ML (.. Markup Language), MAGE-STK (.. Software ToolKit)
Merging of MAML (MicroArray Markup Language) and GEML (Gene Expression Markup Language)

4 MAGE: brief history
December initial submissions of proposals to OMG (Object Management Group):
– EBI (on behalf of MGED) - MAML
– Rosetta (on behalf of GEML community) - GEML + some IDLs
– NetGenics - IDLs
Decision to proceed with a joint submission
Decision to comply with Model Driven Architecture (MDA) principles
October joint submission to OMG (Rosetta and MGED)

5 Model Driven Architecture
Platform Independent Model (UML)
– most of the time spent on this
Platform Specific Models
– XML
  UML (refined from PIM)
  DTD (generated plus hand modifications)
– CORBA (not for MAGE)
  UML (refined from PIM)
  IDL (hopefully generated)
– ….
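
The step from a platform independent model to an XML-specific artefact can be sketched as follows. This is a minimal illustration, assuming a toy model held in a Python dictionary; the class names, attributes and generation rules are hypothetical and are not the actual MAGE-OM model or the rules used to produce the MAGE-ML DTD.

```python
# Toy platform independent model: class name -> attributes and child elements.
# All names below are hypothetical, chosen only for illustration.
PIM = {
    "Experiment": {"attributes": ["identifier", "name"], "children": ["BioAssay"]},
    "BioAssay": {"attributes": ["identifier"], "children": []},
}


def generate_dtd(model):
    """Emit one ELEMENT declaration per class and one ATTLIST per attribute."""
    lines = []
    for class_name, spec in model.items():
        children = spec["children"]
        # Zero-or-more of each child element, or EMPTY when there are none.
        content = "(" + ", ".join(c + "*" for c in children) + ")" if children else "EMPTY"
        lines.append(f"<!ELEMENT {class_name} {content}>")
        for attr in spec["attributes"]:
            lines.append(f"<!ATTLIST {class_name} {attr} CDATA #IMPLIED>")
    return "\n".join(lines)


dtd_text = generate_dtd(PIM)
print(dtd_text)
```

A generated DTD of this kind would still need the hand modifications the slide mentions; the sketch only covers the mechanical part.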

6 ArrayExpress
first version (object model) , in collaboration with German Cancer Research Centre (DKFZ)
second version (object model) - end of 2000, prototype development funded by Incyte

7 ArrayExpress (2)
implementation - first half of Oracle schema, data loader (from MAML), prototype Web interface, a few datasets loaded
decision to use MAGE-OM as basis for further development
EU funding , 8 new positions

8 ArrayExpress - features
MIAME-compliant
able to import MAML (MAGE-ML) formatted data
can deal with both raw and processed data
independence of:
– experimental platforms
– image analysis methods
– data normalization methods
object model-based query mechanism
supports upcoming OMG standard for expression data

9 Unified Modeling Language
graphical language for describing software systems (and more..)
notation - yes
methodology - no

10 UML diagram types
class
state
collaboration
sequence
……..

11 State diagram

12 Sequence diagram

13 Collaboration diagram

14 Class diagram

15 Class diagrams - notation
classes
attributes
– types
operations
relationships
– subclass relationship
– aggregate relationship
– association
  role names
  cardinalities
  navigation

16 [Annotated class diagram; labels: class, class from another package, attribute, aggregation, navigation, role name, cardinality, association name, inheritance]
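
The notation elements listed on the two slides above can be sketched in code. This is a minimal illustration using Python dataclasses; the class names echo MAGE-OM package names, but the attributes, the operation and the relationships shown here are assumptions made for the example, not the real model.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Describable:
    """Superclass: subclasses inherit its attributes (inheritance arrow)."""
    name: str  # attribute with its type


@dataclass
class Feature(Describable):
    """A single position on an array (hypothetical attributes)."""
    row: int = 0
    column: int = 0


@dataclass
class Protocol(Describable):
    text: str = ""


@dataclass
class ArrayDesign(Describable):
    # Aggregation with role name "features" and cardinality 0..n:
    # the design is a whole made up of its features.
    features: List[Feature] = field(default_factory=list)
    # Plain association with role name "protocol" and cardinality 0..1.
    # Navigation is one-way: ArrayDesign knows its Protocol, not vice versa.
    protocol: Optional[Protocol] = None

    def feature_count(self) -> int:
        """An operation on the class."""
        return len(self.features)


design = ArrayDesign(name="demo design")
design.features.append(Feature(name="f1", row=1, column=1))
design.features.append(Feature(name="f2", row=1, column=2))
design.protocol = Protocol(name="printing", text="hypothetical protocol text")
print(design.feature_count())
```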

17 Class diagram

18 Implementation issues
Java, C++ - “easy”
relational databases
– classes - tables
– 1:1, 1:N - foreign key
– N:M - table
– subclass relations
  all subclasses in the same table
  separate table for superclass and subclasses
XML
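
The relational mapping rules above can be sketched with an in-memory SQLite database. This is an illustration only: the table and column names are hypothetical, it is not the ArrayExpress schema, and it shows just the first subclass option (all subclasses in the same table).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
-- Class -> table. Subclass relation, option 1: all subclasses share one
-- table, with a discriminator column naming the concrete class.
CREATE TABLE design_element (
    id INTEGER PRIMARY KEY,
    element_class TEXT NOT NULL,
    name TEXT NOT NULL
);

CREATE TABLE array_design (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);

-- 1:N association -> foreign key on the "many" side.
CREATE TABLE bio_assay (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    array_design_id INTEGER NOT NULL REFERENCES array_design(id)
);

-- N:M association -> a separate link table holding both keys.
CREATE TABLE array_design_element (
    array_design_id INTEGER NOT NULL REFERENCES array_design(id),
    design_element_id INTEGER NOT NULL REFERENCES design_element(id),
    PRIMARY KEY (array_design_id, design_element_id)
);
""")

conn.execute("INSERT INTO array_design (id, name) VALUES (1, 'design A')")
conn.executemany(
    "INSERT INTO design_element (id, element_class, name) VALUES (?, ?, ?)",
    [(1, "Feature", "f1"), (2, "Reporter", "r1")],
)
conn.execute("INSERT INTO bio_assay (id, name, array_design_id) VALUES (1, 'assay 1', 1)")
conn.executemany("INSERT INTO array_design_element VALUES (?, ?)", [(1, 1), (1, 2)])

# Follow the N:M link table to list the elements of design 1, by class.
rows = conn.execute(
    """
    SELECT e.element_class, e.name
    FROM design_element AS e
    JOIN array_design_element AS link ON link.design_element_id = e.id
    WHERE link.array_design_id = 1
    ORDER BY e.id
    """
).fetchall()
print(rows)
```

The second subclass option would instead keep shared columns in a superclass table and give each subclass its own table keyed by the same id, at the cost of a join on every read.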

19 Tools
Rational Rose
– bad graphical capabilities
– forward/reverse engineering
– API (VB-based)
open source
– ArgoUML

20 UML Packages
BSANE, BQS, Description, Protocol, Measurement, Audit, Treatment, Transformation, BioEvent, Experiment, ArrayDesign, BioMaterial, BioAssayData, BioAssay, DesignElement, HigherLevelAnalysis, BioSequence, ArrayManufacture, QuantitationType

21 Top level structure

22 BioAssay

23 Biomaterial

24 ArrayDesign

25 DesignElement

26 DesignElement

27 DesignElement mapping

28 Data

29 BioSequence

30 ArrayManufacture

31 Quantitations

32 HigherLevelAnalysis

33 BioEvent

34 Protocol

35 Description

36 AuditAndSecurity

37 Measurement

38 ArrayExpress: current status
Object model (MAGE-OM) - stable
Database schema - generated (standard SQL, we run under Oracle)
Data loader from MAGE-ML - generated
Web interface (queries, browsing) - under development
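
Generating a database schema from an object model can be sketched as below. This is a minimal illustration under stated assumptions: the model description format, the class names and the generation rules are hypothetical and do not reproduce the generator used for ArrayExpress. SQLite is used only to check that the generated statements are accepted.

```python
import sqlite3

# Hypothetical model description: class -> {attribute: SQL type}, plus
# single-valued references to other classes. Not the real MAGE-OM model.
MODEL = {
    "Experiment": {"attributes": {"name": "VARCHAR(255)"}, "references": []},
    "BioAssay": {"attributes": {"name": "VARCHAR(255)"}, "references": ["Experiment"]},
}


def generate_ddl(model):
    """Return one CREATE TABLE statement per class, in model order."""
    statements = []
    for class_name, spec in model.items():
        columns = ["id INTEGER PRIMARY KEY"]
        columns += [f"{attr} {sql_type}" for attr, sql_type in spec["attributes"].items()]
        # Each single-valued reference becomes a foreign key column.
        columns += [
            f"{target}_id INTEGER REFERENCES {target}(id)"
            for target in spec["references"]
        ]
        statements.append(
            f"CREATE TABLE {class_name} (\n    " + ",\n    ".join(columns) + "\n)"
        )
    return statements


ddl = generate_ddl(MODEL)

# Sanity check: the generated statements load into an in-memory database.
check = sqlite3.connect(":memory:")
for statement in ddl:
    check.execute(statement)
tables = [
    row[0]
    for row in check.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
print(tables)
```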

39 Near future developments
Dedicated hardware for ArrayExpress
Good quality data coming from collaborators (annotation tools needed)
Data uploading and Web interface made public

40 Future developments
Integration with existing tools (Expression Profiler)
New analytical tools
Links with other databases
Data curation, liaison with data providers

41 ArrayExpress architecture
central database (experiment-centred)
data warehouse
application server (Java servlets)
Web server
image server
ArrayExpress curation MAGE-ML API curation tool database

42 MAGE schedule
OMG meeting, Dublin, November specification hopefully adopted
Mechanism for incorporating changes and user feedback
MAGE programming jamboree, EBI, December 6-11: API development, parser generation, annotation tools (MAGE STK)

43 Resources
Web site
– links to documents presentations UML models
– also HTML version and PNG image files of diagrams
–
Mailing list
– to subscribe, send the following to subscribe lsr-ge

44 Acknowledgements
Doug Bassett (Rosetta)
Alvis Brazma (EBI)
Steve Chervitz (Affymetrix)
Francisco De La Vega (Applied Biosystems)
Michael Dickson (NetGenics)
David Frankel (IONA)
Scott Markel (NetGenics)
Michael Miller (Rosetta)
Dave Nellesen (Incyte)
Alan Robinson (EBI)
Martin Senger (EBI)
Paul Spellman (Lawrence Berkeley Lab)
Jason Stewart (NCGR)
Charles Troup (Agilent)