THE INSTITUTE FOR GENOMIC RESEARCH TIGR MGED Ontology Workshop MGED7 September 8-10, 2004 Toronto, Canada.

Slides:



Advertisements
Similar presentations
Conversion owl to SKOS and Further development of WB V.2 Prepared by - Ahsan, Armando, Sachit and Prashanta.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Connecting to Databases. relational databases tables and relations accessed using SQL database -specific functionality –transaction processing commit.
The ArrayExpress Gene Expression Database: a Software Engineering and Implementation Perspective Ugis Sarkans European Bioinformatics Institute.
The MGED Ontology: Providing Descriptors for Microarray Data Trish Whetzel Department of Genetics Center for Bioinformatics University of Pennsylvania.
The MGED Ontology Workshop MGED 7 September 8, 2004 Chris Stoeckert Center for Bioinformatics & Dept. of Genetics University of Pennsylvania.
1 MAGE: Revised submission against LSR RFP-007 "Gene Expression" Ugis Sarkans, EBI Michael Miller, Rosetta Inpharmatics.
Automated creation of verification models for C-programs Yury Yusupov Saint-Petersburg State Polytechnic University The Second Spring Young Researchers.
 Goals Unambiguous description of how the investigation was performed Consistent annotation, powerful queries and data integration  Details NOT model.
Descriptions Robert Grimm New York University. The Final Assignment…  Your own application  Discussion board  Think: Paper summaries  Time tracker.
ANLE1 CC 437: Advanced Natural Language Engineering ASSIGNMENT 2: Implementing a query expansion component for a Web Search Engine.
OntologyEntry in MAGE Chris Stoeckert, Helen Parkinson Trish Whetzel, Joe White Gilberto Fragoso, Liju Fan, Mervi Heiskanen, Angel Pizarro Ontology Working.
1 ArrayExpress and MAGE Jamboree II Ugis Sarkans, EBI.
1 MAGE-OM and ArrayExpress database model Ugis Sarkans, EBI.
Semantic Web. Course Content
NETMOD Architecture Phil Shafer IETF 72.
Conclusions What’s next? * Implementation of additional input formats * Additional vendor support: As vendors become more open with their APIs for accessing.
T Network Application Frameworks and XML Web Services and WSDL Sasu Tarkoma Based on slides by Pekka Nikander.
1 Update on ArrayExpress & standards Ugis Sarkans, EBI.
The Functional Genomics Experiment Model (FuGE) Andy Jones School of Computer Science and Faculty of Life Sciences, University of Manchester.
Introduction to Databases A line manager asks, “If data unorganized is like matter unorganized and God created the heavens and earth in six days, how come.
1 Welcome to the GrameneMart Tutorial A tool for batch data sequence retrieval 1.Select a Gramene dataset to search against. 2.Add filters to the dataset.
MIAMExpress development and local installation DESPRAD Meeting,November 2002 Mohammad shojatalab
Supporting High- Performance Data Processing on Flat-Files Xuan Zhang Gagan Agrawal Ohio State University.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
UML based expert system generation Using Enterprise Architect to model and generate a web-based expert system.
Presentation : Konstantinos Kanaris.  What is Jena?  Usage of Jena  Main Concepts  Main Components  Storage Models  OWL API  RDF API  Reasoning.
Get your hands dirty cleaning data European EMu Users Meeting, 3rd June. - Elizabeth Bruton, Museum of the History of Science, Oxford
MIAMExpress development October 2002 Mohammad shojatalab
The European Bioinformatics Institute MAGE-OM and ArrayExpress a brief introduction to the database model Helen Parkinson European Bioinformatics Institute.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
A plant-specific annotation and submission tool for the incorporation of Arabidopsis gene expression data into ArrayExpress, the EBI’s public DNA microarray.
Internet Technologies Review Week 1 How does Jigsaw differ from EchoServer.java? What abstractions are made available to the servlet writer (under.
XML Grammar and Parser for WSOL Kruti Patel, Vladimir Tosic, Bernard Pagurek Network Management & Artificial Intelligence Lab Department of Systems & Computer.
Client: Paul Mather Virginia Tech CS4624, Blacksburg May 1, 2014 By Nathanael Bice, Scott Brink & Adam Piorkowski.
FuGE: A framework for developing standards for functional genomics Angel Pizarro Univesrity of Pennsylvania Andrew Jones University of Manchester.
Generating Useful Information in Toxicogenomics: Focused Efforts: Microarray Standards Feb. 6, 2003, The National Academies Chris Stoeckert, Ph.D. Center.
Metadata Mòrag Burgon-Lyon University of Glasgow.
FuGE: A framework for developing standards for functional genomics Andrew Jones School of Computer Science, University of Manchester Metabomeeting 2.0.
Representing Flow Cytometry Experiments within FuGE Josef Spidlen 1, Peter Wilkinson 2, and Ryan Brinkman 1 1 BC Cancer Research Centre, Vancouver, BC,
1.Registration block send request of registration to super peer via PRP. Process re-registration will be done at specific period to info availability of.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Extending FuGE into other domains Andrew Jones School of Computer Science, University of Manchester
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
The MGED Ontology W3C Workshop on Semantic Web for life Sciences October 27, 2004 Presented by Liju Fan MGED Ontology Working Group Senior Scientist, KEVRIC.
May 24, 2004IVOA Interop Meeting1 An AXIS-based Java SkyNode Ramon Williamson NCSA T HE US N ATIONAL V IRTUAL O BSERVATORY.
1 ArrayExpress Ugis Sarkans, EBI. 2 Overview Underlying standards –MIAME –MAGE* Data submission Data access –annotations –actual data –array design descriptions.
ModTransf A Simple Model to Model Transformation Engine Cédric Dumoulin.
Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.
CaArray User Community Meeting Feature Overview and Review of MAGE-TAB Update and Export Specification Call in: Participant Passcode:
Application Report: An extensible policy editing API for privacy and identity management policies Giles Hogben jrc. It European Commission.
K. Harrison CERN, 22nd September 2004 GANGA: ADA USER INTERFACE - Ganga release status - Job-Options Editor - Python support for AJDL - Job Builder - Python.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Experience with XML Schema Ashok Malhotra Schema Usage  Mapping XML Schema and XML documents controlled by the Schema to object classes and instances.
STAR Scheduling status Gabriele Carcassi 9 September 2002.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
July 19, 2004Joint Techs – Columbus, OH Network Performance Advisor Tanya M. Brethour NLANR/DAST.
Welcome to the GrameneMart Tutorial A tool for batch data sequence retrieval 1.Select a Gramene dataset to search against. 2.Add filters to the dataset.
Institute of Informatics & Telecommunications NCSR “Demokritos” Spidering Tool, Corpus collection Vangelis Karkaletsis, Kostas Stamatakis, Dimitra Farmakiotou.
May 8, 2006 MAGE v1 and MAGE v2 Michael Miller Lead Software Developer Rosetta Biosoftware NCI MAGE Jamboree.
SCDB Update Michel Jouvin LAL, Orsay March 17, 2010 Quattor Workshop, Thessaloniki.
T Network Application Frameworks and XML Web Services and WSDL Sasu Tarkoma Based on slides by Pekka Nikander.
Web Ontology Language for Service (OWL-S)
Data Modeling II XML Schema & JAXB Marc Dumontier May 4, 2004
Validation & conformity testing
MGED Ontology [--usage]
Supporting High-Performance Data Processing on Flat-Files
Enterprise Web Services
Presentation transcript:

THE INSTITUTE FOR GENOMIC RESEARCH TIGR MGED Ontology Workshop MGED7 September 8-10, 2004 Toronto, Canada

THE INSTITUTE FOR GENOMIC RESEARCH TIGR MAGE Workshop May 24 th - May 28 th 2004 The Institute for Genomic Research and University of Maryland, Shady Grove Rockville, MD

THE INSTITUTE FOR GENOMIC RESEARCH TIGR MAGE Workshop Goals MAGE -> MGED Ontology API –Main goal for this meeting –Build a mechanism for people to use the MO as part of MAGE Ontology Tools –In order to use MO we need tools for manipulating and managing the ontology MAGE v2 Model –Continue model discussions toward MAGE v2 –Document Model and changes as the model is developed –Begin work on Mapping MAGE v1 to MAGE v2

THE INSTITUTE FOR GENOMIC RESEARCH TIGR MAGE Workshop Goals Standalone ADF Converter –Continue work on simplified Array Design Format and a reader and writer for it –Integrate this into MAGE v2 Documentation –MO policies and usage –MOE class or methods –MAGE v2 Code, Code, Code …

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Ontology Tools daml file parsing scripts for extracting the classes, instances and properties from MO ANSI SQL scripts for creating MO in a relational database like MySQL or Sybase Script based methods for updating a datrabase implementation of the MO Perl and Java methods for searching the MO for classes, instances, and properties Others ?

THE INSTITUTE FOR GENOMIC RESEARCH TIGR OWL Reader: Adam Witney developed an reader based on the Redland RDF reader that will parse the MGED Ontology from.OWL as well as.DAML files. This reader became useful to the Ontology helper API. It is located in cvs: lib/Perl/script/MGEDOntology_parser.pl

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Ontology Helper API: Eric Deutsch and Kjell Petersen developed ‘MGEDOntologyHelper.pm’ which will create OntologyEntry objects based on ‘leaf node’ data. Both applications follow MGEDOntology policies. MO Traversal application written by Kjell Petersen and Stathis Sideris.was extended to use ‘leaf node’ data to instantiate an OE MAGE-ML object. Eric Deutsch ported Stathis’ code to a perl modules, which returns nested OE objects. The Perl helper is nearly complete. It works well in simple cases, but in nested cases, the final ‘value’ doesn’t get inserted. These ‘prototype’ modules and applications are available in cvs: Perl: MAGE-Perl/MAG/Tools. Java: MAGE-Java/MGEDOntologyEntry The code also is a working prototype. Example scripts are located in ‘lib/Perl/MGEDOntology The Jave Helper has been completed and is fully functional.

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Perl modules MGEDOntologyClassEntry.pm MGEDOntologyEntry.pm MGEDOntologyHelper.pm MGEDOntologyPropertyEntry.pm Java classes MGEDOntologyEntry.java MGEDOntologyClassEntry.java MGEDOntologyPropertyEntry.java OntologyHelper.java StringManipHelpers.java

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Example Perl for MGEDOntologyHelper my $ontologyEntry1 = Bio::MAGE::Tools::MGEDOntologyClassEntry->new( parentObject => $qt, ## ref to a QT object className => 'QuantitationType', association => 'Scale', values => { Scale => 'linear_scale', }, ontology => $ontology, ## ref to MO Helper obj );

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Resulting OE <OntologyEntry value="linear_scale" category="Scale"> <DatabaseEntry URI=" tology.php#linear_scale" accession="linear_scale">

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Another example my $ontologyEntry1 = Bio::MAGE::Tools::MGEDOntologyClassEntry->new( parentObject => $BioSource, ## ref to parent className => 'BioMaterial', association => 'Characteristics', values => { OrganismPart => 'lung', }, ontology => $ontology, ## ref to MO obj. );

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Missing value in resulting OE istics <DatabaseEntry URI=" accession="OrganismPart">

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Autogenerated MO classes: Scott Gustofsen has devised a method of generating Java classes from the MO. He calls this Java Ontology Bindings. The code isn’t yet implemented, but will be in the near term. The classes would have to be regenerated with each release of MO. He has agreed to push it into cvs.

THE INSTITUTE FOR GENOMIC RESEARCH TIGR BioMoby Web service: Tina Boussard and Derek Fowler began work to establish a BioMoby service to do the following: 1) search MO for terms and definitions, 2) return instantiated classes [objects], 3) provide command-line client register service [for batch processes]. The service would likely be hosted at CBIL (Upenn). A namespace has been defined; datatype and output/input formats for searches still need to be defined. Currently, the service uses GO as the test case database. Derek Fowler has set up a BioMoby test service: test_GetMoTerm. The Moby namespace was problematic because another group at Cornell has registered their namespace incorrectly. They've been asked to fix it.

THE INSTITUTE FOR GENOMIC RESEARCH TIGR MO Term Tracker/Validator: Trish Whetzel implemented a term tracker for the RAD database. The use cases for the Tracker are: 1) return new terms proposed by date and/or submitter, 2) return all terms for any MO class, 3) consistency check the MO. The Tracker would be hosted at CBIL. Helen Parkinson and Trish discussed methods and use cases for managing the MO in RAD.

THE INSTITUTE FOR GENOMIC RESEARCH TIGR ADF Converter: Philippe Rocca-Serra continued work on the ADF format with Michael Miller and Pierre Bushel. In order to incorporate chip/cgh data he had to remove Reporter and CompositeSequence identifiers. The format now is in an Excel workbook with 3 worksheets: headers, Reporters and CompositeSequences. The code will be placed in cvs when ready.

THE INSTITUTE FOR GENOMIC RESEARCH TIGR MAGEv2 Progress: Ugis Sarkans, Paul Spellman, Michael Miller and Angel Pizarro continued re-model-ing discussions. The following changes are being considered: 1. Use Channel to link FactorValues to particular BioMaterials 2. BioAssays are now generalized to (hopefully) include all types of experimental protocols. As such the BioMaterial Treatment object is now a type of BioAssay. 3. ArrayDesign and DesignElement will mostly be left as-is in the model and the reference implementation of the model will have native support for a simpler format, probably ADF. It is not yet clear whether the default serialization (XML schema) will have both formats. 4. Protocols have been changed to be multistep protocols, i.e. a set of ordered steps, and not just simple protocol descriptions. This has allowed the changes in BioAssay to take place. 5. A new abstract class, ‘Referenceable’ was devised to separate the concept of internal MAGE references (Identifiable) to objects that exist in other resources (e.g. they have associations to DatabaseEntry and BibliographicReference) 6. OntologyEntry will be redefined to allow for representation of frame-based and descriptive logic ontologies (MO), as well as simpler node-based ontologies (GO) 7. HigherLevelAnalysis will be extended to represent other types of analytical results, The current cluster represnetation will remain the same, modulo some bug fixes. 8. The MAGE submitters notes were searched for best-practices issues that could be solved by the model. The conclusion was that most of the best-practice recommendations stem from semantic checks, not syntactic, so the model changes would not suffice.

THE INSTITUTE FOR GENOMIC RESEARCH TIGR MO Problems: Kjell and Eric found a couple of inconsistencies in the Ontology which are being investigated - these have been posted into the MGED Ontology tracker. There was an issue about how we handle deprecated instances and classes; this will also be investigated at the next VOW.

THE INSTITUTE FOR GENOMIC RESEARCH TIGR Ontology Helper API: Eric Deutsch and Kjell Petersen developed an API which will create OntologyEntry objects based on ‘leaf node’ data, i.e. the instance values for the OE. Both applications follow MGEDOntology policies. Kjell Petersen extended a Java application written together with Stathis Sideris after the EBI jamboree last December. The application traverses the MO to generate uninitalised data structures according to the policies, with the possible choices from MO available at each node. The extended code will now take a minimum set of 'leaf node' data as input, validate them against MO and instantiate the full data structureof MAGE OntologyEntry. The code is a working prototype. The classes are in cvs: MAGE-Java/MGEDOntologyEntry. Kjell completed the Java Helper such that it produces complete and filled OE objects. Eric Deutsch ported Stathis’ code to a set of perl modules that are used by a further module, ‘MGEDOntologyHelper.pm’, which returns nested OE objects. The helper is nearly complete. It works well in simple cases, but in nested cases, the final ‘value’ doesn’t get inserted. These modules are available in cvs: MAGE-Perl/MAGE/Tools. The code also is a working prototype. Example scripts that use it are located in ‘lib/Perl/MGEDOntology’ as test1.pl, test2.pl, and test3.pl.).