Wheat Data Interoperability. 2  Endorsed in March 2014  Focus:  Improve/reach semantic interoperability of Wheat data  The WG will focus first on.

Slides:



Advertisements
Similar presentations
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Status on the Mapping of Metadata Standards
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
FAO and UNESCO-IOC/IODE Combine Efforts in their Support of Open Access Written by Marc Goovaerts, U. Hasselt, BE.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Interoperability Framework Overview March 24, 2010 Presented by: Douglas Fridsma, MD, PhD Acting Director, Office of Interoperability & Standards ONC HIT.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Chinese-European Workshop on Digital Preservation Beijing (China), July.
Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator Session: Managing Ecological Data for Effective Use and Reuse Patrice Seyed.
Supporting open data & models in the agri-food sector: Experiences from the RDA Agriculture Data IG & Wheat Data Interoperability WG V. Protonotarios (Agro-Know),
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
RDA Wheat Data Interoperability Cookbook and last developments 9 th March 2015, San Diego.
1 Federal Health IT Ontology Project (HITOP) Group The Vision Toward Testing Ontology Tools in High Priority Health IT Applications October 5, 2005.
The Marine Metadata Interoperability Project A Model for Community Collaboration September 23, 2010 Nan Galbraith WHOI.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
1 SIF Interoperability Assessment Presented by Steve Browdy.
© 2014 GB Group Limited RDF as a Universal Healthcare Exchange Language March, 2015 Angell Wang.
CaBIG Semantic Infrastructure 2.0: Supporting TBPT Needs Dave Hau, M.D., M.S. Acting Director, Semantic Infrastructure NCI Center for Biomedical Informatics.
Interest group for semantic standards/Linked Open Data in TDWG Luca Matteis, Marie-Angelique Laporte, Elizabeth Arnaud.
Molecular marker data and their impact on gene bank management Chris Richards NCGRP, Fort Collins, CO Curator Workshop, Atlanta Georgia.
Jenn Riley Metadata Librarian IU Digital Library Program New Developments in Cataloging.
GENOME-CENTRIC DATABASES Daniel Svozil. NCBI Gene Search for DUT gene in human.
Interoperability Framework Overview Health Information Technology (HIT) Standards Committee June 24, 2010 Presented by: Douglas Fridsma, MD, PhD Acting.
Development Process and Testing Tools for Content Standards OASIS Symposium: The Meaning of Interoperability May 9, 2006 Simon Frechette, NIST.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
Building the Research Data Alliance Dr. Beth Plale Vice Chair of Technology Programs, RDA/US Indiana University.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Wheat Data Interoperability Esther DZALE YEUMO KABORE Richard FULSS.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
SDMX IT Tools Introduction
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
The (IMG) Systems for Comparative Analysis of Microbial Genomes & Metagenomes: N America: 1,180 Europe: 386 Asia: 235 Africa: 6 Oceania: 81 S America:
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Manufacturing Systems Integration Division Development Process and Testing Tools for Content Standards Simon Frechette National Institute of Standards.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
1 LS DAM Overview August 7, 2012 Current Core Team: Ian Fore, D.Phil., NCI CBIIT, Robert Freimuth, Ph.D., Mayo Clinic, Mervi Heiskanen, NCI-CBIIT, Joyce.
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
Cyril Pommier et al. / Feedback from the RDA and WheatIS recommendations for Wheat Data Interoperability Adoption of the Wheat Data Interoperability Guidelines.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
C. Pommier, E. Dzalé Yeumo Kabore, E. Arnaud, P. Larmande, M. Alaux, H
Wheat Data Interoperability Esther DZALE YEUMO KABORE Richard FULSS
knowledge organization for a food secure world
Functional Annotation of the Horse Genome
Applications of IFLA Namespaces
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
OBI – Standard Semantic
LOD reference architecture
Introduction to the MIABIS SOP Working Group
Bird of Feather Session
Australian and New Zealand Metadata Working Group
Cultivating Semantics for Data in Agriculture and Nutrition
Presentation transcript:

Wheat Data Interoperability

2  Endorsed in March 2014  Focus:  Improve/reach semantic interoperability of Wheat data  The WG will focus first on the following data types: Single Nucleotide Polymorphisms (SNPs), Genomic annotations, Phenotypes, Genetic Maps, Physical Maps, Germplasm, expression data. Wheat Data Interoperability

3  « The ability of computers systems to exchange data with unambiguous, shared meaning.» (Wikipedia)  Make everyone speak the same language (common data formats and vocabularies)  Based on the Wheat research community practices  Leverage existing data formats and vocabularies  Translation mecanisms  Leverage existing translations rules  Promote RDF as common information representation format Achieving semantic interoperability

4  RDF is self describing  RDF is easy to map from other data representations  RDF captures information, not syntax  Multiple data models and vocabularies can be easily combined and interrelated  RDF enables smarter data use and automated data translation Why RDF Quoted from « Why RDF as a Universal Helathcare Exchange Language » by David Booth, Ph.D., Hawaï Resource Group:

5 Towards semantic interoperability Survey/Interviews Identify data formats and vocabularies used within/by the Wheat research community Workshops Identify/agree on the use of common data formats and vocabularies Access then improve the level of accessibility and interoperability of data formats and vocabularies Collect interoperability use cases Implementation Hub of linked vocabularies Prototype - Assess the gain of interoperability based on collected use cases

6 Where we are 1/2

7 The survey: participants Top wheat producers(in million metric tons) RankCountry Source: UN Food & Agriculture Organization [71]UN Food & Agriculture Organization [71] —World China India United StatesUnited States France Russia Australia Canada Pakistan Germany Turkey Ukraine Iran Kazakhst anKazakhst an United KingdomUnited Kingdom Argentina Source :

8 The survey: participants Additional domains (Other) Molecular biology Genetic resources Phenotyping Molecular plant pathology Entomology Molecular genetics Plant biotechnology Genomics Climate Data management Modeling Ontologies, taxonomies …

9 The survey: participants Also work with ↙ SNPs (76) Genomic annotations (64) Phenotypes (121) Genetic maps (72) Physical maps (41) Germplasms (104) SNPs 70,3%50,4%76,4%75,6%54,8% Genomic annotations 59,2%33,9%54,2%68,3%37,5% Phenotypes 80,3%64,1%81,9%75,6%89,4% Genetic maps 72,4%60,9%48,8%92,7%51,9% Physical maps 40,8%43,8%25,6%52,8%30,8% Germplasms 75,0%60,9%76,9%75,0%78,0%

10 The survey: use of data formats Data typeData formats StandardizedTool specificNon standardized SNPsVCFBAM/SAM, BED, VARSCAN, VEP Genomic annotations Genbank Flat File, General Feature Format (GFF), EMBL GermplasmsMPCD, ABCD, Darwin Core, Darwin Core Germplasm Grin Globaltabulated Gene expression Many format standards laid out by repositories such as NCBI (GEO) and EBI Array Express Physical mapsGFFCmap, fpc Genetic mapsCmap, gnpmap PhenotypesDrops, ped, isa- tab, ephesis tabulated

11 Where we are 2/2 Survey/Interviews A survey launched in April 2014 Answers from more than 200 respondents Workshops 1-2 October 2014 List of recommended data formats and vocabularies for each data type List of follow up actions for each data type (standardization e.g for traits, minimal set of metadata e.g for SNPs files provenance or for markers, QTL and maps handling, check for existing mapping tools, etc..) List of interoperability use cases

12 Workshop summary RecommendationsFollow up actions SNPsUse of VCF data formatLook at a metadata set to contextualize the provenance of SNPs files Genomic annotations Use of GFF3 data format Use of ontologies to fill « Attributes » column (a list of feature attributes in the format tag=value) Provide description guidelines for filling content for column 9 GermplasmsUse of MPCD and Darwin Core Germplasm formats Check how to integrate with tool specific formats (Grin Global, Genesys) Provide a table like human readable format for DWC Germplasm Gene expression Follow existing format standards laid out by repositories (NCBI GEO, EBI Array Express Check for mapping and conversion tools Physical maps Same as for Genomic annotations Genetic maps Data formats depend on tools that are used, rather concentrate on metadata harmonization Look at a minimal metadata set to handle markers, QTL, maps Obtain details for linking requirements PhenotypesUse of isa-tab data formatStandardize the traits metadata Improve the reference to ontologies used for traits

13  Biosharing registries working group  Register the Wheat Data Interoperability WG outputs: guidelines, standards  Data types registries working group  Register the definitions for each data format used within/by the Wheat research community plus, where useful, references to tools that can operate on them.  Metadata interest group  Specific advices:  what requirements should we consider when thinking of a minimal metadata set in general?  what standard could we base on to build a specific minimal metadata set (for example to contextualize the provenance of SNPs files)? Possible interactions with other WGs