GACS: Towards a common concept scheme for information in agriculture. International Conference on Big Data and Knowledge Discovery, Bangalore, March 9-11, 2016

Presentation transcript:

GACS: Towards a common concept scheme for information in agriculture International Conference on Big Data and Knowledge Discovery Bangalore, March 9-11, 2016 Food and Agriculture Organization of the UN

GACS project: participants
Working group:
– FAO (AGROVOC)
– CABI (CABT)
– NAL (NALT)
Steering committee:
– Those above, plus
– INRA
– CGIAR consortium
– AgroKnow
Caterina Caracciolo, 11/03/2016

GACS = Global Agriculture Concept Scheme
To make a common repository of terminological and conceptual information in agriculture
– To enable.. of information
To achieve efficiency of scale by maintaining core concepts in cooperation
A voluntary project
– So far, funded only by in-kind contributions of the partners

Why bother? #1
To have smoother access to data
Thesauri/vocabularies are used in applications, as part of document metadata

Why bother? #2
To “inherit” information from the others
Example: outbound links of AGROVOC (SKOS mappings)
[Figure labels: FAO: Biotechnology Glossary, Geopolitical Ontology, AGROVOC]

A vision for the future

Requirements
1. Ensure compatibility with existing databases
2. Maximize reuse of work, e.g., available translations
3. Use RDF-based technologies (e.g., URIs, SKOS) for integration with other resources on the web
4. Be available as Linked Open Data

Three thesauri

First estimate of overlap*
* Obtained by automatic mapping using AgreementMakerLight

The process
1. Select a core from the three thesauri
2. Map concepts to one another
3. Clean up the results

Step 1: concept selection
From each thesaurus: the 10,000 concepts most frequently used in its respective database.
Plus: all countries and all higher-level organisms
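The selection rule in Step 1 can be sketched in a few lines. This is only an illustration: the function name `select_core`, the concept identifiers, and the usage counts are hypothetical, and the slides do not say how usage was actually counted.

```python
from collections import Counter

def select_core(usage_counts, always_include, top_n=10_000):
    """Return the top_n most frequently used concept ids,
    plus the ids that are included regardless of usage
    (e.g. countries, higher-level organisms)."""
    top = [cid for cid, _ in Counter(usage_counts).most_common(top_n)]
    return set(top) | set(always_include)

# Hypothetical usage counts from one thesaurus's bibliographic database.
usage = {"c_rice": 5200, "c_maize": 4100, "c_soil": 300, "c_india": 2}
countries = {"c_india"}  # always included, however rarely used

core = select_core(usage, countries, top_n=2)
print(sorted(core))
```

With `top_n=2`, the rarely used country concept still ends up in the core, which mirrors the "plus" rule on the slide.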

Step 2: concept mapping
Automatic mapping
– Tool: AgreementMakerLight, applied to the full thesauri for completeness
Manual validation
– Tool: spreadsheets
– Evaluators processed 60 to 150 rows/hour
– Evaluation took 500 to 600 hours for GACS Beta
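As a rough sanity check, the rates and hours quoted for manual validation imply a range for the number of spreadsheet rows reviewed. The slides do not state the row count, so the bounds below are a back-of-the-envelope estimate that assumes any rate could combine with any total.

```python
# Figures quoted on the slide.
rate_low, rate_high = 60, 150      # rows evaluated per hour
hours_low, hours_high = 500, 600   # total hours for GACS Beta

# Assumption: rows = rate * hours, with rate and hours varying independently.
rows_min = rate_low * hours_low
rows_max = rate_high * hours_high

print(rows_min, rows_max)  # 30000 90000
```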

Step 3: clean up results
The goal is:
– To obtain a “core” of concepts usable in real life to replace (part of) AGROVOC and NALT
– To have a “core” to which other resources may hook up, e.g., thesauri, code lists

Something to consider...

Differences in concept granularity
AGROVOC: [animal oil], [animal fats]
NALT: [animal fats and oils]

Different modelling of scientific names and taxonomies
– Very many scientific and common names of organisms in each thesaurus
– Common names and scientific names: are they different things, or different names for the same things?
  – Salmo salar, Salmo carpio, Salmo ferox
  – salmon

The same thing?

Scientific names and taxonomies
AGROVOC – has separate hierarchies (taxonomic / common sense), but often not very up-to-date
CABI – has mostly scientific names
NAL – has both scientific and common names, all together; more up-to-date

Lumps: clusters of concepts originating from the mapping
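One way to detect such clusters is to treat the validated mappings as edges of a graph and compute its connected components. The sketch below assumes a working definition that the slides do not spell out: a cluster counts as a "lump" when some thesaurus contributes more than one concept to it. All identifiers are hypothetical, loosely modelled on the granularity example above.

```python
from collections import defaultdict

def find_clusters(mappings):
    """Group concept ids into connected components (union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in mappings:
        parent[find(a)] = find(b)

    clusters = defaultdict(set)
    for node in list(parent):
        clusters[find(node)].add(node)
    return list(clusters.values())

def is_lump(cluster):
    """Assumed definition: some thesaurus appears more than once."""
    sources = [cid.split(":", 1)[0] for cid in cluster]
    return len(sources) != len(set(sources))

# Hypothetical mapping pairs; the prefix names the source thesaurus.
mappings = [
    ("agrovoc:animal_oil", "nalt:animal_fats_and_oils"),
    ("agrovoc:animal_fats", "nalt:animal_fats_and_oils"),
    ("agrovoc:rice", "cabt:rice"),
    ("cabt:rice", "nalt:rice"),
]

clusters = find_clusters(mappings)
lumps = [c for c in clusters if is_lump(c)]
print(len(clusters), "clusters,", len(lumps), "lump(s)")
```

Here the rice cluster is a clean one-to-one-to-one match, while the animal fats cluster is flagged because two AGROVOC concepts map to a single NALT concept.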

Example: lumps in March 2015
NOW: all solved
15,090 concepts
972 lumps

Only hierarchies?
– No, add thematic groups
Plus Product

Currently: Beta
~15,000 concepts, ~400,000 terms, 29 languages
All RDF-SKOS
Editing and visualization tools from partners:
– VocBench (U Tor Vergata, Rome IT & FAO)
– Skosmos (National Library of Finland)

Beta 1.6

AGROVOC and NALT may be phased out
Extension module(s)?
[Diagram labels: GACS, CABT, GACS]

Ongoing
Quality improvement
– Hierarchies, labels, scope notes, definitions
Consolidate decisions
– E.g., on taxonomies and common names
Write guidelines
1. To document decisions taken
2. To guide future editors of GACS
Some new issues coming up
– Copyright on definitions?

Next
1. Release GACS data officially
2. Define AGROVOC and CABI as “extension”

References
A web site will be set up soon.
For now, all reports are available from AIMS.
To follow progress, subscribe to the AIMS community!

Forming GACS concepts by merging the source concepts and aggregating their information

GACS concept:
rice
  UF paddy
  UF paddy rice
cereals
  UF feed cereals
  UF small grain cereals (grain)
Oryza sativa
  UF Oryza glutinosa
  UF Oryza indica
  UF Oryza japonica
  UF Oryza sativa … (subsp, var etc.)
Oryza
  UF Padia
  UF rice (plant)

Mapped to the thesauri of origin (exactMatch):
agrovoc:c_5435 cabt:82917 nalt:56271
agrovoc:c_5438 cabt:82935 nalt:56277
agrovoc:c_1474 cabt:26247
agrovoc:c_6599 cabt: nalt:56293

(Note: GACS uses SKOS, not traditional thesaurus tags)
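The merge step can be sketched as follows. This is a minimal illustration, not the project's actual tooling: the function `merge_concepts`, the placeholder source identifiers, and the split of labels across the three thesauri are all assumptions. The slide shows only the aggregated result, not which thesaurus contributed which term.

```python
def merge_concepts(pref_label, sources):
    """Merge mapped source concepts into one GACS-style concept.

    sources maps a source concept id to its list of labels.
    Labels other than the preferred one become alternative labels,
    deduplicated in first-seen order; every source id is kept as
    an exactMatch link back to its thesaurus of origin.
    """
    alt_labels = []
    for labels in sources.values():
        for label in labels:
            if label != pref_label and label not in alt_labels:
                alt_labels.append(label)
    return {
        "prefLabel": pref_label,
        "altLabel": alt_labels,
        "exactMatch": sorted(sources),
    }

# Placeholder ids and an assumed distribution of labels per thesaurus.
sources = {
    "agrovoc:EXAMPLE_RICE": ["rice", "paddy"],
    "cabt:EXAMPLE_RICE": ["rice", "paddy rice"],
    "nalt:EXAMPLE_RICE": ["rice", "paddy"],
}

gacs_rice = merge_concepts("rice", sources)
print(gacs_rice)
```

In SKOS terms, the "UF" entries on the slide correspond to `skos:altLabel` values, and the links to the source thesauri correspond to `skos:exactMatch`.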