Multilingual Information Exchange APAN, Bangkok 27 January 2005


The general problem
Searching for multilingual resources is not easy:
– on the web
– in metadata catalogues / bibliographic databases
– in full-text documents
Results are generally in the language used in the search query
=> We need a multilingual approach and multilingual tools (thesauri, ontologies, etc.)

What we can achieve (1): Multilingual concept resolution
With a multilingual thesaurus or ontology we can find resources in any language, because we can realize multilingual concept resolution!

What we can achieve (2): Brokering
With a multilingual thesaurus or ontology we can find resources from several sources even if we do not know the terminology and the language used in these sources
Example terms: vessels, crafts, fishing vessels, ships, navio, navire, 船舶, bateau de pêche, fishing boat, fishing vessel
Results in multiple languages from multiple databases
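The brokering idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the grouping of the slide's terms under one concept, the language tags, the source names and the sample records are all hypothetical, and matching is plain substring search rather than anything a real broker would use.

```python
from typing import Dict, List

# Assumed grouping of some of the slide's terms under a single concept.
FISHING_VESSEL_TERMS: Dict[str, List[str]] = {
    "en": ["fishing vessel", "fishing boat"],
    "fr": ["bateau de pêche"],
}

# Hypothetical sources with invented records, one catalogue per language.
SOURCES: Dict[str, List[str]] = {
    "catalogue_en": ["Safety of the fishing vessel fleet", "Rice production statistics"],
    "catalogue_fr": ["Construction d'un bateau de pêche en bois"],
}


def expand(query: str, terms_by_lang: Dict[str, List[str]]) -> List[str]:
    """Return every label of the concept if the query is one of its labels."""
    all_terms = [t for terms in terms_by_lang.values() for t in terms]
    if query.casefold() in (t.casefold() for t in all_terms):
        return all_terms
    return [query]  # unknown term: search for it unchanged


def broker(query: str,
           terms_by_lang: Dict[str, List[str]],
           sources: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Search every source with the expanded, multilingual term list."""
    terms = [t.casefold() for t in expand(query, terms_by_lang)]
    hits: Dict[str, List[str]] = {}
    for name, records in sources.items():
        matched = [r for r in records if any(t in r.casefold() for t in terms)]
        if matched:
            hits[name] = matched
    return hits
```

With these sample records, an English query such as "fishing boat" also returns the French record, because the query is expanded to all labels of the concept before the sources are searched.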

How to build a multilingual Thesaurus / Ontology
Lexicalizations of concepts in multiple languages:
– { … fishing boat; bateau de pêche; 捕捞渔船 … }
For every language we can have synonyms:
– { … fishing vessel, fishing boat, fishing craft … }
– { … bateau de pêche, navire de pêche, … }
– { … 捕捞渔船, … }
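A minimal sketch of this structure, assuming a concept is stored as a map from language code to its list of synonyms. The class name, the identifier "c_fishing_vessel" and the language codes are hypothetical; only the terms themselves come from the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Concept:
    concept_id: str  # hypothetical identifier
    # language code -> all synonyms lexicalizing the concept in that language
    labels: Dict[str, List[str]] = field(default_factory=dict)


def build_index(concepts: List[Concept]) -> Dict[str, Set[str]]:
    """Map every term, in every language, to the concepts it lexicalizes."""
    index: Dict[str, Set[str]] = {}
    for concept in concepts:
        for terms in concept.labels.values():
            for term in terms:
                index.setdefault(term.casefold(), set()).add(concept.concept_id)
    return index


def resolve(term: str, index: Dict[str, Set[str]]) -> Set[str]:
    """Multilingual concept resolution: term in any language -> concept ids."""
    return index.get(term.casefold(), set())


FISHING_VESSEL = Concept("c_fishing_vessel", {
    "en": ["fishing vessel", "fishing boat", "fishing craft"],
    "fr": ["bateau de pêche", "navire de pêche"],
    "zh": ["捕捞渔船"],
})
INDEX = build_index([FISHING_VESSEL])
```

Here resolve("navire de pêche", INDEX) and resolve("捕捞渔船", INDEX) both lead to the same concept, which is what makes a search independent of the query language.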

FAO activities (ongoing)
Food safety ontology (English, Spanish, French)
Fishery ontology (English, Chinese)
Food and Nutrition ontology-based portal (English, Spanish, French)
Extensive work with AGROVOC
– RDFS / OWL version
– Semantic refinements
– Expand multilingual coverage
– Expand subject coverage

The multilingual vocabulary ...
– must cover all concepts of interest to the users in the various languages; at a minimum, all domain concepts lexicalized in any of the participating languages
– must accommodate hierarchical structures suggested by different languages
(Dr. Soergel)

Problems (1)
Translating an English thesaurus into German does not make a German thesaurus
=> whenever possible we need to consider the concept in its entirety (many languages, definitions, "surrounding context", etc.)
Equivalence of terms holds only in some contexts
Non-specialized terms are more difficult to translate
(Dr. Soergel)

Problems (2)
Two terms may mean almost the same thing but differ slightly in meaning or connotation:
– English: alcoholism / French: alcoolisme
– English: vegetable (includes potatoes) / German: Gemüse (does not include potatoes)
If the difference is big enough, one needs to introduce two separate concepts under a broader term; otherwise a scope note needs to clearly instruct indexers in all languages how the term is to be used, so that the indexing stays as free as possible from cultural bias, or reflects multiple biases by assigning several descriptors.
(Dr. Soergel)

Available resources: example
SuperThes, ...
SWAD-Europe initiative: thesaurus activities
– RDF encoding of multilingual thesaurus
  – Multilingual labelling approach (mirroring relations for every language)
  – Interlingual mapping approach (different structures to be mapped)

SWAD-Europe: Inter-Thesaurus Mapping
SKOS mapping:
– Exact
– Inexact
– Major
– Minor
– Partial
– Broad
– Narrow
– AND
– OR
– NOT
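To make the mapping types concrete, here is a small Python sketch rather than actual SKOS/RDF. It reuses the vegetable / Gemüse pair from the Problems (2) slide. The "de:" and "en:" prefixes, the class names and the reading of "broad" as "the target is broader than the source" are assumptions made for this illustration; the AND / OR / NOT combinators are left out.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Iterable, List


class MappingType(Enum):
    EXACT = "exact"
    INEXACT = "inexact"
    MAJOR = "major"
    MINOR = "minor"
    PARTIAL = "partial"
    BROAD = "broad"    # assumed reading: target is broader than source
    NARROW = "narrow"  # assumed reading: target is narrower than source


@dataclass(frozen=True)
class Mapping:
    source: str
    target: str
    kind: MappingType


def targets(term: str,
            mappings: Iterable[Mapping],
            allowed: Iterable[MappingType] = (MappingType.EXACT,)) -> List[str]:
    """Return the mapped terms for `term`, restricted to the allowed mapping types."""
    allowed_set = set(allowed)
    return [m.target for m in mappings if m.source == term and m.kind in allowed_set]


# Gemüse excludes potatoes, so "vegetable" is treated as the broader concept.
MAPPINGS = [
    Mapping("de:Gemüse", "en:vegetable", MappingType.BROAD),
    Mapping("en:vegetable", "de:Gemüse", MappingType.NARROW),
]
```

A strict search would accept only exact mappings and find nothing for "de:Gemüse", while a recall-oriented search could also accept broad mappings and reach "en:vegetable".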

Inter-Thesaurus Mapping: example
Academie, Academy, Buildings

Available resources: another possibility
Use OWL
– Define concepts
– Define terms
– Define strings
– Define relationships between these 3 elements (+ SKOS suggestions)
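The three levels can be illustrated with plain Python dataclasses instead of OWL. The class and attribute names below are hypothetical and do not come from the slide. A concept groups terms, each term belongs to one language, and each term can carry additional string variants such as plural forms.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Term:
    lang: str
    preferred: str                                     # main string of the term
    strings: List[str] = field(default_factory=list)   # variants, e.g. plurals


@dataclass
class Concept:
    concept_id: str                                    # hypothetical identifier
    terms: List[Term] = field(default_factory=list)

    def strings_in(self, lang: str) -> List[str]:
        """All strings (preferred forms and variants) for one language."""
        out: List[str] = []
        for term in self.terms:
            if term.lang == lang:
                out.append(term.preferred)
                out.extend(term.strings)
        return out


CONCEPT = Concept("c_fishing_vessel", [
    Term("en", "fishing vessel", ["fishing vessels"]),
    Term("en", "fishing boat"),
    Term("fr", "bateau de pêche"),
])
```

Keeping strings separate from terms lets spelling or inflection variants be attached without creating new terms or new concepts.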

Available resources: other techniques
NLP
Knowledge discovery: helps with the creation of ontologies in a specific language
Used to create good IS
– Concept extraction
– Multilingual search engine
…

Conclusion
We need multilingual tools
– Ontologies are better than traditional thesauri
The task is not easy
– Subject experts are essential
– NLP could help
We need tools
– To help experts carry out the mapping
– To do annotations
– …

Live demo

Thank you.