FAO of the UN Library and Documentation Systems Division DC 2002 Florence October 02 A Comprehensive Framework for Building Multilingual Domain Ontologies: Creating an ontology on Food Safety, Animal and Plant Health (OFsAPH) Boris Lauser Tanja Wildemann,Allison Poulos Frehiwot Fisseha, Johannes Keizer, Stephen Katz DC 2002: Florence16 th October 2002
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Agenda Introduction: –Motivation –Ontologies and modeling approach Framework for ontology creation Application of framework: –Creation of the Food Safety Ontology prototype Outlook: –Current project status –Application scenario Questions Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Motivation Rapid growth in electronically available information Badly performing search tools Pages often inconsistently indexed BUT Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Example: Full text search Who established the Agreement of Agriculture Search … … … …Agreement… … …Agriculture… … International organization standards WTO WHO FAO Agreement of Agriculture establish... Background knowledge …. … WTO established the Agreement of Agriculture in …. … Specified search Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Example: Indexed Search Who established the Agreement of Agriculture Search … … International organization standards WTO WHO FAO Agreement of Agriculture establish... Background knowledge Agricultural Agreement Synonym …. … WTO established the Agreement of Agriculture in …. … Document Indexed with “Agricultural Agreement” High Chance Of retrieval Low Chance Of retrieval Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Ontology as form of background knowledge Goal: To create an explicit, formal specification of a shared conceptualization of a domain of interest Ontology Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Ontology: conceptual model Concept label synonym stem description Concept relationship Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Ontology: RDFS model Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Introduction Framework Application Outlook Discussion Agenda Introduction: –Motivation –Ontologies and modeling approach Framework for ontology creation Application of framework: –Creation of the Food Safety Ontology prototype Outlook: –Current project status –Application scenario Questions
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 The framework A comprehensive framework for building a domain ontology Focus: Acquisition and Development step In the lifecycle of ontology creation Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 The framework: processes Ontology acquisition (2 paths) –Creating core ontology from scratch –Automatic extraction of ontological knowledge from base vocabulary and domain specific text sources Merging into one ontology Refinement and Extension Evaluation and Assessment Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Core ontology Manual creation Focused Web crawling List of domain start web pages List of frequent terms List of domain Specific documents Term BT t1 NT t2 RT t3 Term USE t3 … Thesaurus RDFS ontology model convert Ontology pruning and learning algorithm Domain corpus Generic corpus Pruned ontology List of frequent terms Manual creation of core ontology 1 st acquisition approach 2 nd acquisition approach Text To Onto The Framework: overview Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Introduction Framework Application Outlook Discussion Agenda Introduction: –Motivation –Ontologies and modeling approach Framework for ontology creation Application of framework: –Creation of the Food Safety Ontology prototype Outlook: –Current project status –Application scenario Questions
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Creation of the core ontology 67 concepts 91 relationships Information Resources: Brainstorming Codex Alimentarius SPS Agreement Core Ontology Ontology Editor (SOEP) 3 subject specialists Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 1 st Acquisition Approach: Focused Crawling Focused Web Crawling 68 concepts 91 relationships Core Ontology List of extracted main sites: Gateway to Government Food Safety Information Center for Food Safety & Applied Nutrition Canadian Food Inspection Agency Iowa State University - Food Safety Project Iowa State University - Food Safety Consortium United States Department of Agriculture, Food Safety and Inspection Service Foodborne Ilness Education Information Center World Health Organization – Regional Office for Europe Food Safety Programme List of 257 food Safety domain web pages Grouping into Main sites Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Selection of Documents Domain Set: Manual selection –11 documents Codex Alimentarius: Description, Code of Ethics, Food Hygiene, Food Import and Export Report of consultation on risk assessment of microbiological hazards in foods Ensuring food quality and safety, Protecting food quality and safety Domain Set: Focused Crawler Output –5 documents extracted: Generic documents: Manual Selection –8 documents Several documents of the animal feed domain Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 2 nd Acquisition Approach: Thesaurus Pruning Food Safety Documents Generic Documents Rice BT … NT … RT … RT … RT … … AGROVOC keywords Automatic Pruning Extracted ontological structure: # of concepts: 504 taxonomic depth: 5 5 evaluation runs 1632 frequent terms Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Merging of Ontologies and Refinement 1632 Terms from pruning process 12 new concepts extracted Ontological structure extracted from AGROVOC 23 new concepts With hierarchical relationships extracted 67 concepts 91 relationships Core Ontology Assembly step 92 new relationships created Food Safety Ontology Prototype 102 concepts 183 relationships Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Final Prototype Food Safety Ontology Prototype 102 concepts 183 relationships 1.79 relationships concept Core Ontology 67 concepts 91 relationships relationships concept 1.36 Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October Concepts Agreement of Agriculture ALOP ALOP, Codex ALOP, OIE ALR animal byproducts animal diseases animal fats animal feed additives animal feed contaminants animal feed ingredients animal feeding animal health animal processing animal products animal waste animals antibiotics Bacteria bakery products biological agent CAC Caragene protocol CCFH cereal products cheese chemical agent Codex Committees commodities Consumer health diseases eggs exposure assessment fabrication FAO fishes food food additives food consumption food contaminants food export food import food ingredients food safety food-borne diseases fungi good hygienic practices hazard hazard characterization hazard identification human health human nutrition humans international agreements international food trade international governmental organizations IPPC labelling meat microorganisms microorganisms byproducts microorganisms processing microorganisms products microorganisms waste milk milk products non-pathogens OIE packaging parasites pathogens physical agent plant byproducts plant diseases plant feed additives plant feed contaminants plant feed ingredients plant feeding plant health plant processing plant products plant waste plants processed animal products processed plant products processed products processing risk analysis risk assessment risk characterization risk communication risk management slaughter SPS agreement standards sugar TBT agreement transport viruses WHO WTO Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October Unique Relationships adopts adversely affect are included in are produced by are the source for can be used as constitutes describes determines ensures establishes govern has economical impact on Implies includes influences interacts with is a consequence of is a step in the process is comprised of is established by is protected by originate from refer to requires rule sustains trades uses Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Evaluation: Food Safety ontology web browser Open to users and subject specialists for evaluation oportal/dispatcherhttp://localhost:8080/fa oportal/dispatcher Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Agenda Introduction Framework Application Outlook Discussion Introduction: –Motivation –Ontologies and modeling approach Framework for ontology creation Application of framework: –Creation of the Food Safety Ontology prototype Outlook: –Current project status –Application scenario Questions
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Current project status Ontology creation: 2 nd application of framework Introduction Framework Application Outlook Discussion Food Safety Ontology Prototype 102 concepts 183 relationships Text To Onto ~ 100 domain Specific documents AGROVOC Revised Ontology Pruner List of frequent terms Pruned Agrovoc: ~3000 concepts Ontology Editor (OIModeler) Merging & Refinement 1 st acquisition approach 2 nd acquisition approach
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Usage Scenario Search: Risk assessment Biosecurity Portal: … … Ontology Enabled Search Application Ontology based search extension Risk characterization Hazard characterization Hazard identification Exposure assessment Risk assessment Risk management Risk communication Risk analysis Is a Step In the process Is a Step In the process Extended Search Mark the terms below, which you might want to include in your search: Interacts with Risk assessment Risk characterization Risk analysis Search: Ontology Doc base Search results Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Current project status Application scenario: 2 use cases Use Case 1: Indexing the subject of a document Use Case 2: Searching information on the portal Risk;… Subject Title … … OFsAPH Risk;… Search … …
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Current project status Application: Ontology Browser for the Ontology on Food Safety, Animal and Plant Health
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Sample Application Ontoweb –European funded platform for collaborative research in ontology and semantic web issues –Ontology based web portal –Knowledge discovery through path completion –Example: search for ‘ben rudi’ – Introduction Framework Application Outlook Discussion
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Enabler: KAON Tool Suite Open Source! Java based! Highly portable!
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Agenda Introduction Framework Application Outlook Discussion Introduction: –Motivation –Ontologies and modeling approach Framework for ontology creation Application of framework: –Creation of the Food Safety Ontology prototype Outlook: –Current project status –Application scenario Questions
FAO of the UN Library and Documentation Systems Division DC 2002 Florence, Italy October 02 Search: Risk assessment Biosecurity Portal: … … Ontology Enabled Search Application Doc base Risk characterization Hazard characterization Hazard identification Exposure assessment Risk assessment Risk management Risk communication Risk analysis Is a Step In the process Is a Step In the process Extended Search Mark the terms below, which you might want to include in your search: Interacts with Risk assessment Risk management Risk analysis Search: Ontology based search extension: … 3. Pastoral risk management for disaster prevention and preparedness in Central Asia with special reference to the case of Mongolia 4. … … Search Results has economical impact on International agreements Agreement of Agriculture … 2. Doc 3. Doc … is established by WTO