Ontology Evaluation 2009-10-22. Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches.

Slides:



Advertisements
Similar presentations
Experiences in Evaluation and Selection of Ontologies Bruno Grilo INESC-ID H. Sofia Pinto IST/INESC-ID
Advertisements

Processing XML Keyword Search by Constructing Effective Structured Queries Jianxin Li, Chengfei Liu, Rui Zhou and Bo Ning Swinburne University of Technology,
Optimizing search engines using clickthrough data
Towards Adaptive Web-Based Learning Systems Katerina Georgouli, MSc, PhD Associate Professor T.E.I. of Athens Dept. of Informatics Tempus.
Performance Evaluation
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Definitions Performance Appraisal
OASIS Reference Model for Service Oriented Architecture 1.0
Introduction Information Management systems are designed to retrieve information efficiently. Such systems typically provide an interface in which users.
Klaus M. Frei1 WordNet „An On-line Lexical Database“ (Miller, G. A.; Beckwith, R.; Fellbaum, Chr.; Gross, D.; Miller, K. 1993, title). Based on psycho-linguistic.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Learning to Advertise. Introduction Advertising on the Internet = $$$ –Especially search advertising and web page advertising Problem: –Selecting ads.
Web Mining Research: A Survey
Queensland University of Technology An Ontology-based Mining Approach for User Search Intent Discovery Yan Shen, Yuefeng Li, Yue Xu, Renato Iannella, Abdulmohsen.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Formulating objectives, general and specific
1 The Literature Review March 2007 (3). 2 The Literature Review The review of the literature is defined as a broad, comprehensive, in- depth, systematic,
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Framework for K-12 Science Education
In The Name Of God. Jhaleh Narimisaei By Guide: Dr. Shadgar Implementation of Web Ontology and Semantic Application for Electronic Journal Citation System.
Ontology Learning from Text: A Survey of Methods Source: LDV Forum,Volume 20, Number 2, 2005 Authors: Chris Biemann Reporter:Yong-Xiang Chen.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. Collaborative Building of Controlled Vocabularies Crosswalks Mateusz.
Chapter 8 Architecture Analysis. 8 – Architecture Analysis 8.1 Analysis Techniques 8.2 Quantitative Analysis  Performance Views  Performance.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
A Taxonomy of Evaluation Approaches in Software Engineering A. Chatzigeorgiou, T. Chaikalis, G. Paschalidou, N. Vesyropoulos, C. K. Georgiadis, E. Stiakakis.
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
Grouping search-engine returned citations for person-name queries Reema Al-Kamha, David W. Embley (Proceedings of the 6th annual ACM international workshop.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Ontology Engineering & Maintenance Semantic Web - Spring 2008 Computer Engineering Department Sharif University of Technology.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
Web Usage Mining for Semantic Web Personalization جینی شیره شعاعی زهرا.
O NTOLOGY E VALUATION AND R ANKING USING O NTO QA By. Samir Tatir and I.Budak Arpinar Department of Industrial Engineering Park Jihye.
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
Illustrations and Answers for TDT4252 exam, June
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Standard Setting Results for the Oklahoma Alternate Assessment Program Dr. Michael Clark Research Scientist Psychometric & Research Services Pearson State.
Using Meta-Model-Driven Views to Address Scalability in i* Models Jane You Department of Computer Science University of Toronto.
OntoQA: Metric-Based Ontology Quality Analysis Samir Tartir, I. Budak Arpinar, Michael Moore, Amit P. Sheth, Boanerges Aleman-Meza IEEE Workshop on Knowledge.
Progress Report (Concept Extraction) Presented by: Mohsen Kamyar.
1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Mariano Fernández López &Asunción Gómez Pérez The integration of OntoClean in WebODE Mariano Fernández-López Asunción Gómez-Pérez Artificial Intelligence.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
Presented By- Shahina Ferdous, Student ID – , Spring 2010.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme ISWC 2008 Hyewon Lim January.
Ontology and the lexicon Nicola Guarino and Christopher A. Welty(2004). An Overview of OntoClean Weber ( 張澄清 ) 2014/04/23 1.
Developing Assessment Instruments Instructional Design: Unit, 3 Design Phase.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
Ontology Evaluation and Ranking using OntoQA Samir Tartir and I. Budak Arpinar Large-Scale Distributed Information Systems Lab University of Georgia The.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
CCNT Lab of Zhejiang University
IB Environmental Systems and Societies
EER Assurance Presentation of Issues and Project Update June 2018
Rafael Almeida, Inês Percheiro, César Pardo, Miguel Mira da Silva
CS 430: Information Discovery
An Empirical Study of Property Collocation on Large Scale of Knowledge Base 龚赛赛
Introduction Task: extracting relational facts from text
[jws13] Evaluation of instance matching tools: The experience of OAEI
Filtering Properties of Entities By Class
An Approach to Abstractive Multi-Entity Summarization
Connecting the Dots Between News Article
Presentation transcript:

Ontology Evaluation

Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches

Motivation Ontologies are to be widely adopted in the semantic web and other semantics-aware applications. Users facing a multitude of ontologies need to have a way of assessing them and deciding which one best fits their requirements. People constructing an ontology need a way to evaluate the resulting ontology and possibly to guide the construction process and any refinement steps.

Evaluation Criteria Criteria – 正确性( Correctness ) – 一致性( Consistency ) – 可扩展性( Expandability ) – 清晰性( Clarity ) – 最小本体约定( Minimal Ontological Commitment ) – 最小本体偏好编码( Minimal Encoding Bias )

Criteria Correctness – 对现实世界而言,本体的定义应该是正确的。 Consistency – 应该支持与其定义相一致的推理。它所定义的公理以及用自然 语言进行说明的文档都应该具有一致性 Clarity – 本体必须有效的说明所定义术语的意思。定义应该是客观的, 与背景独立的。当定义可以用逻辑公理表达时,它应该是形式 化的。定义应该尽可能的完整。所有定义应该用自然语言加以 说明

Criteria(2) Expandability – 支持在已有的概念基础上定义新的术语,以满足特殊的需求, 而无须修改已有的概念定义 Minimal Ontological Commitment – 本体约定应该最小,只要能够满足特定的知识共享需求即可 Minimal Encoding Bias – 概念的描述不应该依赖于某一种特殊的符号层的表示方法。 因为实际的系统可能采用不同的知识表示方法

Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches

Schema Metrics – Address the design of the ontology schema – Schema could be hard to evaluate: domain expert consensus, subjectivity etc. Metrics – Relationship diversity – Depth / Breadth / Fanout – Tangledness Evaluation Measures

Evaluation Measures( 1 ) Relationship diversity – This measure differentiates an ontology that contains mostly inheritance relationships (≈ taxonomy) from an ontology that contains a diverse set of relationships. Schema Depth – This measure describes the distribution of classes across different levels of the ontology inheritance tree

Fanout factor Tangledness

Evaluation Measures(2) Instance level Metrics – Evaluate the placement, distribution and relationships between instance data which can indicate the effectiveness of the schema design and the amount of knowledge contained in the ontology.

Evaluation Measures( 3 ) Class Utilization – Evaluates how classes defined in the schema are being utilized in the KB.

Evaluation Measures ( 4 ) Class Importance (popularity) – This metric evaluates the importance of a class based on the number of instances it contains compared to other classes in the ontology. Relationship Utilization – This metric evaluates how the relationships defined for each class in the schema are being used at the instances level.

Evaluation Measures ( 5 ) Ontology Score Calculation

Criteria & Measures Some measures cover a range of criteria but not completely, for example, depth and breadth. These two measures related to expandability but the relation is limited. Some criteria are difficult to quantify, for example, conciseness, completeness,clarity. Some measures do not resolve to any criteria, for example, connectivity and importance.

Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches

Golden Standard Evaluation Evaluating lexicon and taxonomy Criteria Based Evaluation Task Based Evaluation Evaluation Approaches

Rationale – Compare an ontology with another ontology which is deemed to be the benchmark Methods – Measure similarities between ontologies both lexically and conceptually Golden Standard Evaluation

Lexicon Similarity Measure

Concepts Similarity Measure

Example

Lexicon Measure Premise – The term frequency is an indication of naturalness, – More users would use natural terms more frequently than other synonyms. Method – based on the term frequency counts in results of the search engine.

Measure of Taxonomies Rationale – If an ontology contains the relationship X IS-A Y then this a natural statement if there are many Web documents that contain both X and Y. Methods – For each concept pair in an IS-A relationship (X i,Y j ), generating a non-related concept pair as ((X i or Y j ),Z k ) where Z k is a randomly selected concept. – Naturalness of an IS-A relationship can be approximated by a Google search.

Evaluating Ontology Decisions with ONTOCLEAN Nicola Guarino, Christopher Welty Communications of the ACM,2002

Correctness Evaluation Rationale – For every class in an ontology, associated with labels or metaproperies which include essence, rigidity, identity and unity,and according to constraints of metaproperties to decide the correctness of ontology modeling. Metaproperties – essence, Rigidity(rigidity,semi-rigidity,anti-rigidity) – Identity, unity

Metaproperties ( 1 ) Essence – A property of an entity is essential to that entity if it must be true of it in every possible world. Rigidity – Rigidity A special form of essentiality is rigidity a property is rigid if it is essential to all its possible instances – semi-rigidity properties that are essential to some entities and not essential to others – Anti-rigidity properties that are not essential to all their instances

Metaproperties(2) Identity – Identity refers to the problem of being able to recognize individual entities in the world as being the same (or different), Unity – Unity refers to being able to recognize all the parts that form an individual entity.

Constraints Given two properties, p and q, when q subsumes p the following constraints hold: – If q is anti-rigid, then p must be anti-rigid; – If q carries an identity criterion, then p must carry the same criterion; – If q carries a unity criterion, then p must carry the same criterion; – If q has anti-unity, then p must also have anti-unity.

AutomaticEvaluationofOntologies (AEON) Johanna,DennyVrande,YorkSure ISWC2005

Automatic Evaluation of Correctness Methods – In term of the meaning of metaproperties,defining lexico-syntactic patterns on the Web to obtain positive and negative evidence for rigidity, identity, unity of concept Architecture

Automatic Evaluation of Correctness(2) Patterns – Rigidity(negative evidence) – Unity(negative evidence) – Unity(positive evidence)

Tasks based evaluation Rationale – Based on the competency of the ontology in completing tasks. By measuring its performance in a quantitative manner within the context of the application. Strength – whether an ontology is suitable for the application or task in a quantitative manner by measuring its performance within the context of the application. Weak – An evaluation for one application or task may not be comparable with another task. Hence, evaluations need to be taken for each task being considered

Ontology Evaluation Using Wikipedia Categories for Browsing Metrics – Depth / Breadth / Fanout – Tangledness Method – Comparing original category of wikipedia with generated category with clustering method and untangled category. – For given browsing task, measuring efficiency and effectiveness of browsing task according to browsing history.

Browsing Task Subtree variation from Wikipedia

Wikipedia tree

Experimental Conclusion User behaviour – Exhibited exploratory behaviour if subtree did not help – Tended to backtrack more if the subtree was not helping with the task at hand Original Wikipedia subtree(a) – Helped users perform better with tasks that were less broad Wikipedia untangled subtree(b) – Generally users backtracked less on broader tasks – Generally found more definite relevant articles in the broadest task in both domains Generated subtree(c) – Users obtain more mostly-relevant but not defnitely-relevant articles in Domain X – Users tended to perform better from Domain Y than X

Q/A?