Ontology Maintenance with an Algebraic Methodology: a Case Study Jan Jannink, Gio Wiederhold Presented by: Lei Lei.

Slides:



Advertisements
Similar presentations
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Advertisements

Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
TU e technische universiteit eindhoven / department of mathematics and computer science Modeling User Input and Hypermedia Dynamics in Hera Databases and.
The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Distributed DBMS©M. T. Özsu & P. Valduriez Ch.15/1 Outline Introduction Background Distributed Database Design Database Integration Semantic Data Control.
Fast Algorithms For Hierarchical Range Histogram Constructions
Management Information Systems, Sixth Edition
Page 1 Integrating Multiple Data Sources using a Standardized XML Dictionary Ramon Lawrence Integrating Multiple Data Sources using a Standardized XML.
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Information Retrieval in Practice
Xyleme A Dynamic Warehouse for XML Data of the Web.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Smart Templates for Chemical Identification in GCxGC-MS QingPing Tao 1, Stephen E. Reichenbach 2, Mingtian Ni 3, Arvind Visvanathan 2, Michael Kok 2, Luke.
Learning to Extract Form Labels Nguyen et al.. The Challenge We want to retrieve and integrate online databases We want to retrieve and integrate online.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 5 Understanding Entity Relationship Diagrams.
Integrating data sources on the World-Wide Web Ramon Lawrence and Ken Barker U. of Manitoba, U. of Calgary
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. WSMX Data Mediation Adrian Mocan
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Testing an individual module
Page 1 Multidatabase Querying by Context Ramon Lawrence, Ken Barker Multidatabase Querying by Context.
Automatic Data Ramon Lawrence University of Manitoba
Sangam: A Transformation Modeling Framework Kajal T. Claypool (U Mass Lowell) and Elke A. Rundensteiner (WPI)
March 2000 Gio XIT 1 Increasing the Precision of Semantic Interoperation Gio Wiederhold Stanford University March 2000 report: www-db.stanford.edu/pub/gio/1999/miti.htm.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Overview of Search Engines
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
McGill University Proposal Exam School of Computer Science Ph.D. Candidate in the Modelling, Simulation and Design Lab Eugene Syriani.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
Carlos Lamsfus. ISWDS 2005 Galway, November 7th 2005 CENTRO DE TECNOLOGÍAS DE INTERACCIÓN VISUAL Y COMUNICACIONES VISUAL INTERACTION AND COMMUNICATIONS.
L/O/G/O Metadata Business Intelligence Erwin Moeyaert.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Help Desk System How to Deploy them? Author: Stephen Grabowski.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
DATA-DRIVEN UNDERSTANDING AND REFINEMENT OF SCHEMA MAPPINGS Data Integration and Service Computing ITCS 6010.
1 Relational Expressions Relational expressions: –Expressions that compare operands –Sometimes called conditions –Evaluated to yield a result –Typically.
Development Process and Testing Tools for Content Standards OASIS Symposium: The Meaning of Interoperability May 9, 2006 Simon Frechette, NIST.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Advanced topics in software engineering (Semantic web)
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
Algorithmic Detection of Semantic Similarity WWW 2005.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Progress Report (Concept Extraction) Presented by: Mohsen Kamyar.
S calable K nowledge C omposition Ontology Interoperation January 19, 1999 Jan Jannink, Prasenjit Mitra, Srinivasan Pichai, Danladi Verheijen, Gio Wiederhold.
Sept. 27, 2002 ISDB’02 Transforming XPath Queries for Bottom-Up Query Processing Yoshiharu Ishikawa Takaaki Nagai Hiroyuki Kitagawa University of Tsukuba.
Dec. 13, 2002 WISE2002 Processing XML View Queries Including User-defined Foreign Functions on Relational Databases Yoshiharu Ishikawa Jun Kawada Hiroyuki.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Rigorous Testing by Merging Structural and Behavioral UML Representations Presented by Chin-Yi Tsai.
SEESCOASEESCOA SEESCOA Meeting Activities of LUC 9 May 2003.
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Information Retrieval in Practice
Logical Database Design and the Rational Model
Best pTree organization? level-1 gives te, tf (term level)
Development of the Amphibian Anatomical Ontology
Web Ontology Language for Service (OWL-S)
Tools For Resolving Heterogeneity Computer Science Department
Luís Ferreira Pires Dick Quartel Remco Dijkman Marten van Sinderen
Data Model.
[jws13] Evaluation of instance matching tools: The experience of OAEI
Information Networks: State of the Art
Presentation transcript:

Ontology Maintenance with an Algebraic Methodology: a Case Study Jan Jannink, Gio Wiederhold Presented by: Lei Lei

Challenges  Obstacle: Autonomy of diverse knowledge sources  Data volatility and amount increases cost  Major challeges: Establish and maintain application specific portion of knowledge sources

An Algebraic Approach  Construct virtual knowledge bases geared to a specific application  Use composable operators to transform contexts into contexts  Operators express relevant parts of a source and the conditions using rules  Rules define a valid context transformation

On-line Dictionary:Webster  Autonomously maintained to develop a novel thesaurus application  120,000 entries, two million words  Semi-annual updates  Errors and inconsistencies help robustness

Target Application  Construct a graph of the definitions to determine related terms, and automatically generate thesaurus entries

Related Work  Ontology composition (Wiederhold 1994)  Rule-based approach to semantic integration (Bergamaschi et al. 1999)  Semantic reconciliation (Siegel 1991)  Uschold et al  Specification morphisms, (Smith 1993)  WordNet system (Miller & al. 1990)  WHIRL (Cohen 1998)  PageRank (Page&Brin 1998)  Latent semantic indexing (Deerwester 1990)  Hypertext authority (Kleinberg 1998)

Outline  Algebra Usage Scenario  Background  Context Creation  Ontology Maintainance  Future Work  Conclusion  My Evaluation

Typical Algebra Usage Scenario A minimal sufficient set of Linkage between items in different resources

Background  Algebraic Operators Canonical unary to establish and refine a context within which the source knowledge meets the application requirement

Background(Cont.)  Semantic Context * No global notion of consistency * Defined as objects that encapsulate other objects * Congruity: relevance of source info. to application * Similarity: equivalent and mergeable objects between different sources

Rule Language(Cont.)  Allow uninterpreted components of an object to become attributes of the object  Constructors: create new objects  Constructors: generate proxy objects  Editors & convertors: modify the objects

Object Model(Cont.)  Subsume existing models  Only objects have an identity to which others can refer  Correspond to XML supplemented with obj. identity  Rich to model complex relationship

Context Creation  Summarize Operator (S operator) Transforms source data based on a predicate Create object: Encapsulates & populates Data classification: Groups source into equivalent classes Syntax: (given contexts c1,c2, a matching rule e)

Context Creation(Cont.)  1.Predicate e partitions the objects of c1 into n equivalent parts  2. c2 consists of n+1 values: e,s1,s2,…,sn  3.One is an exception class, not match e

Example with Webster’s Dictionary  Automatic Thesaurus Extraction from Dictionary

Example(Cont.)  Construct a directed graph from definition: 1.Each head word and definition grouping is a node 2.Each word in a definition node is an arc to the node having that head word  Definition from the dictionary data for Egoism

Context Creation(Cont.) *Syllable and accent markers in head words *Misspelled head words *Mis-tagged fields *Stemming and irregular verbs(Hopelessness) *Common abbreviations in definitions(etc.) *Undefined words with common prefixes(un-) *Multi-word head words(Water Buffalo) *Underfined hyphenated and compound words Set 99% accuracy in the conversion from data to graph stru.

Constructing the Congruity Expression  An object that represents the entire source  Subdivided into chunks One head word One definition  Express congruity relationship between the dictionary and thesaurus application

Ontology Maintenance  Context Refinement  Return the ten longest head words of the dictionary

Maintaining the Ontology  Changes in source help correct and extend dict.  Maintain statistics with the S operator when extracting the relevant parts of the dictionary Find no longer needed rules  Note which rules no longer needed  A comparison of the terms reveals new errors

Future Work  A web based interface to display ArcRank algorithm based on PageRank (

Conclusion  An on-line dictionary is good test-bed  An algebraic approach improving maintainability  Congruity simplified identification and handling of changes  Use Summarize to define and refine a context that prepare the dictionary data for thesaurus service use

My Assessment  Strength * Decouple the selection of congruent parts of the source data *Congruity and similarity measure use algebra rather than single language *Mirror classes using operators of the algebra instead of low level abstract primitives that are difficult to compose  Weakness * Details of ci’=S(ci) are needed *Difficult to grasp the capability of S operator *Accuracy and error accumulation problem *Ambiguous Rules Generation

Questions?