Using Non-Taxonomic Knowledge to Improve Semantic Matching Peter Yeh July 22, 2003.

Slides:



Advertisements
Similar presentations
Ontology Assessment – Proposed Framework and Methodology.
Advertisements

BAH DAML Tools XML To DAML Query Relevance Assessor DAML XSLT Adapter.
Level 1 Recall Recall of a fact, information, or procedure. Level 2 Skill/Concept Use information or conceptual knowledge, two or more steps, etc. Level.
Chapter 5: Introduction to Information Retrieval
Conceptual Graph Analysis Chapter 20 Lori Nuth | EDIT 730 | Fall 2005.
Task Analysis EDU 553 – Principles of Instructional Design Dr. Steve Broskoske.
FCA-MERGE: Bottom-up Merging of Ontologies
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
A Framework for Ontology-Based Knowledge Management System
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Chapter 6 Methodology Conceptual Databases Design Transparencies © Pearson Education Limited 1995, 2005.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
Queensland University of Technology An Ontology-based Mining Approach for User Search Intent Discovery Yan Shen, Yuefeng Li, Yue Xu, Renato Iannella, Abdulmohsen.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Methodology Conceptual Database Design
Ontology translation: two approaches Xiangkui Yao OntoMorph: A Translation System for Symbolic Knowledge By: Hans Chalupsky Ontology Translation on the.
12 -1 Lecture 12 User Modeling Topics –Basics –Example User Model –Construction of User Models –Updating of User Models –Applications.
Modeling & Designing the Database
Evaluating Ontology-Mapping Tools: Requirements and Experience Natalya F. Noy Mark A. Musen Stanford Medical Informatics Stanford University.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Chapter 10 Architectural Design
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
1 Introduction to Modeling Languages Striving for Engineering Precision in Information Systems Jim Carpenter Bureau of Labor Statistics, and President,
Chapter 14: Artificial Intelligence Invitation to Computer Science, C++ Version, Third Edition.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
SLB /04/07 Thinking and Communicating “The Spiritual Life is Thinking!” (R.B. Thieme, Jr.)
 Knowledge Acquisition  Machine Learning. The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
Methodology Conceptual Databases Design
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
The Revised Bloom’s Taxonomy (RBT): Improving Curriculum, Instruction, and Assessment in an Accountability-Driven, Standards-Based World Developed and.
Subject (Exam) Review WSTA 2015 Trevor Cohn. Exam Structure Worth 50 marks Parts: – A: short answer [14] – B: method questions [18] – C: algorithm questions.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
Methodology: Conceptual Databases Design
Methodology - Conceptual Database Design
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Component Based SW Development and Domain Engineering 1 Component Based Software Development and Domain Engineering.
Finding Semantic Matches Between Conceptual Graphs University of Texas, Austin May 14, 2002.
1 Everyday Requirements for an Open Ontology Repository Denise Bedford Ontolog Community Panel Presentation April 3, 2008.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Generic Tasks by Ihab M. Amer Graduate Student Computer Science Dept. AUC, Cairo, Egypt.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Generating Query Substitutions Alicia Wood. What is the problem to be solved?
Approach to building ontologies A high-level view Chris Wroe.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Efficient Rule-Based Attribute-Oriented Induction for Data Mining Authors: Cheung et al. Graduate: Yu-Wei Su Advisor: Dr. Hsu.
1 A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Methodology Conceptual Databases Design
Cross-Ontological Relationships
Methodology Conceptual Database Design
CCNT Lab of Zhejiang University
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Architecture Components
Associative Query Answering via Query Feature Similarity
Introduction Artificial Intelligent.
OWL-S: Experiences and Directions, 6th of June, Austria, 2007
CSc4730/6730 Scientific Visualization
[jws13] Evaluation of instance matching tools: The experience of OAEI
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
Methodology Conceptual Databases Design
Presentation transcript:

Using Non-Taxonomic Knowledge to Improve Semantic Matching Peter Yeh July 22, 2003

Talk Outline Introduction Analysis of Existing Techniques Our Approach Initial Evaluation Proposed Work

Introduction Many AI tasks require determining whether two knowledge representations encode the same knowledge.

Information Retrieval Match queries with documents. Q: “A car with a bumper made of gold.” Car BumperGold has-part material Car Gold material Car Acme Produce agentobject Car BumperGold has-part material A: “Acme makes a car made of Gold.”

Knowledge Acquisition Match new knowledge with existing knowledge. KB KB: Are you trying to encode a conversion? MicrobePollution agentobject DestroyCreatecauses Food result MicrobePollution agentobject DestroyCreatecauses Food result New Knowledge Conversion next-event agentobject DestroyCreate result agent subevent Entity Existing Knowledge

Rule-based Classification Match rule antecedents with working memory. For example, Course of Action (COA) critiquing. Attack Delay Military-Unit agent object causes Military-Unit IF THEN AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Pattern COA Attack Delay Military-Unit agent object causes Military-Unit “This COA has a rating of good for enemy maneuver engagement.”

The Core Problem Solving this matching problem is hard because multiple encodings of the same knowledge rarely match exactly. Representations don’t match exactly because: –Expressive Ontology. –Knowledge is encoded by different sources. –Knowledge being encoded is complex.

Types of Mismatches Informal examination of a knowledge-base containing: –Patterns. –COAs. Knowledge-base was built by two Subject Matter Experts (SMEs) participating in DARPA’s RKF project. Looked for cases of mismatch.

Types of Mismatches (cont.) Taxonomic Differences “an armored brigade engaging an armored battalion.”

Types of Mismatches (cont.) Taxonomic Differences Equivalent Alternatives “One military unit attacking another unit.”

Types of Mismatches (cont.) Taxonomic Differences Equivalent Alternatives Omissions “Mechanized infantry brigade engaging mechanized infantry battalion.”

Types of Mismatches (cont.) Taxonomic Differences Equivalent Alternatives Omissions Granularity “Support attack occurs before main attack.”

Analysis of Existing Techniques Analogy Inexact Matching Semantic Matching Conceptual Indexing Ontology Merging

Analogy Analogy: mapping of knowledge from a base domain to a target domain. Structure Mapping Engine (Forbus et. al. 89): –Maps relational knowledge (mappable systems). –Systematicity Principle used to select best analogy. Analogy based on common generalizations (Leishman 92) –Maps both relational knowledge and object attributes. –Prefers minimal common generalization.

Analogy: Structure Mapping Engine Attack Delay Military-Unit agent object causes Military-Unit agent object causes AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes agent object agent causes agent object agent causes agent object agent causes agent object agent causes agent object agent causes AttackBlock Artillery-Unit Armor-Unit agent object agent causes Block Delay Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit

Inexact Matching Inexact Matching: tries to address mismatches between representations Graph Editing (Tsai et. al. 83, Shapiro and Haralick 81, Messmer et. al. 93, Wolverton et. al. 2003) –Uses edit distance parameters. –Similarity based on shortest sequence of edits. Partial Matching –Does not require representations to be isomorphic. –Similarity based on amount of structural overlap. –Minimal Common Supergraph (Bunke et. al. 2000) and Maximal Common Subgraph (Bunke and Shearer 98).

Inexact Matching: MCS Attack Delay Military-Unit agent object causes Military-Unit AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Block Delay Artillery-Unit Armor-Unit agent object agent causes AttackBlock Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit

Semantic Matching Semantic Matching: uses knowledge to match representations. Projection: –Uses taxonomic knowledge. –Ontoseek (Guarino et. al. 99) and ELEN (Huibers et. al. 96). Projection+: Projection alone is too restrictive –  -projection (Genest and Chein 97). –Common generalization, graph splitting, regular expressions (Fargues 92, Buche et. al. 2000, Martin et. al. 2001). Semantic Overlap –Maximal Joins and Generalizations (Myaeng 92, Poole et. al. 95). –Shared Semantic Structures (Zhong et. al. 2002).

Semantic Matching: Semantic Overlap Attack Delay Military-Unit agent object causes Military-Unit AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Artillery-Unit Armor-Unit agent object Attack Delay Artillery-Unit Armor-Unit agent object Attack Delay Military-Unit agent object causes Military-Unit

Conceptual Indexing Conceptual indexing: how to organize and index knowledge. Requires so form matching. Generalization hierarchy (Bournard et. al. 95, Ellis 92, Levinson 82, Woods 97). –Knowledge indexed by common generalizations. –Generalizations organized hierarchically by subsumption relationships. –Retrieve Most Specific Subsumer (MSS) of a query. Match procedure is similar to Projection - suffers the same problems.

Ontology Merging and Translation Ontology Merging: merge multiple ontologies built by different sources –Chimaera (McGuinness et. al. 2000) –SMART (Noy and Musen 99). Ontology Translation: translates a representation from one language to another –Ontomorph (Chalupsky 2000). Goals are different but share some of the same problems.

Our Approach The goal of this research is to solve the matching problem. We believe existing semantic approaches can be extended with additional knowledge to significantly improve matching. What kinds of additional knowledge? –Transformations Handle mismatches. Improve matching. –Not taxonomic knowledge.

Our Approach (cont.) Generality and domain-independence. –Want additional knowledge (e.g. Transformations) to be useful across domains. We believe domain-independence is possible given a reusable domain-neutral upper ontology. –Contains a small set of general concepts. –SMEs use this upper ontology to build KBs on specialized topics (e.g. chemistry, biology, battle space planning). –No training in logic or knowledge representation.

Illustration of Our Framework Transformations Ontology KB KE SME/KE KB can be viewed as a domain-specific matcher (e.g. match symptoms to diseases). Domain-independent KB for the task of matching.

Our Prototype Extend semantic matchers with transformations. Apply transformations in a forward-chaining manner. Use existing techniques for reasoning with Conceptual Graphs (Corbett et. al. 99, Salvat et. al. 96, Willems 95): –Projection. –Unification. –Graph rules. Two caveats because existing techniques lead to promiscuous matches.

Transformations that Retains Semantics Buyobject agent origin Car Person: Y Person: X Buyobject agent origin Car Person: Y Person: X CarLike Person: X object agent Projection CarLikeobject agent Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob CarSell Person object agent recipientPerson

Transformations that Retains Semantics Buyobject agent origin Car Person: Y Person: X Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob CarSell Person object agent recipientPerson Buyobject agent origin Car Person: Y Person: X CarSell Person: Y object agent Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob Sellobject agent Car Person: John Sellobject agent CarSell Person: Y object agent

Rule Applicability Buyobject agent origin Car Person: Y Person: X CarSell Person: Y object agent Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob Sellobject agent Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob Buyobject agent origin Car Person: Y Person: X Buyobject agent origin Car Person Driving-Licensepossesses

Rule Applicability Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob CarSell Person object agent recipientPerson Buyobject agent origin Car Person: Y Person: X CarSell Person: Y object agent Driving-Licensepossesses Buyobject agent origin Car Person: John Person: Bob Sellobject agent Buyobject agent origin Car Person: Y Person: X CarSell Person: Y object agent

Enumerating Transformations Transformations derived from our domain-neutral upper ontology. Enumerated all ways that a relation can be legally used to encode information in a conceptual graph. Considered whether the same information can be expressed differently. Enumeration was possible because: –Small upper ontology. –Each concept had well-defined semantics.

Transformations Enumerated We were able to enumerate about 300 transformations. Resulting transformations fall into three general categories: –Transitivity –Part Ascension –Transfers Through

Transformations Enumerated (cont.) relationTransitivePart AscensionTransfers Through causesX-subevent, resulting-state caused-byXsubevent-ofresulting-from defeats--- defeated-by-subevent-ofcaused-by enablesX-causes, resulting-state, subevent enabled-byXsubevent-ofcaused-by, resulting-from inhibits-subevent-ofresulting-state inhibited-by-subevent-ofcaused-by, resulting-from by-means-ofX-- means-by-whichX-- prevents-subevent-of- prevented-by-subevent-ofcaused-by, resulting-from resulting-state--causes resulting-from---

Example: Our Approach Attack Delay Military-Unit agent object causes Military-Unit AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes agent object Military-Unit Delay causes Attack Military-Unit Attack Military-Unit Delay Military-Unit Delay 1: 2: 3: 4: 5: agent object causes Attack Delay A: B: C: D: E: Advance agent Block Artillery-Unit Block Armor-Unit object agent BlockArmor-Unit BlockDelay causes Armor-Unit F: G: H: I: l 1 =l 1 = {(1,A)} M = {}{(1,A)}

Example: Our Approach {(1, A)}, {(3,C)}, {(4,D)}, {(5,E)} } M = { A B CD E F G H I AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit A B CD E F G H I AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit A B CD E F G H I AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit A B CD E F G H I AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit A B CD E F G H I AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit

Example: Our Approach AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit Transformations Action causes Action causes Action causes Action causes

Example: Our Approach Transformations Action causes Action causes AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit causes AttackBlock Delay Advance Artillery-Unit Armor-Unit agent object agent causes Attack Delay Military-Unit agent object causes Military-Unit causes

Initial Evaluation Used our matcher in an application in the domain of battle space planning (DARPA's RKF Project). The task is to analyze COAs. Battle space ontology built by extending our upper ontology. Two military analysts used this ontology to build KBs containing: –Patterns. –COAs. Our matcher matched the patterns to COAs.

Example Output

Experiment 1 Evaluates our first hypothesis. –How significant is the improvement? Compared our matcher to: –Maximal Common Subgraph (MCS). –Semantic Search Lite (SSL). Methodology: –300 domain-neutral transformations; 80 domain-specific transformations. –Matched the patterns to the COAs. –A pattern matches a COA if the match score meets or exceeds a pre-specified threshold. –Used metrics of precision and recall.

Experiment 1: Precision

Experiment 1: Recall

Experiment 2 Initial evaluation of our second hypothesis. –Assesses the domain independence of using transformations. Limited - conducted in only one domain, but can still offer some insight. Methodology: –Divided transformations into 2 groups (domain-neutral vs. domain-specific). –Used domain-neutral transformations to construct DN –Used domain-specific transformations to construct DS –Everything else is the same as Experiment 1.

Experiment 2: Precision

Experiment 2: Recall

Proposed Work More Comprehensive Evaluation. Use background knowledge. Incorporate indexing to make matching more efficient.

Comprehensive Evaluation Evaluate our approach in several applications in four domains. Four data sets: –Chemistry (Halo). –Biology (RKF). –Battle Space Planning (RKF). –Office Procedures (EPCA). Three Applications: –Elaboration: Chemistry and Office Procedures. –Question Answering: Biology and Battle Space. –Plan Evaluation: Battle Space and Office Procedures.

Background Knowledge Background Knowledge. Can be used to normalize new knowledge at acquisition time via a join (Mineau et. al. 93). Idea can be applied to matching. –Increase similarity. Two problems: –When should a join be performed? –How to better control the join? Ontology BlockMove object prevents Military-Unit Block

Background Knowledge Background Knowledge. Can be used to normalize new knowledge at acquisition time via a join (Mineau et. al. 93). Idea can be applied to matching. –Increase similarity. Two problems: –When should a join be performed? –How to better control the join? BlockMove object prevents Military-Unit AttackBlock object Military-Unit causes object AttackMove object prevents Military-Unit BlockMove object prevents Military-Unit Attack object Military-Unit Attack Military-Unitobject Move object Military-Unit Move object Military-Unit

Indexing Need indexing to make matching more efficient. A common technique is a generalization hierarchy –Overhead for storage can be expensive. –Finding the MSS can also be expensive. We intend to study: –How to index knowledge by content? –Other index structures that are more parsimonious.