Presented by: Rhea McCaslin Dhaval Salvi

Slides:



Advertisements
Similar presentations
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Advertisements

First-Order Logic (and beyond)
Introducing Formal Methods, Module 1, Version 1.1, Oct., Formal Specification and Analytical Verification L 5.
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Answering complex questions and performing deep reasoning in advance question answering systems Chitta Baral 1, Michael Gelfond 2 and Richard Scherl 3.
C O N T E X T - F R E E LANGUAGES ( use a grammar to describe a language) 1.
Using Link Grammar and WordNet on Fact Extraction for the Travel Domain.
Recognizing Textual Entailment Challenge PASCAL Suleiman BaniHani.
Background information Formal verification methods based on theorem proving techniques and model­checking –to prove the absence of errors (in the formal.
Outline Recap Knowledge Representation I Textbook: Chapters 6, 7, 9 and 10.
1 Introduction to Computability Theory Lecture12: Reductions Prof. Amos Israeli.
Introduction to Computability Theory
Question Answering with deep reasoning Chitta Baral, Arizona State U. Michael Gelfond, Texas Tech U. Richard Scherl, Monmouth Univ.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 7: Expert Systems and Artificial Intelligence Decision Support.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Let remember from the previous lesson what is Knowledge representation
Lesson 6. Refinement of the Operator Model This page describes formally how we refine Figure 2.5 into a more detailed model so that we can connect it.
© 2005 by Prentice Hall Chapter 9 Structuring System Requirements: Logic Modeling Modern Systems Analysis and Design Fourth Edition.
Information Extraction from Documents for Automating Softwre Testing by Patricia Lutsky Presented by Ramiro Lopez.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
9/8/20151 Natural Language Processing Lecture Notes 1.
Notes for Chapter 12 Logic Programming The AI War Basic Concepts of Logic Programming Prolog Review questions.
Artificial intelligence project
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
An Intelligent Analyzer and Understander of English Yorick Wilks 1975, ACM.
Pattern-directed inference systems
Logical Agents Logic Propositional Logic Summary
Slide 1 Propositional Definite Clause Logic: Syntax, Semantics and Bottom-up Proofs Jim Little UBC CS 322 – CSP October 20, 2014.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Formal Specification of Intrusion Signatures and Detection Rules By Jean-Philippe Pouzol and Mireille Ducassé 15 th IEEE Computer Security Foundations.
Semantic Construction lecture 2. Semantic Construction Is there a systematic way of constructing semantic representation from a sentence of English? This.
Key Concepts Representation Inference Semantics Discourse Pragmatics Computation.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Semantics In Text: Chapter 3.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Artificial Intelligence: Natural Language
Copyright © Cengage Learning. All rights reserved.
Automated Reasoning Early AI explored how to automated several reasoning tasks – these were solved by what we might call weak problem solving methods as.
Some Thoughts to Consider 8 How difficult is it to get a group of people, or a group of companies, or a group of nations to agree on a particular ontology?
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.
1 Reasoning with Infinite stable models Piero A. Bonatti presented by Axel Polleres (IJCAI 2001,
© 2005 by Prentice Hall Chapter 9 Structuring System Requirements: Logic Modeling Modern Systems Analysis and Design Fourth Edition Jeffrey A. Hoffer Joey.
Formal Verification. Background Information Formal verification methods based on theorem proving techniques and model­checking –To prove the absence of.
LDK R Logics for Data and Knowledge Representation Propositional Logic Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto Giunchiglia,
EEL 5937 Content languages EEL 5937 Multi Agent Systems Lecture 10, Feb. 6, 2003 Lotzi Bölöni.
Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
Knowledge Engineering. Sources of Knowledge - Books - Journals - Manuals - Reports - Films - Databases - Pictures - Audio and Video Tapes - Flow Diagram.
Logics for Data and Knowledge Representation ClassL (part 1): syntax and semantics.
COMP 412, FALL Type Systems C OMP 412 Rice University Houston, Texas Fall 2000 Copyright 2000, Robert Cartwright, all rights reserved. Students.
Artificial Intelligence Logical Agents Chapter 7.
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Logical architecture refinement
Chapter 9 Structuring System Requirements: Logic Modeling
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Chapter 9 Structuring System Requirements: Logic Modeling
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Representations & Reasoning Systems (RRS) (2.2)
Presentation transcript:

Presented by: Rhea McCaslin Dhaval Salvi Knowledge Representation and Question Answering Marcello Balduccini, Chitta Baral, Yuliya Lierler Presented by: Rhea McCaslin Dhaval Salvi

Outline Introduction English  Logical Theories Systems COGEX DD ASU QA System Textual Entailment Mueller and Event Calculus Conclusion

Introduction PROBLEM: Given a large body of documents, how do you find answers to questions based on the information in these documents Information Retrieval (IR) Most IR systems cannot reason Database Querying Query a database using a database query language Question Answering (QA) System has increasing number of documents (most accommodate for) Queries in natural language

Question Answering System

Knowledge Representation in QA Text John and Mike took a plane from Paris to Baghdad. On the way, the plane stopped in Rome, where John was arrested Questions Where is Mike at the end of this trip? Where is John at the end of this trip? Where is the plane at the end of this trip? Where would John be if he was not arrested? Analysis Baghdad Rome why

Knowledge Representation in QA Text John took the plan from Paris to Baghdad. He planned to meet his friend Mike, who was waiting for him there. Questions Did John meet Mike? Analysis Yes why

Knowledge Representation in QA Text John is in Boston on Dec 1. He has no passport. Questions Can he go to Paris on Dec. 4? Analysis No why

Approaches Select relevant text from a selection of relevant documents Logic Form Information Extraction Select relevant text from a selection of relevant documents Convert text to logical theory The logical theory, domain and commonsense knowledge form the Knowledge Base (KB) Convert Question to a logic form, pose it against the KB, and use a theorem prover Select relevant text from a selection of relevant documents Classifier is used to determine the text to extract Extracted text is added to the domain and commonsense knowledge (KB) Translate question to the logical language of the KB and pose against it There is also a mixed approach, where logic forms are used in information extraction. Process the logic forms to obtain the relevant facts and then start at step 2

English  Logical Theories Identify syntactic structure of sentence “parse tree” Assign logic encoding to the lexical items Describe how logical representations of sub-parts of the discourse are to be combined in the representation of larger parts of it Convert English text to a logical representation and then use a reasoning system to reason with the resulting logical thoery.

Lambda calculus Introduce a new binding operator λ Variables bound by λ intuitively specify where each substitution occurs A woman walks : an element of the class “woman” “walks” λx.plane(x) λx.plane(x) @ boeing767 λw.λz.Ǝy.(w @ y ˄ z @ y) Once x is bound to a variable it will be used as the argument of plane Is equivalent to plane(boeing767) “a woman walks” Representation of “a” is parameterized by the class, w, and the property, z, of the object y

Lambda calculus Obtain representation of “a plane” by combining the encoding of “a” with the one of “plane” λw.λz.Ǝy.(w @ y ˄ z @ y) @ λx.plane(x) = λz.Ǝy.(λx.plane(x) @ y ˄ z @ y) = λz.Ǝy.(plane(y) ˄ z @ y) Assumption that the noun phrase is followed by a verb phrase (placeholder z for verb)

Lambda calculus (place holders) “John” λu.(u @ john) “ John did f ” λu.(u @ john) @ λx.f(x)  λx.f(x) @ john f (john)

COGEX Logic Prover Most systems use the approach of detecting matching patterns between the Question and Documents (textual sources) Textual sources are called candidate answers Easy to find fragments of text that possibly contain the answer, but difficult to associate with them a ranking for the best answers Candidate answers can be conflicting

COGEX Logic Prover LCC AQ system extended with a prover called COGEX Analyzes the connection between question and candidate answers Question “Did John visit New York City on Dec, 1?” Required Knowledge “New York City” and “City” are same location “flying to a location” implies the location will be visited Data Sources “John flew to the City on Dec, 1” “In the morning of Dec, 1, John went down memory lane to his trip to Austrailia”

WordNet and Glosses Knowledge is extracted from WordNet World knowledge: meaning of “fly(to a location)” Natural language knowledge: link between “New York City” and “City” Glosses - descriptions of the meaning of words Glosses are collected and mapped into logic forms {word, gloss_LLF}

WordNet and Glosses

WordNet and Glosses Connect “New York City” and “City” Interchangeable prepositions

Prover Task For each candidate answer: Refute the negation of the question using candidate answer and KB Success – correct answer has been identified Failure – relax the question and find new proof Relaxation System keeps track of how many relaxation steps are needed to find a proof The higher the value, the farther apart they are

LCC style logic forms (LLF) In LLF approach, a triple <base, pos, sense> is associated with every noun, verb, adjective, adverb, conjunction and preposition where base is the base form of the word, pos is its part of speech, and sense is the word’s sense in the classification found in WordNet database base_pos_sense(c,a0,…,an) “John takes a plane” John_NN(x1), take_VB_11(e1,x1,x2), Plane_NN_1(x2)

LCC style logic forms (LLF) In LLF approach, logic forms are also augmented with named- entity tags, based on lexical chains among concepts. Lexical chains are sequences of concepts such that adjacent concepts are connected by an hypernymy relation (is-a relation) Lexical chains allow to add to logic forms information implied by the text, but not explicitly stated Logic form of “John takes a plane” contains a named entity tag: human_NE(x1) Stating that John(part of the sentence denoted by x1) is a human being. Named entity tag is derived from the lexical chain connecting name “John” to concept “human(being)”.

LCC style logic forms (LLF) A recent extension is to further augment the logic forms by means of semantic relations. Semantic relations are relations between two words or concepts that provide a somewhat deeper description of the meaning of the text. More than 30 different types of semantic relations have been identified. Possession (POS_SR(X,Y)): X is a possession of Y. Agent ( AGT_SR(X,Y)): X performs or causes the occurrence of Y. Location,Space,Direction (LOC_SR(X,Y)): X is the location of Y. Manner (MNR_SR(X,Y)): X is the way in which event Y takes place. “John takes a plane” : AGT_SR(x1,e1)

DD Question Answering system A QA system capable of deep reasoning. DD system focuses on answering questions in natural language about the evolution of dynamic domains and is able to answer questions such as : Its particular focus is on travel and trips. Reasoning about narratives Predictive reasoning Planning Counterfactual reasoning Reasoning about intentions

DD Question Answering system In DD the behavior of dynamic domains is modeled by transition diagrams. Directed graphs whose nodes denote states of the domain and whose arcs, labeled by actions, denote state transitions caused by the execution of those actions. The theory encoding a domain’s transition diagram is called here model of the domain. The language of choice for reasoning in DD is AnsProlog (A-Prolog) Problem solving tasks are reduced to computing models, called answer sets, of suitable AnsProlog programs.

Overall Architecture of DD system Approach followed in DD system for understanding natural language consists of translating the natural language discourse into its semantic representation in the following steps: Collecting facts describing the semantic content of the discourse and a few linking rules. The task of answering queries is then reduced to performing inference on the theory consisting of the semantic representation and model of the domain

Overall Architecture of DD system Given a discourse H in natural language, describing a particular history of the domain, and a question Q, as well in the natural language, the DD system Obtains Logic forms for H and Q. Translates the logic forms for H and Q into a Quasi-Semantic Representation (QSR), consisting of AnsProlog facts describing properties of the objects of the domain and the occurrences of events that alter such properties.

Overall Architecture of DD system Given a discourse H in natural language, describing a particular history of the domain, and a question Q, as well in the natural language, the DD system 3. Maps the QSR into an Object Semantic Representation (OSR), a set of AnsProlog atoms which describe the contents of H and Q using the relations with which the domain model is encoded.. 4. Computes the answer sets of the AnsProlog program consisting of the OSR and the model of the domain and extracts the answer(s) to the question from such answer sets.

An example of the DD system Consider a history H consisting of the sentences “John is in Paris. He packs the laptop in the carry-on luggage and takes a plane to Baghdad,” a query, Q, “Where is the laptop at the end of the trip?”

An example of the DD system The first step consists in obtaining logic forms for H and Q. For H, the LLF generator returns the following logic form, Hlf :

An example of the DD system The logic form, Qlf , for Q is:

An example of the DD system The second step of the process consists in deriving the QSR from Hlf and Qlf . Atoms of the form true_at(FL,M) are used in the QSR to state that property FL is true at the moment M of the evolution. The phrase corresponding to Is encoded in QSR as : QSR of phrase associated with: Is:

An example of the DD system A default temporal sequence of the moments in the evolution of the domain is extracted from Hlf by observing the order in which the corresponding verbs are listed in the logic form. Finally the QSR of Qlf is obtained by analyzing the logic form to identify the property that is being queried The QSR for Hlf contains facts: All the information is condensed in the QSR:

An example of the DD system The next step consists in mapping the QSR relations to the domain relations. Since the translation depends on the formalism used to encode the transition diagram, the task is accomplished by an interface module associated with the domain model. The rules of the interface module are called Object Semantic Representation rules (OSR rules for short).

An example of the DD system The domain model used in our example is the travel domain, a commonsense formalization of actions involving travel. Two main relations used in formalization are : h which stands for holds and states which fluents hold at each time point o which stands for occurs and states which actions occur at each time point.

An example of the DD system Atoms of the form true_at(FL,M) from the QSR are mapped into domain atoms by the rule: The mapping of relation eventually_true, used in the QSR for the definition of relation answer_true, is: Since OSR rules are written in AnsProlog, the compuation of OSR can be combined with the task of finding the answer given the OSR. A convenient way of extracting the answer when SMODELS is used as inference engine is to add following two directives to AnsProlog program:

An example of the DD system As expected, for our example the SMODELS returns

ASU QA system: In ASU QA system to extract the relevant facts from sentences, Link Grammar is used to parse the sentence so that dependent relations between pairs of words are obtained. Such dependent relations are known as links. From the links between pairs of words, a simple algorithm is then used to generate AnsProlog facts Input: Pairs of words with their corresponding links produced by the Link Grammar parser. Output: AnsProlog facts.

ASU QA system algorithm

ASU QA system example “The train stood at the Amtrak station in Washington DC at 10:00 AM on March 15, 2005”

ASU QA system example The following facts are extracted based on the Link Grammar output:

ASU QA system example After extracting the facts from the sentences, it is necessary to assign the correct meanings of nouns and verbs with respect to the sentence. The process of identifying the types utilizes WordNet hypernyms. Word a is a hypernym of word b if a has a “is-a” relation with b. In the travel domain, it is essential to identify nouns that are of the types transportation (denoted as tran) or person (denoted as person). Such identification is performed using predefined sets of hypernyms for both transportation and person. The predefined sets of hypernyms of transportation and person are: Htran = {travel, public transport, conveyance} and Hperson = {person}. For instance, the hypernym of the noun train is conveyance. So we assign a AnsProlog fact transportation(train).

ASU QA system example A similar process is performed for each extracted verb by using the hypernyms of WordNet. Using the extracted facts together with verbs and nouns with their correct senses, reasoning is then done with an AnsProlog background knowledge base similar to the one in the DD system described earlier.

Recognizing Textual Entailment (RTE) PROBLEM: Given a text and a hypothesis, decide whether a human reasoner would call the hypothesis a consequence of the text Recognize “being a late wife” is that same thing as “being a widow” Text Yoko Ono unveiled a statue of her late husband, John Lennon. Hypothesis Yoko Ono is John Lennon’s widow. Expected Entailment Yes

Recognizing Textual Entailment (RTE) Approach : use first-order reasoning tools to check whether the hypothesis can be derived from the text and relevant background knowledge Nutcracker system implemented by Bos and Markert

Nutcracker - Algorithm Text and Hypothesis are first represented by discourse representation structures and then by first-order formulas T and C, respectively potentially relevant background knowledge is identified and expressed by a first-order formula BK An automated reasoning system, first-order logic theorem prover or model builder, is used to check whether the implication T ˄ BK → C is logically valid

“Yoko Ono unveiled a statue of her late husband, John Lennon” Step 1 of algorithm “Yoko Ono unveiled a statue of her late husband, John Lennon” a_,n_,v_,r_,p_ equals adjective, noun, verb, relation, person Ǝx y z e (p_ono(x) ˄ p_yoko(x) ˄ r_of(z,x) ˄ n_statue(y) ˄ r_of(y,z) ˄ a_late(z) ˄ n_husband(z) ˄ p_lennon(z) ˄p_john(z) ˄ n_event(e) ˄ v_unveil(e) ˄ r_agent(e,x) ˄ r_patient(e,y))

Step 2 of algorithm Background knowledge (BK) Domain specific: ∀x y (n_husband(x) ˄ a_late(x) ˄ r_of(x,y) → (n_widow(y) ˄ r_of(y,x))) Generic knowledge ∀e x y (n_event(e) ˄ r_agent(e,x) ˄ f_in(e,y) → f_in(x,y))

Step 3 of algorithm if a theorem prover finds a proof for the formula, Nutcracker concludes that Text entails Hypothesis if a theorem prover finds a proof for the formula ¬(T ˄ BK) ˄ C, then Nutcracker concludes that Text does not entail the Hypothesis due to the fact that they are inconsistent if a model builder finds a model for the negation of the formula T ˄ BK ˄ ¬C, then the system concludes that there is no entailment

Mueller and Event Calculus Mueller developed a technique for obtaining a semantic representation of the discourse Uses Event Calculus

Example “Bogota, 15 Jan 90—In an action that is unprecedented in Columbia’s history of violence, unidentified persons kidnapped 31 people in the strife-torn banana-growing region of Uraba, the Antiouqia governor’s office reported today. The incident took place in Puerto Bello, a village in Turbo municipality, 460 Km northwest of Bogota.” Templates descriptions of the events consisting of frames with slots and slot fillers Scripts scripts determine the type of commonsense knowledge the reasoner will use

Example (Template) 0. MESSAGE: ID DEV-MUC3-00040 (NNCOSC) 1. MESSAGE: TEMPLATE 1 2. INCIDENT: DATE – 15 JAN 90 3. INCIDENT: LOCATION COLUMBIA: URABA (REGION) TURBO(MUNICIPALITY):PUERTO BELLO(VILLAGE) 4. INCIDENT: TYPE KIDNAPPING […] 8. PERP: INCIDENT CATEGORY TERRORIST ACT 9. PERP: INDIVIDUAL ID “UNIDENTIFIED PERSONS”/[…] 19: HUM TGT: NAME – 20: HUM TGT: DESCRIPTION: “VILLAGERS”

Example (Script) Initially the human targets are at a first location and the perpetrator is at a second location Initially the human targets are alive, calm, and uninjured The perpetrator loads a gun The perpetrator walks to the first location The perpetrator threatens the human targets with the gun The perpetrator walks to the second location with the human targets For each target: If the effect on the human target is death, the perpetrator shoots the human target resulting in death Otherwise, if the effect is injury, the perpetrator shoots the human resulting in injury Otherwise, if the effect is regained freedom, the human leaves the building and walks back to the first location

Event Calculus % An object can be only in one location at a time Reasoner performs inferences on the theory formed by the problem and the commonsense knowledge % An object can be only in one location at a time HoldsAt(At(object, location1), time) ˄ HoldsAt(At(object, location2), time) → location1 = location2 % For an actor to activate a bomb, he must be holding it Happens(BombActivate(actor, bomb), time) → HoldsAt(Holding(actor, bomb), time)

Event Calculus Was actor “a” present when event “e” occurred? If for every time point t at which e occurs, the locations of a and the actor of e are the same, the answer is “yes” If for every time point t at which e occurs, the two locations are not the same, the answer is “no” Otherwise, the answer is “some of the times” Was fluent f true before event e occurred If f is true for all time points less than or equal to t, the answer is “yes” If f is false for all time points less than or equal to t, the answer is “no”

Conclusion To answer natural language questions posed with respect to natural language text, one either needs to develop a reasoning engine directly in natural language or needs a way to translate natural language to a formal language for which reasoning engines are available. we discussed two approaches to go from natural language to a formal representation. The first approach converts natural language to particular representations in classical logic.

Conclusion We discussed two such attempts: One does a syntactic parsing of the text, disambiguates the meaning of sentences using WordNet, creates a logic form, and uses a specialized reasoning engine (COGEX) Second uses parsing but does not disambiguate, constructs first-order representations of knowledge and then uses first- order reasoning tools. (Nutcracker)

Conclusion The second approach extracts relevant facts from the natural language. We discussed three such attempts: One that obtains relevant facts from the logic form mentioned earlier (DD) Second that uses the semantic parser Link Grammar, the WordNet database and background knowledge to obtain relevant facts. (ASU) Third that uses an information extraction system to fill slots in templates (Mueller)

Conclusion What is needed is a community wide effort to build a knowledge repository that is open and to which anyone can contribute. Some of the issues are: Which formal language(s) should be used by the community. How do we organize knowledge modules and how do we figure out which modules are needed to answer a particular question with respect to a particular text collection? If more than one logic needs to be used how do modules in different logics interact seamlessly?

Thank You