NATURAL LANGUAGE UNDERSTANDING FOR SOFT INFORMATION FUSION Stuart C. Shapiro and Daniel R. Schlegel Department of Computer Science and Engineering Center.

Slides:



Advertisements
Similar presentations
LABELING TURKISH NEWS STORIES WITH CRF Prof. Dr. Eşref Adalı ISTANBUL TECHNICAL UNIVERSITY COMPUTER ENGINEERING 1.
Advertisements

CPSC 422, Lecture 16Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 16 Feb, 11, 2015.
An F-Measure for Context-Based Information Retrieval Michael Kandefer and Stuart C. Shapiro University at Buffalo Department of Computer Science and Engineering.
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Learning Semantic Information Extraction Rules from News The Dutch-Belgian Database Day 2013 (DBDBD 2013) Frederik Hogenboom Erasmus.
Explanation Producing Combination of NLP and Logical Reasoning through Translation of Text to KR Formalisms CHITTA BARAL ARIZONA STATE UNIVERSITY 1 School.
Opinion Mapping Travelblogs Efthymios Drymonas Alexandros Efentakis Dieter Pfoser Research Center Athena Institute for the Management of Information Systems.
LING NLP 1 Introduction to Computational Linguistics Martha Palmer April 19, 2006.
1 Lecture 35 Brief Introduction to Main AI Areas (cont’d) Overview  Lecture Objective: Present the General Ideas on the AI Branches Below  Introduction.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
The GLAIR Architecture for Cognitive Robotics Stuart C. Shapiro Department of Computer Science & Engineering and Center for Cognitive Science.
The GLAIR Cognitive Architecture Stuart C. Shapiro and Jonathan P. Bona Department of Computer Science & Engineering Center for Cognitive Science.
Knowledge Representation for Self-Aware Computer Systems Stuart C. Shapiro Department of Computer Science and Engineering, and Center for Cognitive.
S.C. Shapiro Development of a Cognitive Agent Stuart C. Shapiro Department of Computer Science and Engineering and Center for Cognitive Science.
Inducing Information Extraction Systems for New Languages via Cross-Language Projection Ellen Riloff University of Utah Charles Schafer, David Yarowksy.
Semantics of a Propositional Network Stuart C. Shapiro Department of Computer Science & Engineering Center for MultiSource Information Fusion.
Detecting Economic Events Using a Semantics-Based Pipeline 22nd International Conference on Database and Expert Systems Applications (DEXA 2011) September.
XML on Semantic Web. Outline The Semantic Web Ontology XML Probabilistic DTD References.
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
S.C. Shapiro An Introduction to SNePS 3 Stuart C. Shapiro Department of Computer Science and Engineering and Center for Cognitive Science State.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Knowledge Representation and Semantic Capturing Albena Strupchanska Linguistic Modelling Department, Institute for Parallel Processing, Bulgarian Academy.
Erasmus University Rotterdam Introduction Nowadays, emerging news on economic events such as acquisitions has a substantial impact on the financial markets.
Erasmus University Rotterdam Introduction With the vast amount of information available on the Web, there is an increasing need to structure Web data in.
Chapter 14: Artificial Intelligence Invitation to Computer Science, C++ Version, Third Edition.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Survey of Semantic Annotation Platforms
APPLICATIONS OF CONTEXT FREE GRAMMARS BY, BRAMARA MANJEERA THOGARCHETI.
Chapter 1 Introduction Dr. Frank Lee. 1.1 Why Study Compiler? To write more efficient code in a high-level language To provide solid foundation in parsing.
Information Extraction From Medical Records by Alexander Barsky.
Artificial intelligence project
Chapter 10: Compilers and Language Translation Invitation to Computer Science, Java Version, Third Edition.
Interpreting Dictionary Definitions Dan Tecuci May 2002.
Inference Graphs: A Roadmap Daniel R. Schlegel and Stuart C. Department of Computer Science and Engineering L A – Logic of Arbitrary.
Understanding PML Paulo Pinheiro da Silva. PML PML is a provenance language (a language used to encode provenance knowledge) that has been proudly derived.
USE OF BACKGROUND KNOWLEDGE IN NATURAL LANGUAGE UNDERSTANDING FOR INFORMATION FUSION Stuart C. Shapiro Department of Computer Science and Engineering Center.
Ngoc Minh Le - ePi Technology Bich Ngoc Do – ePi Technology
SceneMaker: Automatic Visualisation of Screenplays School of Computing & Intelligent Systems Faculty of Computing & Engineering University of Ulster, Magee,
Concurrent Inference Graphs Daniel R. Schlegel Department of Computer Science and Engineering Problem Summary Inference graphs 2 in their current form.
Semiautomatic domain model building from text-data Petr Šaloun Petr Klimánek Zdenek Velart Petr Šaloun Petr Klimánek Zdenek Velart SMAP 2011, Vigo, Spain,
Markov Logic and Deep Networks Pedro Domingos Dept. of Computer Science & Eng. University of Washington.
Concurrent Reasoning with Inference Graphs Daniel R. Schlegel and Stuart C. Shapiro Department of Computer Science and Engineering University at Buffalo,
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
Triplet Extraction from Sentences Technical University of Cluj-Napoca Conf. Dr. Ing. Tudor Mureşan “Jožef Stefan” Institute, Ljubljana, Slovenia Assist.
Using Semantic Relations to Improve Passage Retrieval for Question Answering Tom Morton.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Lexico-semantic Patterns for Information Extraction from Text The International Conference on Operations Research 2013 (OR 2013) Frederik Hogenboom
Shallow Parsing for South Asian Languages -Himanshu Agrawal.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
Emotion Recognition from Text Using Situational Information and a Personalized Emotion Model Yong-soo Seol 1, Han-woo Kim 1, and Dong-joo Kim 2 1 Department.
Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.
Knowledge Structure Vijay Meena ( ) Gaurav Meena ( )
Lyon Research Center for Images and Intelligent Information Systems IEEE International Conference on Pervasive Services 2006 FRE 2672 INSA Lyon ICPS, 27.
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Computer Troubleshooting Intelligent System (CTIS) is being developed for Computer Services Department of the Student Health Center at UCF EEL 5874 EXPERT.
Versatile Information Systems, Inc International Semantic Web Conference An Application of Semantic Web Technologies to Situation.
Concurrent Reasoning with Inference Graphs Daniel R. Schlegel Stuart C. Shapiro Department of Computer Science and Engineering Problem Summary Rise of.
Definition and Technologies Knowledge Representation.
The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
Ontology From Wikipedia, the free encyclopedia
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Logical Agents.
Chapter 10: Compilers and Language Translation
Artificial Intelligence 2004 Speech & Natural Language Processing
A Link Grammar for an Agglutinative Language
Presentation transcript:

NATURAL LANGUAGE UNDERSTANDING FOR SOFT INFORMATION FUSION Stuart C. Shapiro and Daniel R. Schlegel Department of Computer Science and Engineering Center for Multisource Information Fusion And Center for Cognitive Science University at Buffalo, The State University of New York

Outline Introduction The Syntactic and Enhanced Syntactic KBs Syntax-Semantics Mapping Evaluation Acknowledgments 7/10/ S. C. Shapiro & D. R. Schlegel Fusion 2013

Tractor Input: Short English intelligence message. Output: Semantic knowledge base (KB) representing contents of message. 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion

Context: Hard & Soft Information Fusion Information from multiple Soft information sources English messages Hard information sources RADAR, SONAR, LIDAR, … are fused for situation assessment. => Requirement: Capture semantic content of each message as completely and correctly as possible. 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion

Approach Not: Information Extraction Look for prespecified classes of Entities Events Properties. Instead: Natural Language Understanding (Semantic Analysis) Translate Entities, Events, Properties, Relations, … Expressed in the text Into a formal Knowledge Representation (KR) language that supports reasoning. 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion

Syntactic Processing in GATE Tractor Architecture 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion English Message Annotations Syntactic KB Semantic KB Enhanced Syntactic KB Syntax-Semantics Mapper CBIR Propositionalizer Named-Entity Recognizers Co-Referencers POS Tagger & Parser Co-Reference Editor Tokenizer & Sentence Splitter Morphological Analyzer Cyc Ontology GEOnet Names Server option

Part of a Syntactic Knowledge Base 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion /03/10 - Dhanun Ahmad has been placed into custody by the Iraqi police and transferred to a holding cell in Karkh; news of his detainment is circulated in his neighborhood of Rashid Semantic information Logical form: (TextOf Ahmad n169)

Some Syntactic Relations/Assertions (TextOf x y) token y is an occurrence of word x (RootOf x y) x is the root form of the word of token y (SyntacticCategoryOf x y) x is the part-of-speech of y (nsubj x y) the subject of x is y (nsubjpass x y) the passive subject of x is y (dobj x y) the direct object of x is y (prep x y) x is modified by a PP headed by y (nn x y) the head noun x is modified by the noun y (token-start-pos x i) token x begins with character i (token-end-pos x i) token x ends just before character i 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion

CBIR Enhancement 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion From GEOnet Names Server

Some Semantic Relations/Assertions (Isa x y) x is an instance of category y (MemberOf x y) x is a member of group y (Type x y) The category x is a subcategory of category y (hasName x y) The name of x is y (per-religion x y) The religion of the person x is y (age x y) The age of x is y (agent x y) The agent of the action x is y (theme x y) The theme (what is acted upon) of the action x is y 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion

Examples of Syntax-Semantics Mapping 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion “Ahmad detained a Sunni youth.” 4 syntax-semantics mapping rules will be shown with before & after graphs.

properNounToName 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion (defrule properNounToName (SyntacticCategoryOf NNP ?token) (TextOf ?text ?token) => (assert `(hasName,?token,?text)) (unassert `(SyntacticCategoryOf NNP,?token)) (unassert `(TextOf,?text,?token)))

nounPhraseToInstance 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion (defrule nounPhraseToInstance (SyntacticCategoryOf NN ?nn) (:when (isNPhead ?nn)) (RootOf ?root ?nn) (:unless (numberTermp ?root)) => (assert `(Isa,?nn,?root)) (unassert `(SyntacticCategoryOf NN,?nn)) (unassert `(RootOf,?root,?nn)))

hasReligion 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion (defrule hasReligion (Isa ?religiongrp ReligiousGroup) (nn ?per ?religiongrp) (hasName ?religiongrp ?religion) => (assert (MemberOf ?per ?religiongrp)) (assert (per-religion ?per ?religion)) (unassert (nn ?per ?religiongrp)))

subjAction 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion (defrule subjAction (nsubj ?action ?subj) (Isa ?action Action) => (assert `(agent,?action,?subj)) (unassert `(nsubj,?action,?subj)))

Final Semantic Graph (KB) 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion “Ahmad detained a Sunni youth.” There is a detain action, with someone named Ahmad as agent, and some youth as theme. The age of the youth is young. The religion of the youth is Sunni.

Evaluation Mapping Rules developed using training messages. Evaluated using test messages. Questions How general are the rules? How thorough are the rules? Are the rules too general? 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion

How General are the rules? 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion TypeCount# Fired% FiredTimes FiredTimes Per Msg CBIR11100% SYN %1, SEM55100% SYNSEM %2, INFER9888.9% CLEAN10880%6, TOTAL %11, Rule firings on test messages Conclusion: Reasonably general.

How thorough are the rules? 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion KBSyntactic Assertions Semantic Asertions % Semantic Syntactic24691, % Semantic w/ CBIR53848, % Semantic w/o CBIR5385, % Syntactic vs. Semantic Assertions Observation: Rules convert from 68% syntactic to 91% semantic w/o CBIR assertions Conclusion: Very thorough.

Are the rules too general? 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion Type# FiredTimes Fired# Correct% Correct CBIR % SYN131,5671, % SEM % SYNSEM562,6512, % INFER % CLEAN56, % TOTAL8811,59711, % Rule firings on correct parses in test messages Conclusion: Not too general.

Conclusions Tractor’s purpose: Semantic Analysis of short English messages for fusion with other soft and hard information. Syntactic analysis of message creates syntactic KB. Syntactic KB enhanced with ontological and geographical information. Mapping rules map syntactic KB into semantic KB. Mapping rules developed with training messages generalize well to test messages. Syntactic KBs average 68% syntactic; Final Semantic KBs average 99% semantic. 7/10/2013 S. C. Shapiro & D. R. Schlegel Fusion

Acknowledgments This work has been supported by a Multidisciplinary University Research Initiative (MURI) grant (Number W911NF ) for "Unified Research on Network- based Hard/Soft Information Fusion", issued by the US Army Research Office (ARO) under the program management of Dr. John Lavery. 7/10/ S. C. Shapiro & D. R. Schlegel Fusion 2013