MOQA Meaning Oriented Question Answering An AQUAINT Project from ILIT.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

TU/e technische universiteit eindhoven Hera: Development of Semantic Web Information Systems Geert-Jan Houben Peter Barna Flavius Frasincar Richard Vdovjak.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP1. Project Management.
T-FLEX DOCs PLM, Document and Workflow Management.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Object-Oriented Analysis and Design
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Semantic Web Tools for Authoring and Using Analysis Results Richard Fikes Robert McCool Deborah McGuinness Sheila McIlraith Jessica Jenkins Knowledge Systems.
Cornell University Library Instruction Statistics Reporting System Members: Patrick Chen (pyc7) Soo-Yung Cho (sc444) Gregg Herlacher (gah24) Wilson Muyenzi.
Lecture 13 Revision IMS Systems Analysis and Design.
A System for A Semi-Automatic Ontology Annotation Kiril Simov, Petya Osenova, Alexander Simov, Anelia Tincheva, Borislav Kirilov BulTreeBank Group LML,
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Overview of Software Requirements
Copyright 2004 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Second Edition Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Chapter.
Knowledge-Based NLP and the Semantic Web Sergei Nirenburg Institute for Language and Information Technologies University of Maryland Baltimore County Workshop.
1 DCS861A-2007 Emerging IT II Rinaldo Di Giorgio Andres Nieto Chris Nwosisi Richard Washington March 17, 2007.
Introduction to Systems Analysis and Design
Meaning-Oriented Question-Answering with Ontological Semantics An AQUAINT Project from ILIT.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Copyright 2001 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Chapter 1 The Systems.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
MAHI Research Database Project Status Report August 9, 2001.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
New Task Group CRIS Architecture & Development Maximilian Stempfhuber RWTH Aachen University Library
S/W Project Management
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
FP OntoGrid: Paving the way for Knowledgeable Grid Services and Systems WP8: Use case 1: Quality Analysis for Satellite Missions.
‘One Sky for Europe’ EUROCONTROL © 2002 European Organisation for the Safety of Air Navigation (EUROCONTROL) Page 1 VALIDATION DATA REPOSITORY Overview.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
CoGenTex, Inc. Ontology-based Multimodal User Interface in MOQA AQUAINT 18-Month Workshop San Diego, California Tanya Korelsky Benoit Lavoie Ted Caldwell.
Carnegie Mellon School of Computer Science Copyright © 2001, Carnegie Mellon. All Rights Reserved. JAVELIN Project Briefing 1 AQUAINT Phase I Kickoff December.
Košice, 10 February Experience Management based on Text Notes The EMBET System Michal Laclavik.
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
1 INTEROP WP1: Knowledge Map Michaël Petit (U. of Namur) January 19 th 2004 Updated description of tasks after INTEROP Kickoff Meeting, Bordeaux.
Software Engineering, 8th edition Chapter 8 1 Courtesy: ©Ian Somerville 2006 April 06 th, 2009 Lecture # 13 System models.
1 Introduction to Software Engineering Lecture 1.
Project Overview Vangelis Karkaletsis NCSR “Demokritos” Frascati, July 17, 2002 (IST )
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
© 2007 IBM Corporation SOA on your terms and our expertise Software WebSphere Process Server and Portal Integration Overview.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
© NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain.
Volgograd State Technical University Applied Computational Linguistic Society Undergraduate and post-graduate scientific researches under the direction.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
 An Information System (IS) is a collection of interrelated components that collect, process, store, and provide as output the information needed to.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Jens Hartmann York Sure Raphael Volz Rudi Studer The OntoWeb Portal.
Human-Assisted Machine Annotation Sergei Nirenburg, Marjorie McShane, Stephen Beale Institute for Language and Information Technologies University of Maryland.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
WP1: Plan for the remainder (1) Ontology –Finalise ontology and lexicons for the 2 nd domain (RTV) Changes agreed in Heraklion –Improvement to existing.
© NCSR, Frascati, July 18-19, 2002 CROSSMARC big picture Domain-specific Web sites Domain-specific Spidering Domain Ontology XHTML pages WEB Focused Crawling.
Building Enterprise Applications Using Visual Studio®
An Overview of Requirements Engineering Tools and Methodologies*
Development of the Amphibian Anatomical Ontology
Information Retrieval and Web Search
Information Retrieval and Web Search
Software Architecture
A framework for ontology Learning FROM Big Data
Presentation transcript:

MOQA Meaning Oriented Question Answering An AQUAINT Project from ILIT

CRL is a research department in the School of Arts and Sciences at NMSU Director: Jim Cowie Currently has a staff of 10 PhDs Mainly focuses on language engineering research Languages include – Arabic, Farsi, Turkish, Spanish, Chinese, Japanese, Korean Contact: Jim Cowie –

Advanced-technology company in Ithaca, New York Founded in 1990 by Dr. Richard Kittredge, Dr. Tanya Korelsky, and Dr. Owen Rambow. Goal is to transform results from research in natural language processing into practical software applications. Has developed a core set of text generation tools Current focus is on expanding the range of applications for this technology, with a particular focus on the Web. Contact: Tanya Korelsky –

The Institute for Language and Information Technologies at University of Maryland Baltimore County Sergei Nirenburg, Director Opened September 2002 with a team of 3 senior personnel –Sergei Nirenburg –Stephen Beale –Marge McShane Contact: Sergei Nirenburg – ILIT

Meaning-Oriented Question-Answering with Ontological Semantics Domain: travel and meetings –question understanding and interpretation; – determining the answer and – presenting the answer two kinds of data source –open text (in English, Arabic and Persian) –Structured Fact Repository containing instances of ontological entities

Project Tasks Design and Implementation of System Architecture Knowledge Acquisition Question Understanding Question Interpretation Answer Determination Answer Formulation Documentation; User and Evaluator Training; Testing; and System Evaluation

Dialog and Self- Awareness-related Answer Determination: (for running commentary and workflow and context- related communication) Question Interpretation:  task context  dialog context  user profile  analyst team profile Question Understanding Answer Formulation and Presentation Input : User Question in English Output : System Response in English Task-Oriented Answer Determination from Fact Database: IE from Fact Database NL Query Generation: in English, Arabic and one of Persian, Russian, Spanish Answer Determination from open text:  IR  IE  Production of TMRs for Textual Fillers of IE Templates NL Query FACT REPOSITORY: including instances of goals, plans, scripts ::ONTOLOGY including goals, plans, scripts LEXICONS Each Language in System: including names and phrases Knowledge Sources Processing Modules and Intermediate Results Goal and Plan Processing Manager System Working Memory Extended TMR: adds a statement of active goals, plans and scripts in the system System Response in TMR Basic Text Meaning Representation (TMR) Goal Attainment and Plan Execution Agenda Using XML

Development Methodology Rapid Prototyping Using pre-existing components Grow the various development activities toward an integrated system Today – look at one example of each User Interaction Analysis Resources XML - TMR

Deliverables A QA system in the domain of travel and meetings, with a capability to search for information in open texts in three languages and in a structured, ontology-based Fact DB; an enhanced text analysis system for each of the languages; a question interpretation module that takes into account user goals and the context of the dialog; an integrated IR/IE module working on open text in three languages, on the basis of ontologically defined extraction templates;

Deliverables (Cont.) an ontology of about 6,500 concepts; A Fact DB of about 100,000 facts; a system for automating the acquisition of the Fact DB; a semantic lexicon for each of the languages in the system, at about 20,000 entries a decision-making module that determines the answer(s) and system action(s) at each step of the dialog/task processing; an intuitive and intelligent multi-modal user interface, which uses natural language generation in answers and for query validation

Project Status Approval to spend given from August 21 st 2002 UMBC – Ontology and English Lexicon Improvement, Development of Scripts, Meaning Based Text Analysis CoGenTex – Interface design, Human Factors, Text Generation NMSU – Text-preprocessing, Arabic and Farsi resources and analysis, data collection, system integration.

Corpora Being Collected Arabic : – – – English: – – – Persian: – – –

Ontology An ontology is a formally and semantically defined repository of concepts and relations about the world. –Including knowledge about events, objects, and work flow scripts Linked to the ontology are: –fact repositories, including facts about actual events, objects, places, personalities, etc. –lexica, defining words in a language in ontological terms –“ onomastica ”, or multilingual proper name lists

Structured Common Fact Repository Uniform organization for all kinds of data Support for multiple applications and tools Semantically anchored in general ontology Constantly updated; today, manually; tomorrow, semi-automatically; long-term, automatically Supports both domain knowledge and workflow specification

Populating the Fact Repository Original Text CIA Report 02 [15 October, 2002, from a source, said to be credible, in Jordan]: A man named Majed H., using a faked Jordanian passport and travel visa, traveled from Aman, Jordan to Chicago, Illinois on 12 July, Majed H. is now known to have resided in Afghanistan for two years ( ) and has been identified as a member of Al- Qaeda. Gloss An unnamed source, who is very reliable, informed the CIA sometime between July 12, 2002 and October 15, 2002, of a travel- event by Majed H. on July 12, 2002, from Aman (Amman) Jordan to Chicago Illinois. In order to take the trip, Majed H. used a fake passport and visa issued by Jordan. Majed H. was located in Afghanistan from January 1, 1996, through December 31, 1997, and is a member of Al-Qaeda.

Populating the Fact Repository (2) Gloss An unnamed source, who is very reliable, informed the CIA sometime between July 12, 2002 and October 15, 2002, of a travel- event by Majed H. on July 12, 2002, from Aman (Amman) Jordan to Chicago Illinois. In order to take the trip, Majed H. used a fake passport and visa issued by Jordan. Majed H. was located in Afghanistan from January 1, 1996, through December 31, 1997, and is a member of Al-Qaeda. Facts (12 Total) INFORM-1 AGENT: SOCIAL-ROLE-4 BENEFICIARY: ORGANIZATION-1 THEME: TRAVEL-EVENT-0 TIME: <> 07/12/ /15/2002 MODALITY-EPISTEMIC: > 0.6 …………… NATION-2 HAS-NAME: "Jordan" CITY-1 HAS-NAME: "Amman" IN-NATION: NATION-2

LEXICON: English lexical entry mapped to concept “EXIT”

LEXICON: Chinese lexical entry mapped to concept “EXIT”

Using Ontology to Support Retrieval Documents need to be retrieved using the language of the document The representation of queries in the system is in terms of ontological concepts and “facts” We will use the ontology to support retrieval in all three languages Current experiment uses Chinese and Spanish- Ontology-Language lexicons exist for these languages

User Specified Query

Mapped to Associated Concepts

Concepts Map to Language of Documents

Concept-Word Mappings

Further Mapping of Concepts

Generation Tasks Months 1-6 Subtask 1: First prototype of intelligent question answering user interface, involving hypertext generation (December demo) Subtask 2: Gathering of end users feedback on the interface functionality, look-and-feel and user customization Subtask 3: Design of extensions to cover broader collection of concepts from the ontology Milestone at next 6 months: Report on user feedback

MOQA User Interface Support for both natural languages queries and structured queries Intuitive web-based multi-modal interface for answers Tables, text, maps, time line, and social network graphs are interconnected by hyperlinks Natural language generation used in answers and for query validation Implemented using XML-based technology Positive reviews at the kick-off from an HCI expert and program management

MOQA User Interface – Query Page Support for NL- based queries and structured queries Structured query validation with automatically generated NL paraphrase WYSIWYM editing of structured queries

MOQA User Interface –Underlying XML Queries Uses standard XML technology (e.g. XML- compliant browser, XML parsers, etc.) Supports modularity – the XML representation is viewable and exchangeable between subsystems Assures automatic validation of query instances using a query class hierarchy described by XML schemas Uses logical expressions in XML to support complex queries

MOQA User Interface – Results Page (kick- off concept demo version) Concept demo helped to perform requirements analysis Demonstrated integrated display using tables, textual summary, map and time line Illustrated filtering table data by using hyperlinks in text

MOQA User Interface - Details Page (concept demo) Demonstrated display of additional types of information including social networks and source document extracts Illustrated “drill-down” by hyperlinks and typical follow-up queries based on underlying ontology

MOQA User Interface – Research Plans Presentation of partially understood natural language queries Personalization of answer presentation both content-wise (based on user expertise) and form- wise (based on user presentation preferences) Intelligent maintenance of session history based on typical work flow and collaboration patterns within groups Interface portability between subject domains Incremental evolution based on validation by domain experts