A comprehensive framework for multimodal meaning representation
Ashwani Kumar, Laurent Romary
Laboratoire LORIA, Vandoeuvre-lès-Nancy
Overview - 1
Context:
- Conception phase of the EU IST/MIAMM project (Multidimensional Information Access using Multiple Modalities - with DFKI, TNO, Sony, Canon)
- Study of the design factors for a future haptic PDA-like device
- Underlying application: multidimensional access to a musical database
Overview - 2
Objectives:
- Design and implementation of a unified representation language within the MIAMM demonstrator
- MMIL: Multimodal Interface Language
- "Blind" application of (Bunt & Romary 2002)
Methodology
Basic components:
- Represent the general organization of any semantic structure
- Parameterized by data categories taken from a common registry, plus application-specific data categories
General mechanisms:
- Make the representation actually work in practice
General categories:
- Descriptive categories available to all formats
- Strict conformance to existing standards
MIAMM - wheel mode
MIAMM architecture dependencies
[architecture diagram: speech input flows from the Microphone (Headset) through the Continuous Speech Recognizer (word/phoneme lattice) and Structural Analysis (SPIN) into Multimodal Fusion (MMF); the Dialogue Manager maintains the Dialogue History and drives the Action Planner (AP), which queries the MiaDoMo Database; output goes through Language Generation and Speech Synthesis to the Speaker, and through Visual-Haptic Processing (VisHapTac) and the Haptic Processor to the Haptic Device and Display]
Various processing steps - 1
Reco:
- Provides word lattices
- Out of our scope (MPEG7 word and phone lattice module)
SPIN:
- Template-based (en-de) or TAG-based (fr) dependency structures
- Low-level semantic constructs
Various processing steps - 2
MMF (Multimodal Fusion):
- Fully interpreted structures
- Referential (MMILId) and temporal anchoring
- Dialogue history update
AP (Action Planner):
- Generates MIAMM internal actions
- Requests to MiaDoMo
- Actions to be generated (Language + VisHapTac)
Various processing steps - 3
VisHapTac:
- Informs MMF of the current graphical and haptic configuration (hierarchies of objects, focus, selection)
MMIL must answer all of those needs - though not all at the same time.
Main characteristics of MMIL
Basic ontology:
- Events and participants (organized as hierarchies)
- Restrictions on events and participants
- Relations among these
Additional mechanisms:
- Temporal anchoring of events
- Ranges and alternatives
Representation:
- Flat meta-model
MMIL meta-model (UML)
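Since the UML diagram itself is not reproduced here, the flat meta-model can be pictured as a list of events and participants carrying feature restrictions, plus typed relations between them. A minimal XML sketch under that reading (the element names follow the meta-model vocabulary; the normative MMIL serialization may differ):

  <mmil>
    <event id="e0">
      <evtType>speak</evtType>
    </event>
    <participant id="p1">
      <objType>user</objType>
    </participant>
    <relation type="speaker" source="e0" target="p1"/>
  </mmil>

The flatness is the point: components are not nested inside one another, so a structure can stay partial and be extended incrementally.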
An overview of data categories
- Underlying ontology for a variety of formats
- Distinction between abstract definition and implementation (e.g. in XML)
- Standardization objective: implementing a reference registry for NLP applications
- Wider set of DatCats than just semantics
- ISO (meta-data registries) as a reference standard for implementing such a registry
DatCat example: /Addressee/
Definition: the entity that is the intended hearer of a speech event. The scope of this data category is extended to deal with any multimodal communication event (e.g. haptic and tactile).
Source: (implicit) an event, whose evtType should be /Speak/
Target: a participant (user or system)
Styles and vocabularies
Style: design choice to implement a data category as an XML element, a database field, etc.
Vocabulary: the names to be provided for a given style
E.g. (for /Addressee/):
- Style: Element
- Vocabulary: {"addressee"}
Note: Multilingualism
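For instance, /Addressee/ rendered in the Element style specified above, contrasted with an Attribute style rendering of the same data category (the attribute variant is made up here purely to show that the style is a design choice, not part of the MIAMM specification):

  Element style:
    <event id="e0">
      <evtType>speak</evtType>
      <addressee>p2</addressee>
    </event>

  Attribute style (hypothetical):
    <event id="e0" addressee="p2">
      <evtType>speak</evtType>
    </event>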
Time stamping
/Starting point/
Definition: indicates the beginning of the event
Values: dateTime
Anchor: time level
Style: attribute
Vocabulary: {"startPoint"}
Example (year period 1991):
  <tempSpan startPoint="1991-01-01T00:00:00" endPoint="1991-12-31T24:59:59"/>
Application: a family of formats
- Openness: a requirement for MIAMM
- Specific formats for input and output of each module
- Each format is defined within the same generic MMIL framework:
  - Same meta-model for all
  - Specific DatCat specification for each
The MIAMM family of formats
SPIN-O, MMF-O, AP-O, VisHapTac-O, MMF-I, MMIL+
The specifications provide typing information for all these formats.
SPIN-O example
"Spiel mir das Lied bitte vor" (Please play the song)
[graph: event e0 (evtType=speak, dialogueAct=request) has speaker p1 (objType=user) and propContent e1 (evtType=play, lex=vorspielen); e1 has object p2 (objType=tune, refType=definite, refStatus=pending) and destination p1]
[the corresponding SPIN-O XML; only the feature values survived extraction: speak, request, play, vorspielen, user, tune, definite, pending - see the reconstruction below]
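A hedged reconstruction of that SPIN-O fragment from the graph above (all feature names and values come from the slides; the exact element layout is an assumption):

  <event id="e0">
    <evtType>speak</evtType>
    <dialogueAct>request</dialogueAct>
  </event>
  <event id="e1">
    <evtType>play</evtType>
    <lex>vorspielen</lex>
  </event>
  <participant id="p1">
    <objType>user</objType>
  </participant>
  <participant id="p2">
    <objType>tune</objType>
    <refType>definite</refType>
    <refStatus>pending</refStatus>
  </participant>
  <relation type="speaker" source="e0" target="p1"/>
  <relation type="propContent" source="e0" target="e1"/>
  <relation type="object" source="e1" target="p2"/>
  <relation type="destination" source="e1" target="p1"/>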
The use of perceptual grouping
Reference domains and visual contexts:
[figure: a visual context of three objects, one triangle and two circles; "these three objects" denotes the whole set, "the triangle" the singleton, "the two circles" the pair of circles]
The use of salience
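A hypothetical MMIL-style encoding of such a reference domain (the set/member layout mirrors the VisHapTac-O structures on the next slides; the member relation and the object types are illustrative assumptions):

  <participant id="s1">
    <objType>set</objType>        <!-- the visual context: "these three objects" -->
  </participant>
  <participant id="s1-1">
    <objType>triangle</objType>   <!-- "the triangle": unique in the domain, hence salient -->
  </participant>
  <participant id="s1-2">
    <objType>circle</objType>     <!-- "the two circles": s1-2 and s1-3 together -->
  </participant>
  <participant id="s1-3">
    <objType>circle</objType>
  </participant>
  <relation type="member" source="s1" target="s1-1"/>
  <relation type="member" source="s1" target="s1-2"/>
  <relation type="member" source="s1" target="s1-3"/>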
VisHapTac-O
[graph: a visual-haptic state event e0 with a participant setting of two sets, s1 ("set 1") and s2 ("set 2", with sub-divisions s2-1, s2-2, s2-3, …); description features mark which sets are inFocus and inSelection]
VisHapTac output - 1
HGState, galaxy visualization:
  <tempSpan startPoint="T14:12:06" endPoint="T14:12:13"/>
…
VisHapTac output - 2
[XML continuation: the current participant setting; a set marked inFocus contains tunes such as "Let it be" and "Lady Madonna", and "Revolution 9" is marked inSelection]
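Read together, the two output slides suggest a message of roughly this shape (the evtType, the galaxy visualization, the tempSpan values, the tune titles and the inFocus/inSelection features come from the slides; element names and nesting are assumptions):

  <event id="e0">
    <evtType>HGState</evtType>
    <visualization>galaxy</visualization>
    <tempSpan startPoint="T14:12:06" endPoint="T14:12:13"/>
  </event>
  <participant id="s1">
    <objType>set</objType>
    <description>inFocus</description>
    <tune>Let it be</tune>
    <tune>Lady Madonna</tune>
  </participant>
  <participant id="s2">
    <objType>set</objType>
    <description>inSelection</description>
    <tune>Revolution 9</tune>
  </participant>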
Conclusion
Most of the properties we wanted are fulfilled:
- Uniformity, incrementality, partiality, openness and extensibility
Discussion point: semantic adequacy
- Not a direct input to an inference system (except for the underlying ontology)
- Semantics provided through the specification