eNTERFACE’08 Multimodal high-level data integration Project 2

Presentation transcript:

eNTERFACE’08 Multimodal high-level data integration Project 2

Team
- Olga Vybornova (Université catholique de Louvain, UCL-TELE, Belgium)
- Hildeberto Mendonça (Université catholique de Louvain, UCL-TELE, Belgium)
- Ao Shen (University of Birmingham, UK)
- Daniel Neiberg (TMH/CTT, KTH Royal Institute of Technology, Sweden)
- David Antonio Gomez Jauregui (TELECOM and Management SudParis, France)

Project objectives
- to augment and improve the previous work and look for new methods of data fusion
- to resolve the problem of distinguishing between data from different modalities that should be fused and data that should not be fused but analyzed separately, and to implement a technique for doing so
- to explore and employ a context-aware cognitive architecture for decision-making purposes

Background - Multimodality
Multimodality: a set of variables describing states of the world (user’s input, an object, an event, behavior, etc.) represented in different media and through different information channels.
Goal of data fusion: the result of the fusion (merging semantic content from multiple streams) should give an efficient joint interpretation of the multimodal behavior of the user(s), in order to provide effective and advanced interaction.
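The slides do not state how the system decides which events to merge, so the following is only a minimal sketch of one possible criterion. It assumes each modality yields interpreted events with a time span and a referenced entity, and fuses a speech event with a video event only when they refer to the same entity and are close in time. The `Event` fields, the one-second tolerance, and the function names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    modality: str   # e.g. "speech" or "video" (hypothetical labels)
    meaning: str    # interpreted utterance or action, e.g. "ask_for"
    entity: str     # object the event refers to, e.g. "book"
    start: float    # start time in seconds
    end: float      # end time in seconds


def overlaps(a: Event, b: Event, tolerance: float = 1.0) -> bool:
    """True if the two time spans overlap or lie within `tolerance` seconds."""
    return a.start <= b.end + tolerance and b.start <= a.end + tolerance


def should_fuse(a: Event, b: Event, tolerance: float = 1.0) -> bool:
    """Assumed criterion: different modalities, same entity, close in time."""
    return (
        a.modality != b.modality
        and a.entity == b.entity
        and overlaps(a, b, tolerance)
    )


def interpret(a: Event, b: Event) -> list:
    """Return one joint interpretation if fused, else one per event."""
    if should_fuse(a, b):
        return [{"entity": a.entity,
                 "meanings": {a.modality: a.meaning, b.modality: b.meaning}}]
    return [{"entity": e.entity, "meanings": {e.modality: e.meaning}}
            for e in (a, b)]


speech = Event("speech", "ask_for", "book", 2.0, 3.5)
pick_up = Event("video", "pick_up", "book", 3.0, 4.0)
walk = Event("video", "walk_to", "door", 10.0, 12.0)

joint = interpret(speech, pick_up)     # same entity, overlapping: fused
separate = interpret(speech, walk)     # different entity: kept apart
```

Events that fail the criterion are not discarded; they are returned as separate interpretations, which mirrors the objective of analyzing some data separately rather than fusing everything.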

Requirements
Cognitive behavior is goal-oriented and takes place in a rich, complex, detailed environment, so the system should:
- acquire and process a large amount of knowledge,
- be flexible and be a function of the environment,
- be capable of learning from the environment and experience.
behavior = architecture + content

Types of context
- Domain context: prior knowledge of the domain, semantic frames with predefined action patterns, users’ profiles, situation modelling, and an a priori developed, dynamically updated ontology defining subjects, objects, activities and the relations between them for a particular person
- Video context: capturing the users’ actions in the observation scene
- Linguistic context: derived from natural language semantic analysis
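As an illustration only, the three kinds of context could be held in one container so that a fusion step can consult them together. The class and field names below are assumptions made for this sketch and do not come from the project's code.

```python
from dataclasses import dataclass, field


@dataclass
class DomainContext:
    user_profile: dict = field(default_factory=dict)  # a priori and updated user data
    frames: list = field(default_factory=list)        # names of known pattern situations
    ontology: dict = field(default_factory=dict)      # concept -> related concepts


@dataclass
class VideoContext:
    actions: list = field(default_factory=list)       # (timestamp, action) pairs


@dataclass
class LinguisticContext:
    triples: list = field(default_factory=list)       # (subject, predicate, object)


@dataclass
class Context:
    domain: DomainContext = field(default_factory=DomainContext)
    video: VideoContext = field(default_factory=VideoContext)
    linguistic: LinguisticContext = field(default_factory=LinguisticContext)


ctx = Context()
ctx.domain.user_profile["name"] = "Alice"             # hypothetical user
ctx.video.actions.append((3.0, "open_drawer"))
ctx.linguistic.triples.append(("alice", "look_for", "keys"))
```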

Example scenario

Knowledge-based approach
- Restricted-domain ontology: structure and its instantiation
- Pattern situations (semantic frames)
- User profile: a priori collected information about users (preferences, social relationships, etc.) and dynamically obtained data
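To make the idea of pattern situations concrete, here is a minimal sketch in which a semantic frame is a named set of required slot values and an observed situation matches a frame when it supplies all of them. The frame names, slots, and values are invented for illustration; the project's actual frames live in its Protégé ontology and are not shown in the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    name: str
    required: frozenset  # set of (slot, value) pairs that must all be observed


def match_frames(frames, observed: dict) -> list:
    """Return the names of frames whose required slot values all appear in `observed`."""
    items = set(observed.items())
    return [frame.name for frame in frames if frame.required <= items]


# Hypothetical pattern situations for a home/office scene.
FRAMES = [
    Frame("searching_for_object",
          frozenset({("speech_act", "question"), ("action", "search")})),
    Frame("leaving_room",
          frozenset({("action", "walk"), ("target", "door")})),
]

observed = {"speech_act": "question", "action": "search", "target": "keys"}
matched = match_frames(FRAMES, observed)
```

Extra observed slots (here `target`) do not prevent a match; only missing or different required values do.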

Architecture (diagram)
Inputs: Audio Stream, Video Stream
Components: Speech Recognizer, Syntactic Analyzer, Semantic Analyzer, Video Analyzer, Human Behavior Analyzer, Knowledge Base, Fusion Mechanism
Data labels: Sound Waves, Recognized String, Syntactic Triple, Linguistic meanings, Sequence of Images, Movements Coordinates, Movements Meanings, Advise
Output to: People
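The diagram's labels suggest two processing chains that meet in the fusion mechanism. The sketch below is one reading of that layout, with toy stand-ins for every component: none of these functions call Sphinx-4, the C&C parser, Visual Hull, or Soar, and the knowledge-base layout, the advice rule, and all names are assumptions.

```python
def recognize_speech(audio_stream):
    # Stand-in recognizer: the toy "audio" already carries its transcript.
    return audio_stream["transcript"]


def parse_syntax(recognized_string):
    # Toy parser: treats the utterance as "subject verb object...".
    subject, verb, *rest = recognized_string.lower().split()
    return (subject, verb, " ".join(rest))


def analyze_semantics(triple, knowledge_base):
    subject, verb, obj = triple
    intent = knowledge_base["intents"].get(verb, "unknown")
    return {"actor": subject, "intent": intent, "object": obj}


def analyze_video(video_stream):
    # Stand-in video analyzer: the toy "video" already carries coordinates.
    return video_stream["coordinates"]


def analyze_behavior(coordinates, knowledge_base):
    # Toy rule: the person is "near" the known location closest to the last position.
    x, y = coordinates[-1]

    def squared_distance(name):
        lx, ly = knowledge_base["locations"][name]
        return (lx - x) ** 2 + (ly - y) ** 2

    return {"near": min(knowledge_base["locations"], key=squared_distance)}


def fuse(linguistic, movement, knowledge_base):
    # Assumed rule: advise when the user searches for an object away from where it is.
    place = knowledge_base["object_locations"].get(linguistic["object"])
    if linguistic["intent"] == "search" and place and movement["near"] != place:
        return f"Look for the {linguistic['object']} at the {place}."
    return None


def run_pipeline(audio_stream, video_stream, knowledge_base):
    linguistic = analyze_semantics(
        parse_syntax(recognize_speech(audio_stream)), knowledge_base)
    movement = analyze_behavior(analyze_video(video_stream), knowledge_base)
    return fuse(linguistic, movement, knowledge_base)


KB = {
    "intents": {"need": "search"},
    "locations": {"shelf": (0, 0), "desk": (5, 5)},
    "object_locations": {"keys": "shelf"},
}
advice = run_pipeline({"transcript": "I need keys"},
                      {"coordinates": [(4, 4), (5, 4)]}, KB)
no_advice = run_pipeline({"transcript": "I need keys"},
                         {"coordinates": [(0, 1)]}, KB)
```

Both chains read from the same knowledge base, which is how the sketch reflects the Knowledge Base appearing alongside both the linguistic and the behavior analysis in the diagram.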

Tooling / Implementation
- Speech recognition: Sphinx-4
- Syntactic analysis: C&C Parser + semantic analyzer
- Semantic analysis: Soar
- Ontology construction and instantiation: Protégé
- Video analysis: Visual Hull (developed by Diego Ruiz, UCL-TELE)
- Human behavior analysis: Soar
- Ontology: Protégé
- Fusion mechanism: Soar
- Integration: OpenInterface

Challenges
- Unrestricted natural language
- Free natural behavior within a home/office environment

Why do we need Soar?
Capabilities:
- manages a full range of tasks expected of an intelligent agent, from routine to extremely difficult, open-ended problems
- represents and uses different forms of knowledge: procedural (rules), semantic (declarative, facts), episodic
- employs various problem-solving methods
- interacts with the outside world
- integrates reaction, deliberation, planning and meta-reasoning, dynamically switching between them
- has integrated learning (chunking, reinforcement learning, episodic and semantic learning)
- is useful in cognitive modeling, including taking emotions, feelings and mood into account
- is easy to integrate with other systems and environments (SML, the Soar Markup Language, efficiently supports many languages)
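The distinction between procedural knowledge (rules) and semantic knowledge (facts) can be illustrated with a generic forward-chaining loop. This is not Soar's decision cycle and does not use SML; it is a minimal sketch of rules firing over a working memory, with invented facts and rules.

```python
def run_rules(facts, rules, max_cycles=10):
    """Fire every rule whose conditions hold until no new fact is added."""
    memory = set(facts)
    for _ in range(max_cycles):
        new_facts = {
            conclusion
            for conditions, conclusion in rules
            if conditions <= memory and conclusion not in memory
        }
        if not new_facts:
            break
        memory |= new_facts
    return memory


# Semantic knowledge: declarative facts (hypothetical).
FACTS = {
    ("user", "asks_for", "keys"),
    ("keys", "at", "shelf"),
}

# Procedural knowledge: (conditions, conclusion) rules (hypothetical).
RULES = [
    (frozenset({("user", "asks_for", "keys")}),
     ("goal", "locate", "keys")),
    (frozenset({("goal", "locate", "keys"), ("keys", "at", "shelf")}),
     ("advise", "user", "shelf")),
]

result = run_rules(FACTS, RULES)
```

The second rule can fire only after the first has added the goal, so the advice is derived in the second pass over the rules.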

Soar architecture

Project schedule

WP 1 Pre-workshop preparation
- Task: identify and investigate the multimodal components and systems needed during the workshop
- Task: define the system architecture, taking advantage of previously accumulated experience and existing results
- Task: describe precisely the scenario to work on during the workshop
- Task: make the video sequences

WP 2 Integration of multimodal components and systems (1st week)
- Task: implement the architecture, making all multimodal components and systems work together
- Task: explore and select the most suitable method(s) for action-speech multimodal fusion
- Task: investigate the fusion implementation

WP 3 Multimodal fusion implementation (2nd week)
- Task: implement the fusion algorithms
- Task: test the fusion algorithms in a distributed computer environment

WP 4 Scenario implementation and reporting (3rd and 4th weeks)
- Task 4.1: integrate and test all the components and systems on the OpenInterface platform
- Task: prepare a presentation and reports about the results
- Task: demonstrate the results