PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing.

Slides:



Advertisements
Similar presentations
TeleMorph & TeleTuras: Bandwidth determined Mobile MultiModal Presentation Student: Anthony J. Solon Supervisors: Prof. Paul Mc Kevitt Kevin Curran School.
Advertisements

Sharing Content and Experience in Smart Environments Johan Plomp, Juhani Heinila, Veikko Ikonen, Eija Kaasinen, Pasi Valkkynen 1.
Irek Defée Signal Processing for Multimodal Web Irek Defée Department of Signal Processing Tampere University of Technology W3C Web Technology Day.
Empirical and Data-Driven Models of Multimodality Advanced Methods for Multimodal Communication Computational Models of Multimodality Adequate.
MediaHub: An Intelligent Multimedia Distributed Hub Student: Glenn Campbell Supervisors: Dr. Tom Lunney Prof. Paul Mc Kevitt School of Computing and Intelligent.
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Multimodal User Interface Management Part of STIMULATE project at CAIP Center, Rutgers University PI: Dr. James L. Flanagan Research on natural human-computer.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
1 MPEG-21 : Goals and Achievements Ian Burnett, Rik Van de Walle, Keith Hill, Jan Bormans and Fernando Pereira IEEE Multimedia, October-November 2003.
Multimodal Interaction. Modalities vs Media Modalities are ways of encoding information e.g. graphics Media are instantiations of modalities e.g. a particular.
Information Retrieval in Distributed Environments Based on Context- Aware, Proactive Documents Current Research Information Systems (CRIS 2002) August.
Developing Intelligent Agents and Multiagent Systems for Educational Applications Leen-Kiat Soh Department of Computer Science and Engineering University.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Smart Learning Services Based on Smart Cloud Computing
Evaluating Centralized, Hierarchical, and Networked Architectures for Rule Systems Benjamin Craig University of New Brunswick Faculty of Computer Science.
AmbiLearn: an ambient intelligent multimodal learning environment for children Jennifer Hyndman Supervisors: Dr. Tom Lunney, Prof. Paul Mc Kevitt Intelligent.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
A MBI L EARN Ambient Intelligent Multimodal Learning Environment for Children Confirmation Viva June 2009 Jennifer Hyndman, B.Sc. (Hons.) Supervisors:
B. Ross Cosc 4f79 1 Commercial tools Size of system: –small systems 400 rules single user, PC based –larger systems narrow, problem-type specific or hybrid.
MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney & Paul Mc Kevitt School of Computing and Intelligent Systems Faculty.
An Introduction to Artificial Intelligence and Knowledge Engineering N. Kasabov, Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering,
APML, a Markup Language for Believable Behavior Generation Soft computing Laboratory Yonsei University October 25, 2004.
10/6/2015 1Intelligent Systems and Soft Computing Lecture 0 What is Soft Computing.
Argumentation and Trust: Issues and New Challenges Jamal Bentahar Concordia University (Montreal, Canada) University of Namur, Belgium, June 26, 2007.
SceneMaker: Multimodal Visualisation of Natural Language Film Scripts Dr. Minhua Eunice Ma School of Computing & Intelligent Systems Faculty of Computing.
Object Management Group (OMG) Specifies open standards for every aspect of distributed computing Multiplatform Model Driven Architecture (MDA)
TeleMorph & TeleTuras: Bandwidth determined Mobile MultiModal Presentation Student: Anthony J. Solon Supervisors: Prof. Paul Mc Kevitt Kevin Curran School.
10 ITK261 The traditional approach to design Reading: Chapter 10 Oct 9, 11.
Web Services for Satellite Emulation Development Kathy J. LiszkaAllen P. Holtz The University of AkronNASA Glenn Research Center.
SceneMaker Intelligent Multimodal Visualisation of Natural Language Scripts Eva Hanser Dipl.-Des. (FH), M.Sc. Prof. Paul Mc Kevitt, Dr. Tom Lunney, Dr.
SceneMaker: Automatic Visualisation of Screenplays School of Computing & Intelligent Systems Faculty of Computing & Engineering University of Ulster, Magee,
Conceptual Architecture of Mozilla Firefox (version ) Jared Haines Iris Lai John,Chun-Hung,Chiu Josh Fairhead June 5, 2007.
The european ITM Task Force data structure F. Imbeaux.
I Robot.
TAL7011 – Lecture 4 UML for Architecture Modeling.
Intelligent Robot Architecture (1-3)  Background of research  Research objectives  By recognizing and analyzing user’s utterances and actions, an intelligent.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
interactive logbook Paul Kiddie, Mike Sharples et al. The Development of an Application to Enhance.
A Quantitative Trust Model for Negotiating Agents A Quantitative Trust Model for Negotiating Agents Jamal Bentahar, John Jules Ch. Meyer Concordia University.
A MBI L EARN Ambient Intelligent Multimodal Learning Environment for Children 100 day review December 2008 Jennifer Hyndman Supervisors: Dr. Tom Lunney,
Volgograd State Technical University Applied Computational Linguistic Society Undergraduate and post-graduate scientific researches under the direction.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
TAUCHI – Tampere Unit for Computer-Human Interaction Markku Turunen Tampere Unit for Human-Computer Interaction University of Tampere MUMIN PhD course,
Simple Object Access Protocol
©Silberschatz, Korth and Sudarshan10.1Database System Concepts W3C - The World Wide Web Consortium W3C - The World Wide Web Consortium.
Intelligent MultiMedia Storytelling System (IMSS) - Automatic Generation of Animation From Natural Language Input By Eunice Ma Supervisor: Prof. Paul Mc.
PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
IMSTD:Intelligent Multimedia System for teaching Databases By : NAZLIA OMAR Supervisors: Prof. Paul Mc Kevitt Dr. Paul Hanna School of Computing and Mathematical.
“Intelligent User Interfaces” by Hefley and Murray.
SceneMaker: Automatic Visualisation of Screenplays Eva Hanser Prof. Paul Mc Kevitt Dr. Tom Lunney Dr. Joan Condell School of Computing & Intelligent Systems.
Third International Workshop on Networked Appliance 2001 SONA: Applying Mobile Agent to Networked Appliance Control S.Aoki, S.Makino, T.Okoshi J.Nakazawa.
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
Software Architecture for Multimodal Interactive Systems : Voice-enabled Graphical Notebook.
Ganga/Dirac Data Management meeting October 2003 Gennady Kuznetsov Production Manager Tools and Ganga (New Architecture)
A New Approach to Decision-making within an Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Aiden Mc Caughey, Paul Mc Kevitt.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
TECHNOLOGY GUIDE FOUR Intelligent Systems.
Analyzing and Securing Social Networks
MediaHub: An Intelligent MultiMedia Distributed Platform Hub
Intelligent Systems and
Pervasive Computing Happening?
Professor John Canny Spring 2003
Professor John Canny Spring 2004
Presentation transcript:

PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing and Intelligent Systems Faculty of Engineering University of Ulster, Magee Campus Northland Road, Derry {Campbell-g8, TF.Lunney,

PGNET, Liverpool JMU, June 2005 Outline Goals and objectives Key research problems Distributed Processing Distributed Platforms Architecture of MediaHub Decision making in MediaHub Comparison to related research Tools and future development

PGNET, Liverpool JMU, June 2005 Goals The primary objectives of MediaHub are to: Interpret/generate semantic representations of multimodal input/output Perform fusion and synchronisation of multimodal data (decision-making) Implement and evaluate a multimodal platform hub (MediaHub)

PGNET, Liverpool JMU, June 2005 Goals Research questions: Semantic representation? Communication with other elements of a platform? Semantic representation? Decision-making?

PGNET, Liverpool JMU, June 2005 Key research problems Semantic Representation Represent language and vision Frames or XML? Semantic Storage Blackboard model? Non-blackboard model? Decision-making Fusion and synchronisation AI technique

PGNET, Liverpool JMU, June 2005 Frames (CHAMELEON) (Brøndsted et al. 1998, 2001) [MODULE INPUT: input INTENTION: intention-type TIME: timestamp] [SPEECH-RECOGNISER UTTERANCE:(Point to Hanne’s office) INTENTION: instruction! TIME: timestamp] [GESTURE GESTURE: coordinates (3, 2) INTENTION: pointing TIME: timestamp] XML (M3L, SmartKom) ( Bühler et al. 2002, Wahlster et al. 2001) list epg_browse now T19:42: T22:00: T19:50: T19:55:00 Today’s Stock News ARD …….. Semantic representation

PGNET, Liverpool JMU, June 2005 Semantic storage Blackboard or Non-blackboard? High coupling – Blackboard? Low coupling - distributed architecture? Communication Via central blackboard? Message passing between modules?

PGNET, Liverpool JMU, June 2005 Decision-making (fusion & synchronisation) Rule-based Potential for Other AI techniques Fuzzy Logic Neural Networks Genetic Algorithms Bayesian Networks (CPNs)

PGNET, Liverpool JMU, June 2005 Distributed processing DACS (Fink et al. 1995, 1996) Open Agent Architecture (OAA) (Cheyer et al. 1998, OAA 2004) JATLite (Kristensen 2001, Jeon et al. 2000) JavaSpaces (Freeman 2004) CORBA (Vinoski 1993).NET (Fay 2003)

PGNET, Liverpool JMU, June 2005 Intelligent Multimedia Distributed Platforms Blackboard Model: Ymir (Thórisson 1999) CHAMELEON (Brøndsted et al. 1998, 2001) Smartkom (Bühler et al. 2002, Wahlster et al. 2001, SmartKom 2004) DARBS (Nolle et al. 2001) DARPA Galaxy Communicator (Bayer et al. 2001) Psyclone (Psyclone 2004) Spoken Image/SONAS (Ó Nualláin et al. 1994, Ó Nualláin & Smith 1994, Kelleher et al. 2000)

PGNET, Liverpool JMU, June 2005 Intelligent Multimedia Distributed Platforms Non-blackboard Model: WAXHOLM (Carlson et al. 1996) AESOPWORLD (Okada 1996) COLLAGEN (Rich et al. 1997) INTERACT (Waibel et al. 1996) Oxygen (Oxygen 2004) EMBASSI (Kirste 2001, EMBASSI 2004) MIAMM (MIAMM 2004)

PGNET, Liverpool JMU, June 2005 CHAMELEON Language & vision integration system consists of ten modules, mostly programmed in C and C++ DACS communication system used for communication Blackboard stores semantic representations produced by other modules Communication between modules achieved by exchanging semantic representations between themselves or blackboard Semantic representation in form of input, output and integration frames

PGNET, Liverpool JMU, June 2005 Architecture of CHAMELEON

PGNET, Liverpool JMU, June 2005 SmartKom User adaptive interface for human-computer interaction Mobile Public Home/Office Facilitates speech, gestures and facial expression input XML-based mark-up language, M3L, used for semantic representation Distributed multiple blackboard model

PGNET, Liverpool JMU, June 2005 Architecture of SmartKom

PGNET, Liverpool JMU, June 2005 Dialogue Manager Acts as a blackboard module Facilitates communication between other modules Synchronisation Semantic Representation Database Provides semantic representation of language and vision data Decision Making Module AI technique for a unique form of decision-making Bayesian Networks (CPNs) Neural Networks, Genetic Algorithms, Fuzzy Logic Architecture of MediaHub

PGNET, Liverpool JMU, June 2005 Architecture of MediaHub

PGNET, Liverpool JMU, June 2005 Decision Making Module

PGNET, Liverpool JMU, June 2005 Decision making in MediaHub Decisions at Input: Determining semantic content of input Fusing semantics of input (into frames/XML) Resolving ambiguity at input Decisions at Output: Synchronising language with visual output Best modality for output (i.e. language or vision)

PGNET, Liverpool JMU, June 2005 Input example “Copy all files from the ‘process control’ folder of this computer to a new folder called ‘check data’ on that computer”.

PGNET, Liverpool JMU, June 2005 Output Example P T “This is the best route from Paul’s office to Tom’s office”.

PGNET, Liverpool JMU, June 2005 Comparison to related research

PGNET, Liverpool JMU, June 2005 Potential Tools Main Programming Language Java C++ Communication.NET DACS Semantic Representation XML XHTML + Voice SMIL RDF Schema MPEG-7 EMMA

PGNET, Liverpool JMU, June 2005 Potential Tools Decision Making Tools HUGIN GUI / API (Hugin 2004) Microsoft MSBNx / MSBN3 (Kadie et al. 2001) GeNIe/SMILE (Genie 2005) Netica (Norsys 2005) Bayes Net Toolbox (BNT 2005) BUGS (BUGS 2005)

PGNET, Liverpool JMU, June 2005 Hugin Tool for implementing Bayesian Networks as CPNs (Causal Probabilistic Networks) Hugin GUI Graphical user interface to Hugin decision engine Hugin API Library implemented in C, C++, Java Allows programs to implement Bayesian Networks for decision making

PGNET, Liverpool JMU, June 2005 Bayesian Networks AKA Bayes nets, Causal Probabilistic Networks (CPNs), Bayesian Belief Networks Consists of nodes and directed edges between nodes Node represents a variable Edge represents cause-effect relationship An edge connecting two nodes A and B indicates a direct influence exists between state of A and the state of B

PGNET, Liverpool JMU, June 2005 Simple Bayesian Network ‘Diet’ and ‘Exercise’ nodes have influence over ‘Weight Loss’ node

PGNET, Liverpool JMU, June 2005 Future development Define necessary decisions Develop Bayesian decision making using Hugin API for Java Semantic storage Communication Semantic representation scheme Semantic representation database Acquire multimodal corpora for testing Test MediaHub in an existing Multimodal Platform e.g. CONFUCIUS (Ma & Mc Kevitt 2003)

PGNET, Liverpool JMU, June 2005 Conclusion An intelligent multimodal distributed platform hub called MediaHub will be developed MediaHub will interpret and generate semantic representations of multimodal input and output MediaHub will perform fusion and synchronisation of language and vision data MediaHub will provide a new method of decision making within a distributed platform hub MediaHub will be tested within an existing multimodal platform (e.g. CONFUCIUS)