Download presentation
Presentation is loading. Please wait.
Published byRoberta Gregory Modified over 9 years ago
1
MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney & Paul Mc Kevitt School of Computing and Intelligent Systems Faculty of Engineering University of Ulster, Magee Campus Derry/Londonderry Northern Ireland {Campbell-g8, TF.Lunney, P.McKevitt} @ulster.ac.uk
2
Outline Research objectives Related research Architecture of MediaHub Dataflow Semantic representation/storage Communication Decision-making in MediaHub Future development
3
Research Objectives Interpret/generate semantic representations of multimodal input/output Perform fusion and synchronisation of multimodal data (decision-making) Implement and evaluate a multimodal platform hub (MediaHub)
4
Semantic representation and storage? Communication? Decision-making? Key research problems
5
Related Research CORBA (Vinoski 1993) COLLAGEN (Rich et al. 1997) Open Agent Architecture (Cheyer et al. 1998) Chameleon (Brøndsted et al. 1998) Ymir (Thórisson 1999) Interact (Jokinen et al. 2002) SmartKom (Wahlster 2003, 2006) Psyclone (Thórisson et al. 2005) Hugin (Jensen 2001)
6
Architecture of MediaHub
8
Marked-up MultiModal Input/Output (XML) Dataflow in MediaHub Dialogue Manager MediaHub Whiteboard (EMMA) Decision-Making ModuleHugin Decision Engine
9
Semantic Representation XML used for input/output data Well established standard mark-up language Allows MediaHub to be integrated into other existing multimodal systems XML input is validated against a Document Type Definition (DTD) Using EMMA (Extensible MultiModal Annotation mark-up language) for semantic representation EMMA is a derivative of XML EMMA is suited to representing confidences relating to multimodal data (confidence tag)
10
Example XML input file 0.8 0.2 0.9 0.1 … Object 1 …
11
Semantic Storage Blackboard-based method of semantic storage Marked-up input in EMMA format stored on central whiteboard (MediaHub Whiteboard) All input/output messages in MediaHub are stored on whiteboard and can be accessed at any stage in the decision-making process Whiteboard and Dialogue Manager form kernel of MediaHub
12
Communication MediaHub uses Psyclone for distributed processing Psyclone uses OpenAIR specification for communication Modules of MediaHub communicate by passing messages through MediaHub Whiteboard Implements a publish-subscribe architecture For example, Decision-Making Module registers for messages of type *input* All messages relating to input posted on whiteboard will automatically be sent to Decision-Making Module Module registration is done in XML specification file, called PsyProbe, run automatically at start-up
13
PsySpec Example java -cp.;JavaOpenAIR.jar DMM psyclone=%host%:%port% name=%name%
14
Decision-making MediaHub employs Bayesian decision- making over multimodal data Bayesian networks developed using Hugin software tool (Jensen 2001) Networks are accessed using Hugin API (Java) A unique approach to decision-making in an intelligent multimedia distributed platform hub
15
Hugin Tool for implementing Bayesian Networks as CPNs (Causal Probabilistic Networks) Hugin GUI Graphical user interface to Hugin decision engine Hugin API Library implemented in Java Allows programs to implement Bayesian Networks for decision-making
16
Bayesian Networks AKA Bayes nets, Causal Probabilistic Networks (CPNs), Bayesian Belief Networks Consists of nodes and directed edges between nodes Node represents a variable Influence between nodes represented by edges Exercise Weigh t Loss Diet ‘Diet’ and ‘Exercise’ nodes have influence over ‘Weight Loss’ node
17
MediaHub Example Network G1-3 represents the belief that the user is referring to Objects 1-3, based on gesture input L1-3 represents the belief that the user is referring to Objects 1-3, based on language input CG1-3 and CL1-3 represent the confidence associated with G1-3 and L1-3
18
Bayesian Network Design Process 1.Characterise decision-making scenarios 2.Design Bayesian networks for decision-making scenarios 3.Use the Hugin GUI to build Bayesian networks and complete conditional probability tables 4.Run and test networks, making changes to networks and tables as required 5.Develop Java code that will open, edit and run the Bayesian network using the Hugin API
19
Decisions in MediaHub Input: Determining semantic content of input Fusing semantics of input Resolving ambiguity at input Output: Synchronising multimodal output Best modality for output
20
Input example “Copy all files from the ‘process control’ folder of this computer to a new folder called ‘check data’ on that computer”.
21
Output Example P “This is the route from Paul’s office to Tom’s office”. T
22
Conclusion An intelligent multimodal distributed platform hub called MediaHub is under development MediaHub interprets/generates semantic representations of multimodal input and output MediaHub performs fusion and synchronisation of multimodal data MediaHub provides a new method of decision-making within a distributed platform hub
23
Future development Define all necessary decisions for example scenarios Develop Bayesian decision-making using Hugin API (Java) Develop a GUI to illustrate the functionality of MediaHub Test MediaHub on example scenarios Compare MediaHub to other systems Write thesis
24
Questions ?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.