Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.

Slides:



Advertisements
Similar presentations
GenArch: Automatic Product Derivation in Multi-agent Systems Product Line Elder Cirilo.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
ANALYSIS MODEL. Analysis Model Actual system development start with this. Aims to structure the system independently of the actual implementation environment.
CHART or PICTURE INTEGRATING SEMANTIC WEB TO IMPROVE ONLINE Marta Gatius Meritxell González TALP Research Center (UPC) They are friendly and easy to use.
Technical and design issues in implementation Dr. Mohamed Ally Director and Professor Centre for Distance Education Athabasca University Canada New Zealand.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Advanced Metadata Usage Daan Broeder TLA - MPI for Psycholinguistics / CLARIN Metadata in Context, APA/CLARIN Workshop, September 2010 Nijmegen.
SELBO Agent Ivan Minov University of Plovdiv “Paisii Hilendarski“
1http://img.cs.man.ac.uk/stevens Interaction Models of Humans and Computers CS2352: Lecture 7 Robert Stevens
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 8 Slide 1 System modeling 2.
CIM2564 Introduction to Development Frameworks 1 Overview of a Development Framework Topic 1.
EDEN 2007 Naples, Italy LIFELONG LEARNING TEACHERS’ NEEDS IN VIRTUAL LEARNING ENVIRONMENTS Josep Maria Boneu 1, Maria Galofré 2, Julià Minguillón 2 1 Centre.
Adding Organizations and Roles as Primitives to the JADE Framework NORMAS’08 Normative Multi Agent Systems, Matteo Baldoni 1, Valerio Genovese 1, Roberto.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
SWE Introduction to Software Engineering
Software Requirements
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
The Multi-model, Metadata-driven Approach to Content and Layout Adaptation Knowledge and Data Engineering Group (KDEG) Trinity College,
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
USE Case Model.
Introduction to Java Appendix A. Appendix A: Introduction to Java2 Chapter Objectives To understand the essentials of object-oriented programming in Java.
Malaysian Grid for Learning October DC 2004, Shanghai, China. © 2004 MIMOS Berhad. All Rights Reserved Metadata Management System DC2004: International.
Why Analysis Process Refer to earlier chapters Models what the system will do makes it easier for understanding no environment considered (hence, system.
A Scalable Application Architecture for composing News Portals on the Internet Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta Famagusta.
Public 1 © 2005 Nokia V1-Filename.ppt / yyyy-mm-dd / Initials Development Challenges of Multilingual Text-to-Speech Systems Kimmo Pärssinen
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Semantic Learning Instructor: Professor Cercone Razieh Niazi.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Requirements Reference: Chapters 5, 6, & 8. CMSC 345, Fall Objectives To introduce the concepts of user and system requirements To explain functional.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Yarmouk University Department of Computer Information Systems CIS 499 Yarmouk University Department of Computer Information Systems CIS 499 Yarmouk University.
Use Case Diagram The purpose is to communicate the system’s functionality and behaviour to the customer or end user. Mainly used for capturing user requirements.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Ontology-driven VoiceXML Dialogues Generation Marta Gatius, Meritxell González TALP Research Center, Technical University of Catalonia, Barcelona Berlin,
Working with Ontologies Introduction to DOGMA and related research.
Semantically Federating Multi- Agent Organizations R. Cenk ERDUR, Oğuz DİKENELLİ, İnanç SEYLAN, Önder GÜRCAN. AEGEANT-S Group, Ege University, Dept. of.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
AOT Lab Dipartimento di Ingegneria dell’Informazione Università degli Studi di Parma Unifying MAS Meta-Models ADELFE, Gaia & PASSI Carole Bernon, Massimo.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
A facilitator to discover and compose services Oussama Kassem Zein Yvon Kermarrec ENST Bretagne.
Agent Overview. Topics Agent and its characteristics Architectures Agent Management.
SEESCOASEESCOA SEESCOA Meeting Activities of LUC 9 May 2003.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
EEL 5937 The Bond Agent System (1) EEL 5937 Multi Agent Systems Lecture 8, Jan 30, 2003 Lotzi Bölöni.
EEL 5937 Content languages EEL 5937 Multi Agent Systems Lecture 10, Feb. 6, 2003 Lotzi Bölöni.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
June 3-6, 2003E-Society Lisbon Automatic Metadata Discovery from Non-cooperative Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science.
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
Software Architecture for Multimodal Interactive Systems : Voice-enabled Graphical Notebook.
Real-Time Systems Laboratory Seolyoung, Jeong JADE (Java Agent DEvelopment framework )
1 XML and XML in DLESE Katy Ginger November 2003.
Yes, I'm able to index audio files within Alfresco
Chapter 9 Web Services: JAX-RPC, WSDL, XML Schema, and SOAP
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
Semantic Markup for Semantic Web Tools:
LINGUA INGLESE 2A – a.a. 2018/2019 Computer-Aided Translation Technology LESSON 3 prof. ssa Laura Liucci –
Semantic Web Towards a Web of Knowledge - Projects
Presentation transcript:

Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione Parco Area delle Scienze 181/A Parma Italy

Ontologies Ontologies may be used in the context of e-learning to capture knowledge about different domains. An ontology can describe learning activities like enroll in a course, take an assessment test, join a discussion group and also the actors involved in the learning process such as students, tutors etc. A second use of an ontology regards the description of the content of the learning material.

Learning objects and metadata Many e-learning system use learning material in a standardized form of reusable learning objects. These objects can be annotated by metadata to describe the content, the technology used and so on and can be stored in a repository. The metadata used to describe the learning objects are extension of those used to describe the digital libraries, like the Dublin Core, and are standardized by IMS. To interact whit a repository of learning material the ontology should describe both the structure of the metadata and the structure of the domain that the learning objects deal with.

System Initiative Dialog Ontologies are considered the basis of the communication between agents too. It was successful with the advent of speech acts based communication languages for agents [8,9]. Communication acts are also used for agent to human dialog. The simple model is that of a single or system initiative dialog manager in which all the initiative is taken by the agent. In this case the context of the dialog can be represented by a finite state automata.

Mixed initiative dialogs Mixed initiative dialogs arise in situations in which also the human can take the control of the dialog. They are more difficult to manage because one has to deal with belief, desire and intention models. In many practical situation, the single initiative dialog manager is adequate. The agent can ask the human about the action to perform prompting for the required data and so on.

Automatic Generation of Spoken Dialog We present a system that supports an automatic generation of a spoken dialog between users and an agent based application from the ontologies used in the application domain. Our aim is to use the ontology that models the agent to agent communication also to model the speech interaction between humans and agents. In this case, the ontology is the source for the automatic generation of two grammars, one used for the speech recognition process and the other for speech generation. The application should be multilingual.

System Description The system is based on the JADE agent development platform, the Java Speech API and the IBM Via Voice tool. JADE (Java Agent Development framework) is a software framework to aid the development of agent applications in compliance with the FIPA specifications for interoperable intelligent multi- agent systems

The Java Speech API The Java Speech API incorporates speech technology into user interfaces for applets and applications based on Java technology. This API specifies a cross-platform interface to support command and control recognizers, dictation systems and speech synthesizers. We have used the IBM implementation of JSAPI that provides an interface for the Via Voice tool. IBM Via Voice is a tool for the recognition and generation of speech available in many languages. Nevertheless the system does not depend on the specific JSAPI implementation.

Speech Manager The system is based on an agent, called SpeechManager, that allows the interaction between a user and the agents of a JADE application. To manage the conversation the SpeechManager has to maintain a context. To represent this context this agent uses a finite state automaton (implemented through a JADE FSMBehaviour). Each state maintains a “conversation act”.

Conversation Acts RecoSpellAct: speech recognition in spelling mode. RecoDictAct: speech recognition in dictation mode. RecoGramAct: speech recognition on the basis of a predefined grammar. The output is the tags associated to the rules of the grammar activated during the recognition. RecoAltsAct: speech recognition on the basis of a list of alternative strings. RecoIntAct: speech recognition of an integer on the basis of an integer grammar. RecoIntSpellAct: speech recognition of an integer on the basis of the integer grammar and spelling the digits. RecoFloatAct: speech recognition of a float number on the basis of a float grammar. RecoDatetAct: speech recognition of a date on the basis of a date grammar.

Specialized grammars Specialized grammars was developed to support recognition of dates and numbers (integer and floating point both in dictation or in spelling modes). These grammars needs to be implemented for each language supported but are quite general and independent from the specific ontology.

SpeechManager The ontologies used by SpeechManager agents can be derived from the ones normally used by JADE agent systems in the different application domains. The ontology used by SpeechManager consists of two kinds of elements –Concepts: entities that an agent can handle (simple or aggregate objects). –Agent actions: the action the agents can do on the objects which are instances of defined concepts. Both Concepts and Agent Actions have slots and filler value types.

Generating the Dialog At the beginning, the SpeechManager prompts the user to choose an Agent Action. Each attribute of the action is used to prompt for the values of the involved concepts. To obtain the value of a concept, the SpeechManager recursively ask the user for the value of its attributes. Recursion terminate when the requested value is a primitive type (string, number, date etc). At the end of recognition of an Agent Action request, the SpeechManager delegates the execution of the action to an agent of the system. This delegated agent is found by querying the Directory Facilitator of the system about an agent able to perform the action.

Input modalities and lexical entries To fill Agent Action and its attributes from speech input, it is necessary to associate with each element (Agent Action, Concepts and terminal attributes) an input modality (grammar, alternatives or spelling) and, eventually, a specific grammar. It is also necessary to give the lexical entries (strings of words for the synthesizer) for the different languages that correspond to the names of the Agent Actions, Concepts and their attributes involved in the interaction with the user, and to recognize the different word tokens in the grammars (e.g., the different strings in alternatives).

Application to e-learning As a part of an a e-learning system we have developed some application to give the possibility to a student to enroll in a course, to take an assessment test or to discovery the learning material he need. In these applications the ontologies must describe the activities and actors involved in e-learning such as courses, assessment test, students, tutors and the action to perform. The metadata and the domain ontology are used by the student to search the repository to find the learning objects. The student can “navigate” the repository on the basis of the concepts describing the subject of interest.

SpeechManager: which action would you execute? (enroll in course, assessment test,……) User: I want to enroll in a course. SpeechManager: To complete the request of enroll in a course I need more information SpeechManager: what is your first name? User: Mark………… ……………………………. Part of a possible dialog between a SpeechManager agent and a user. The system supports Italian and English.

Conclusion I The method used to mapping the ontology to a dialog is very general and does not depend on the domain considered. To build a conversation agent for a new domain it is necessary only to design the ontology and to write the dictionary. Also the support to new languages is straightforward. One have only to extend the dictionary and give specialized rules to deal with numbers and dates. Now the system supports Italian and English.

Conclusion II Another problem to address is that the dialog generated is not very flexible. To give a more usable interface it is necessary to deal with synonyms and paraphrases that the user can say. On the other side a general purpose recognition grammar, even if useful for written text processing, is not equally valuable for speech recognition. Our approach is to generate a tailored grammar for each application domain. So we obtain a limited number or grammar rules and of lexical entries, improving the reliability of speech recognition and the robustness of the system.