About the use of UNL in key words, key images and key concepts transcultural analysis.

Preparing for the 2008 Beijing Olympics : The LingTour and KNOWLISTICS projects MAO Yuhang, DING Xiao-Qing, NI Yang, LIN Shiuan-Sung, Laurence LIKFORMAN, Gérard CHOLLET Presented here by Gérard CHOLLET ENST/CNRS-LTCI

Outline Rationale of the proposal Objectives  The Beijing 2008 Olympics Approaches  Multimedia, multilingual information server  Intelligent Camera  Bilingual Voice Communicator Needs and relevance  A PDA for tourists and travelling businessmen Conclusions and Perspectives

Rationale for the IP-KNOWLISTICS Logistics for knowledge in a specific domain (OG) Language independent knowledge representation and management Multimedia (text, speech, image, video) Multimodal access (text, speech, visual I/O) Distributed multimedia server accessible from mobile terminals (phone, PDA, PC,…) Initially targeted primarily at tourist applications 2008 Beijing Olympics as a field trial

Technical developments Language independent knowledge representation (using conceptual graphs and an Intermediate Representation Language like ‘Universal Networking Language’) Summarisation and reformulation of texts Generation in 12 target languages Speech synthesis and recognition VoiceXML-based interactive dialog agent ‘Intelligent camera’ with Chinese character recognition Cross-language ‘Multimodal communicator’ on a PDA Cross-language lexical access

Chinese character recognition

Intelligent camera from Tsinghua Univ.: capture, recognition, translation

Extracting text in scene images Complex color images Uncontrolled illumination Variations : size, fonts, orientation, texture Complex backgrounds, shadows

Text extraction Searching for character regions (text has uniform color)  Multi-channel decomposition  Connected components analysis  Grouping of components  Alignment analysis (number of horizontally or vertically aligned components)  Text identification (language independent features : size, alignment,…) Detection rate : 84 % False alarm rate : 5.6 %
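The middle steps of this slide (connected components analysis, grouping of components, alignment analysis) can be illustrated with a minimal sketch. It is not the Tsinghua implementation: the binary mask, the 4-connectivity, the `tolerance` parameter and the threshold of three aligned components are all assumptions chosen for the example, and the multi-channel colour decomposition is assumed to have already produced the mask.

```python
from collections import deque

def connected_components(mask):
    """Return bounding boxes (top, left, bottom, right) of 4-connected regions."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from the first unseen foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < rows and 0 <= nx < cols \
                            and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes

def horizontally_aligned_groups(boxes, tolerance=1):
    """Group boxes whose vertical centres differ by at most `tolerance` rows."""
    groups = []
    for box in sorted(boxes, key=lambda b: b[1]):  # scan left to right
        centre = (box[0] + box[2]) / 2
        for group in groups:
            reference = (group[0][0] + group[0][2]) / 2
            if abs(centre - reference) <= tolerance:
                group.append(box)
                break
        else:
            groups.append([box])
    return groups

# Hypothetical mask: three character-like blobs on one line plus one stray pixel.
MASK = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 1, 0, 1, 1, 0, 0],
    [0, 1, 1, 0, 1, 1, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0],
]
boxes = connected_components(MASK)
groups = horizontally_aligned_groups(boxes)
# Assumed rule: a group counts as a text-line candidate with 3+ aligned components.
text_lines = [g for g in groups if len(g) >= 3]
```

Here the three aligned blobs form one text-line candidate, while the isolated pixel is left out because it has no aligned neighbours.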

Cross-language Multimodal Communicator Use of a visual display (e.g. on a PDA) to mediate the dialogue between 2 persons speaking different languages. Recognition of short utterances, display of a word graph, selection of keywords, visualisation (and synthesis) of the translation of key words and groups of words. Specialised lexicon for dialog acts in typical touristic situations (in a restaurant, at the hotel, in the street, in public transport, about the Olympic games,…) UMTS access to an information server offering maps, photographs, video sequences,…

Generation in target languages Sharing of acoustic models between languages to simplify extensibility to other languages. Combination of phone models with small amounts of data. Model adaptation to user and environmental situations. (Figure labels: French, Chinese; shared acoustic models; language-specific models.)

Knowledge representation A formal language for representing the meaning of natural language sentences. UNL (Universal Networking Language) introduced to describe natural language semantics. Language-independent context indexing, possible for cross-language information retrieval. Use of conceptual hierarchy of UNL to address the inherent ambiguity of natural languages. A set of semantic relations (linking concepts together) for a structured information pattern.
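The idea of language-independent indexing for cross-language retrieval can be sketched as follows. This is only an illustration under strong assumptions: the tiny `LEXICON` mapping French and English words to shared concept identifiers is hypothetical, and real UNL encoding involves far more than word lookup.

```python
from collections import defaultdict

# Hypothetical lexicon: surface words in two languages mapped to one concept id.
LEXICON = {
    "stadium": "stadium", "stade": "stadium",
    "ticket": "ticket", "billet": "ticket",
    "hotel": "hotel", "hôtel": "hotel",
}

def concepts_of(text):
    """Map a text to the set of concept ids found in the lexicon."""
    return {LEXICON[word] for word in text.lower().split() if word in LEXICON}

def build_index(documents):
    """Inverted index from concept id to the ids of documents mentioning it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for concept in concepts_of(text):
            index[concept].add(doc_id)
    return index

def search(index, query):
    """Return documents containing every concept of the query, in any language."""
    concepts = concepts_of(query)
    if not concepts:
        return set()
    return set.intersection(*(index.get(c, set()) for c in concepts))

DOCUMENTS = {
    "en1": "ticket office at the stadium",
    "fr1": "billet pour le stade",
    "en2": "hotel near the station",
}
INDEX = build_index(DOCUMENTS)
french_query_hits = search(INDEX, "billet stade")
```

Because documents and queries are reduced to the same concept identifiers before matching, the French query retrieves both the French and the English document about stadium tickets.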

UNL representation “The cat drank the milk” can be encoded by: agt(drink(icl>do,agt>thing,… obj(drink(icl>do,agt>thing,… where agt and obj are binary semantic relations.
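A minimal sketch of how such binary semantic relations could be held in a program is given below. The node labels `drink`, `cat` and `milk` are simplified placeholders assumed for this example; they are not the full universal words with their constraint lists, which the slide shows only in part.

```python
from typing import NamedTuple

class Relation(NamedTuple):
    """One binary semantic relation: label(source, target)."""
    label: str
    source: str
    target: str

# Simplified, assumed node labels standing in for full universal words.
GRAPH = [
    Relation("agt", "drink", "cat"),   # the agent of the drinking event
    Relation("obj", "drink", "milk"),  # the object of the drinking event
]

def targets(graph, label, source):
    """All nodes linked from `source` by a relation named `label`."""
    return [r.target for r in graph if r.label == label and r.source == source]

def to_text(graph):
    """Serialise the graph one relation per line, in label(source, target) form."""
    return "\n".join(f"{r.label}({r.source}, {r.target})" for r in graph)
```

With this structure, `targets(GRAPH, "agt", "drink")` answers "who drank?" and `targets(GRAPH, "obj", "drink")` answers "what was drunk?", independently of the language the sentence came from.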

Role of semantic content representation in indexing (diagram labels: digital audio, video, textual; cross-lingual multimedia platform; user’s request; UNL encoding; user-specific information; UNL decoding)

Application architecture UMTS server Speech synthesis Access information a word graph, + a list of keywords Translation

Digital Olympic Multi-Language Information Network Service System Project

From VoiceXML to VoiceUNL. Presented here by Gérard CHOLLET ENST/CNRS-LTCI With the contribution of Christian BOITET

Outline Rationale of the proposal Objectives  Promotion of a new standard, demonstrations Approaches  An extra layer of VoiceXML Need and relevance  Multilingual Vocal Servers Integration and structuring effect Conclusions and Perspectives

Rationale for VoiceUNL Need for Language Independent Vocal Servers,  Need for a language independent knowledge representation and management formalism Principle of proposed solution: Start from UNL graphs augmented with voice-oriented semantic marks (special UWs, attributes), Generate in the target language, Voice-oriented marks become prosodic markers, Final conversion to VoiceXML 2008 Beijing Olympics as a field trial
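The last two steps of the proposed principle (voice-oriented marks become prosodic markers, then conversion to VoiceXML) might look roughly like the sketch below. It assumes that generation in the target language has already produced a list of text chunks, each carrying a set of marks, and it assumes a mark named `@emphasis` that is rendered as the `<emphasis>` element allowed inside a VoiceXML 2.0 `<prompt>`. The mark name, the chunk format and the function `to_voicexml` are illustrative assumptions, not part of the VoiceUNL proposal itself.

```python
import xml.etree.ElementTree as ET

def to_voicexml(chunks, lang):
    """Build a one-prompt VoiceXML document from (text, marks) chunks.

    `chunks` is an assumed intermediate format: generated target-language
    text pieces, each with a set of voice-oriented marks.
    """
    vxml = ET.Element("vxml", {
        "version": "2.0",
        "xmlns": "http://www.w3.org/2001/vxml",
        "xml:lang": lang,
    })
    form = ET.SubElement(vxml, "form")
    block = ET.SubElement(form, "block")
    prompt = ET.SubElement(block, "prompt")
    last = None  # most recent child element of <prompt>, if any
    for text, marks in chunks:
        if "@emphasis" in marks:
            # Assumed mapping: the mark becomes a prosodic <emphasis> element.
            last = ET.SubElement(prompt, "emphasis")
            last.text = text
            last.tail = " "
        elif last is None:
            prompt.text = (prompt.text or "") + text + " "
        else:
            last.tail = (last.tail or "") + text + " "
    return ET.tostring(vxml, encoding="unicode")

# Hypothetical generator output for an English target sentence.
CHUNKS = [
    ("The stadium is", set()),
    ("north", {"@emphasis"}),
    ("of the hotel", set()),
]
DOC = to_voicexml(CHUNKS, "en-US")
```

Keeping the marks attached to chunks until the final step means the same marked graph could be generated into another target language and still yield emphasis on the corresponding word.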

What is VoiceXML ? A recommendation of W3C (WWW Consortium) An extension of XML for vocal information servers, A set of normalised markup tags, Current tags concern language identification, voice prompting, speech synthesis, form filling, barge in, echo cancelling,… No provision to access a semantically encoded data base, Need for a UNL-type front-end Compatibility with MPEG4-SNHC (talking head)

Applications

Prosodic information in UNL Attributes that can influence the grammatical and the prosodic structure of a sentence already exist: Representations should be defined, concerning : @relaxed…?  Focus: grouping words to emphasize in a scope?  special UWs for voice characteristics…?  Expression (for face and gesture animation): special UWs/constructs?

Conclusions and Perspectives Demonstrations to be prepared within the LingTour, Normalangue and KNOWLISTICS projects First target is the Beijing 2008 Olympics Some concept-oriented formalism (such as Sowa's conceptual graphs) may be used to store knowledge before building in UNL "interlingual prelinguistic, communicative content"

Conclusions and Perspectives UNL representation of meaning of natural language sentences directly available for retrieval, indexing and knowledge extraction. UNL with multimedia contents (text, speech, image, video) and multimodal access (text, speech, visual I/O) to enrich the service for communication. Comprehensive and extensive information service on PDAs with access to UMTS and wireless LAN.