Pattern-Based French-to-IF Analysis Fill in the blanks IF-to-French Generation within NESPOLE! Hervé Blanchon CLIPS-IMAG

Slides:



Advertisements
Similar presentations
Le pronom en replaces a noun indicating a location, a thing, an idea, or a person.
Advertisements

BIENVENUE… À LA CLASSE DE FRANÇAIS!.
World Language Survey le 5 septembre 2013.
Les pronoms.
Français I – Leçon 2A Structures
Communication Orale Unité 4, leçon B
French Greetings and Salutations
Sports you (and those you know) do how you spend your free time what you are planning to do next weekend what you usually eat and drink and your likes/dislikes.
Les pronoms “y” et “en”.
Les adverbes….
Click on the speaker icon to listen. Then click arrow Listening Exercises For MOTIFS Module 3 for the audio CD in your textbook Exit.
Préliminaire I.
Bienvenue au New Hampshire A Restaurant Experience A campaign for a French-Friendly New Hampshire.
© Crown copyright 2011, Department for Education These materials have been designed to be reproduced for internal circulation, research and teaching or.
Bienvenue au New Hampshire A Shopping Experience A campaign for a French-Friendly New Hampshire.
Point de départ You have learned how to make affirmative and declarative statements in French. Now you will learn how to form questions and make negative.
Expressions de classe. How do you say hello in French? Bonjour!
Psycholinguistic what is psycholinguistic? 1-pyscholinguistic is the study of the cognitive process of language acquisition and use. 2-The scope of psycholinguistic.
Natural Language Understanding Difficulties: Large amount of human knowledge assumed – Context is key. Language is pattern-based. Patterns can restrict.
French 2 1. Turn in any late work or warmup (pg ), final supply & book check. 2. NOTES: Dates, weather, seasons: R2 & R3 (B & D). 3. French.
Point de départ In Leçon 2A, you learned four ways to formulate yes or no questions in French. However, many questions seek information that can’t.
Jour 2 mercredi, 13 août MOT DE PASSE: Comment ça s’écrit
Greetings, Introduction, Farewells
Vocabulaire leçon 8. Asking information questions: Follow the formula: Interrogative expression + est-ce que + subject + verb…..? Ex. Où est-ce que tu.
1. Un tableau Chalkboard 2. Un morceau de craie Chalk 3. Un bureau Teacher’s desk 4. Un ordinateur Computer 5. Une calculatrice calculator 6. Une table.
Culture Lesson 8. Meals For the French, eating is a cultural experience. Meals are not just about food, but about relaxing, chatting, and spending leisure.
Development of Automatic Speech Recognition and Synthesis Technologies to Support Chinese Learners of English: The CUHK Experience Helen Meng, Wai-Kit.
Languages & The Media, 4 Nov 2004, Berlin 1 Multimodal multilingual information processing for automatic subtitle generation: Resources, Methods and System.
Chapter 20: Natural Language Generation Presented by: Anastasia Gorbunova LING538: Computational Linguistics, Fall 2006 Speech and Language Processing.
Project topics Projects are due till the end of May Choose one of these topics or think of something else you’d like to code and send me the details (so.
Software Testing and Quality Assurance
Spoken Language Technologies: A review of application areas and research issues Analysis and synthesis of F0 contours Agnieszka Wagner Department of Phonetics,
What can humans do when faced with ASR errors? Dan Bohus Dialogs on Dialogs Group, October 2003.
Information Extraction and Ontology Learning Guided by Web Directory Authors:Martin Kavalec Vojtěch Svátek Presenter: Mark Vickers.
Information Modeling: The process and the required competencies of its participants Paul Frederiks Theo van der Weide.
BİL744 Derleyici Gerçekleştirimi (Compiler Design)1.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Outline P1EDA’s simple features currently implemented –And their ablation test Features we have reviewed from Literature –(Let’s briefly visit them) –Iftene’s.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
The Big Picture – La Grande Image! What is involved in learning French in Year 9? In Year 9 you will build on the four skills of speaking, listening, reading.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Interpreting Dictionary Definitions Dan Tecuci May 2002.
SIG IL 2000 Evaluation of a Practical Interlingua for Task-Oriented Dialogue Lori Levin, Donna Gates, Alon Lavie, Fabio Pianesi, Dorcas Wallace, Taro Watanabe,
Te Kaitito A dialogue system for CALL Peter Vlugter, Alistair Knott, and Victoria Weatherall Department of Computer Science School of Māori, Pacific, and.
PETRA – the Personal Embedded Translation and Reading Assistant Werner Winiwarter University of Vienna InSTIL/ICALL Symposium 2004 June 17-19, 2004.
Analysis for Spoken Language Translation Using Phrase-Level Parsing and Domain Action Classification Chad Langley Language Technologies Institute Carnegie.
Using Surface Syntactic Parser & Deviation from Randomness Jean-Pierre Chevallet IPAL I2R Gilles Sérasset CLIPS IMAG.
Introduction to Computational Linguistics
Automatic Cue-Based Dialogue Act Tagging Discourse & Dialogue CMSC November 3, 2006.
Motivation Facts Idea Problem ð Translation need dictionaries in large quantities and of good quality. ð Construction a new dictionary is expensive. ð.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Compiler Design Introduction 1. 2 Course Outline Introduction to Compiling Lexical Analysis Syntax Analysis –Context Free Grammars –Top-Down Parsing –Bottom-Up.
LREC 2004, 26 May 2004, Lisbon 1 Multimodal Multilingual Resources in the Subtitling Process S.Piperidis, I.Demiros, P.Prokopidis, P.Vanroose, A. Hoethker,
Compiler Construction By: Muhammad Nadeem Edited By: M. Bilal Qureshi.
Rapid Development in new languages Limited training data (6hrs) provided by NECTEC from 34 speakers, + 8 spks for development and test Romanization of.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Recent Advances in Speech Translation Systems ESSLLI-2002 Tutorial Course August 12-16, 2002 Course Organizers: Alon Lavie – Carnegie Mellon University.
Distinctively Visual. Your task Define/describe what each symbol represents. Write down the first few things that pop into your mind.
Parsing into the Interlingua Using Phrase-Level Grammars and Trainable Classifiers Alon Lavie, Chad Langley, Lori Levin, Dorcas Wallace,Donna Gates and.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
CS416 Compiler Design1. 2 Course Information Instructor : Dr. Ilyas Cicekli –Office: EA504, –Phone: , – Course Web.
Syntax Analysis Chapter 4.
Basque language: is IT right on?
CS416 Compiler Design lec00-outline September 19, 2018
Introduction CI612 Compiler Design CI612 Compiler Design.
Retrieval of audio testimonials via voice search
David Cyphert CS 2310 – Software Engineering
CS416 Compiler Design lec00-outline February 23, 2019
Lec00-outline May 18, 2019 Compiler Design CS416 Compiler Design.
Presentation transcript:

Pattern-Based French-to-IF Analysis Fill in the blanks IF-to-French Generation within NESPOLE! Hervé Blanchon CLIPS-IMAG

Outline  Introduction  Analysis: Pattern-based approach –Analyzer architecture –Steps in detail Turn splitting into SDU Topic, sub-domain detection Argument filling DA construction  Generation –Track 1: Fill in the blanks approach –Track 2: Rule-based approach  First results  Perspectives

Introduction  Handling the client's utterances French SLT modules Speech Recognition RAPHAEL French textual output Pattern-Based Analysis IFs

Introduction  Handling the agent's utterances French SLT modules Generation -Fill-in the blanks approach -Rule-based approach French textual output Speech Synthesis Euler (Mons) Sound file IFs

 Hypothesis –c: j aimerais organiser une semaine de vacances dans un parc (I would like to organize a week of holydays in a park)  IF –{c: ( )} Introduction Interchange Format give-information +disposition+reservation+trip Pred-part Main-pred Attitude Speech Act

Introduction  Hypothesis –c: j aimerais organiser une semaine de vacances dans un parc (I would like to organize a week of holydays in a park)  IF –{c: ( )} give-information +disposition+reservation+trip disposition=(, ), visit-spec=(, ), location=, duration=(, ) Top-level arguments Interchange Format

Introduction  Hypothesis –c: j aimerais organiser une semaine de vacances dans un parc (I would like to organize a week of holydays in a park)  IF –{c: ( )} give-information +disposition+reservation+trip disposition=(, ), visit-spec=(, ), location=, duration=(, ) who=identifiability= time-unit=quantity= Interchange Format Embedded arguments

Introduction  Hypothesis –c: j aimerais organiser une semaine de vacances dans un parc (I would like to organize a week of holydays in a park)  IF –{c: ( )} give-information +disposition+reservation+trip disposition=(, ), visit-spec=(, ), location=, duration=(, ) who=identifiability= time-unit=quantity= no vacationilikeweek1 park Interchange Format Values

Introduction  Hypothesis –c: j aimerais organiser une semaine de vacances dans un parc (I would like to organize a week of holydays in a park)  IF –{c: ( )} give-information +disposition+reservation+trip disposition=(, ), visit-spec=(, ), location=, duration=(, ) who=identifiability= time-unit=quantity= no vacationilikeweek1 park  Back-translation –J'aimerais organiser des vacances dans un parc pendant une semaine. (I would like to organize holydays in a park during a week.) Interchange Format

Analysis  Necessity to deal with –Insertions –Deletions –Wrong words & agreements  Arguments as the core of an IF –Constrain the possible ones through a topic –Find the realized ones among the possible ones  Phrase spotting mechanism –Regular expressions to express their realization –Minimal linguistic treatments Choice

Analysis ASR Output Splitting into SDUs SDU 1SDU 2SDU n… Topic Detection SDU j Topic i Topic i handling IFs DA Construction Arguments filling Arg k filling … … Architecture

Analysis  Simple sentences & phrases as SDU boundaries –Leading to terminal SA or SA with few continuations Affirm Affirm –oui c'est ça, bien sûr – yes that's it, of course Negate Negate –non pas du tout, non pas très bien, non – no not at all, no not very well, no Exclamation Exclamation –c'est excellent, très bien, oh – that's excellent, very good, oh Greeting Greeting –bonjour, au revoir, bonne journée – hello, good bye, have a good day Splitting turns into SDU

Analysis  Articulations as SDU boundaries –Rhetorical + pronoun or question Examples Examples –et donc je, et puis il, et j', donc on, est-ce que, quel est –and thus I, and then he, and I, so we, is …, what is  Treatment of the pronouns –When rhetorical is not explicitly stated Examples Examples –xxx pronoun yyy (the|that|if) zzz pronoun ttt SDU = xxx pronoun yyy (the|that|if) zzz pronoun ttt SDU = xxx pronoun yyy (the|that|if) zzz pronoun ttt –xxx pronoun yyy pronoun zzz SDU 1 = xxx pronoun yyy SDU 1 = xxx pronoun yyy pronoun zzz may have to be split pronoun zzz may have to be split

Analysis  Goal terminal SAfocus concept –Find a terminal SA or a focus concept –Restrict the search for instantiated arguments  Means –Keywords are associated with each topic –Early matched topic is selected Some topics Some topics  affirm, greeting, acknowledgment, exclamation, please- wait, …  accommodation, activity, attraction, package, room, trip, … Topic detection

Analysis  A function per topic (Topic2If) –Find the realized (instantiated) arguments among the possible ones –Calculate the DA –Concatenate The speaker The DA The arguments Topic handling

Analysis  A specialized function per argument (Arg2If) –Describing the possible realizations –Capturing the pertinent information Exemple Exemple Argument filling proc RoomSpec2If {inString} { } elseif {[regexp "(?:les) (\[0\-9\]+) ($fifservdico::frenchroom)(?:x|s)?" $inString lMatch lQuantity lRoom]!=0} { } elseif {[regexp "(?:les) (\[0\-9\]+) ($fifservdico::frenchroom)(?:x|s)?" $inString lMatch lQuantity lRoom]!=0} { append the_result "room-spec=(identifiability=yes, quantity=" $lQuantity ", " [fifservdico::Room2If $lRoom] ")" append the_result "room-spec=(identifiability=yes, quantity=" $lQuantity ", " [fifservdico::Room2If $lRoom] ")" } elseif {[regexp "(\[0\-9\]+) ($frenchroom)(?:x|s)?" $inString lMatch lQuantity lRoom]!=0} { } elseif {[regexp "(\[0\-9\]+) ($frenchroom)(?:x|s)?" $inString lMatch lQuantity lRoom]!=0} { append the_result "room-spec=(identifiability=no, quantity=" $lQuantity ", " [fifservdico::Room2If $lRoom] ")" append the_result "room-spec=(identifiability=no, quantity=" $lQuantity ", " [fifservdico::Room2If $lRoom] ")" }......}

Analysis  Construct the DA using –The attitudes –The realized arguments To get the main-predication and the pred-participants –Other information Subject Verbal construction (give-information, request- information, request-suggestion, suggest) Negation of the predicate … DA construction

Analysis  Hypothesis –c: j aimerais préparer une semaine de vacances dans un parc # –c: i would like to prepare a week of vacation in a park #  Domain: –visit  Arguments: –disposition, visit-spec, duration, location  DA: –give- information+disposition+reservation+trip Exemple

Generation  Fill in the blanks approach (next slides)  Rule based approach –Use (parse) the IF specification files To build dictionary and grammar skeletons To label semantically each link between a terminal symbol of the IF with its possible continuations –Steps Associate to each terminal of the IF input the list of its possible labelled continuations; Built a semantic tree of the IF according to the continuations; Translate all the terminals and structures of the IF into French words and structures; Generate a syntactically well-formed sentence in French. Approaches

Generation  Step 1: Find the normalized DA –Discarding the disposition(s)  Step 2: Find the “fill in the blank sentence” –According to some of the possible arguments  Step 2: Fill in the blanks –With the phrases generated for the arguments Fill in the blanks

Generation c:give-information+disposition+reservation+trip (disposition=(who=i, like), visit- spec=(identifiability=no, vacation), location=park, duration=(time-unit=week, quantity=1))  IF c:give-information+disposition+reservation+trip (disposition=(who=i, like), visit- spec=(identifiability=no, vacation), location=park, duration=(time-unit=week, quantity=1))  Fill in the blank sentence ifDisposition2Text organiser ifVisitSpec2text ifLocation2Text ifTime2Text ifDuration2Text.  Result –J'aimerais organiser des vacances dans un parc pendant une semaine. –I would like to organize vacations in a park during a week. Fill in the blanks

First results  4 unseen dialogues, clients turns –Transcription as References  2 settings –Manually segmented into SDU (all languages) –Automatically segmented into SDU (analysis module - French only)  5 sets –ASR alone (WER, Hypos as paraphrase of SDU) –FR-FR on ASR, FR-FR on References –FR-IT on ASR,FR-IT on References  3 graders & 3 grades –p: perfect, k: OK, b: bad(p+k=acceptable) Evaluation

First results  Fr-Fr and Fr-It are comparable with Xx-Xx and Xx- It  It-Fr is performing less than It-Xx – French generator less oriented towards agent's IFs Evaluation Monolingual (%acceptable) / on Ref/on Rec [on auto SDU] En-En / 58%/45% Ger-Ger / 31%/25%Fr-Fr54%/41%[62%/48%] It-It / 61%/48% Bilingual (%acceptable) / on Ref/on Rec [on auto SDU] En-It / 55%/43% Ger-It / 32%/27%Fr-It44%/34%[58%/44%] It-En / 47%/37% It-Ger / 47%/31%It-Fr40%/27%

Perspectives  Pattern-based analysis –Quite promising –Analyzer for second showcase under development more concepts (actions, attitudes, feature), arguments (feature, focalizer, modifier, rhetorical)  Fill in the blank generation –Towards a better coverage with greater factorization of the DA  Rule-based generation –Should be available for second showcase evaluation