Tree Bimorphisms and Their Relevance in the Theory of Translations Cătălin-Ionuţ Tîrnăucă GRLMC, Rovira i Virgili University

Slides:



Advertisements
Similar presentations
C O N T E X T - F R E E LANGUAGES ( use a grammar to describe a language) 1.
Advertisements

Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Grammars, constituency and order A grammar describes the legal strings of a language in terms of constituency and order. For example, a grammar for a fragment.
GRAMMAR & PARSING (Syntactic Analysis) NLP- WEEK 4.
1 Statistical NLP: Lecture 12 Probabilistic Context Free Grammars.
Applied Computer Science II Chapter 2 : Context-free languages Prof. Dr. Luc De Raedt Institut für Informatik Albert-Ludwigs Universität Freiburg Germany.
Weighted Tree Transducers A Short Introduction Cătălin Ionuţ Tîrnăucă Research Group on Mathematical Linguistics, Rovira i Virgili University Pl. Imperial.
1 Foundations of Software Design Lecture 23: Finite Automata and Context-Free Grammars Marti Hearst Fall 2002.
Normal forms for Context-Free Grammars
BİL744 Derleyici Gerçekleştirimi (Compiler Design)1.
Context-Free Grammar CSCI-GA.2590 – Lecture 3 Ralph Grishman NYU.
CSC 8310 Programming Languages Meeting 2 September 2/3, 2014.
1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 7 Mälardalen University 2010.
1 Introduction to Parsing Lecture 5. 2 Outline Regular languages revisited Parser overview Context-free grammars (CFG’s) Derivations.
Computational Linguistics Yoad Winter *General overview *Examples: Transducers; Stanford Parser; Google Translate; Word-Sense Disambiguation * Finite State.
For Friday Finish chapter 23 Homework: –Chapter 22, exercise 9.
Chapter 1 Introduction Dr. Frank Lee. 1.1 Why Study Compiler? To write more efficient code in a high-level language To provide solid foundation in parsing.
Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi and introduced in Tree-adjoining grammars are somewhat similar to context-free.
Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.
Winter 2007SEG2101 Chapter 71 Chapter 7 Introduction to Languages and Compiler.
Learning to Transform Natural to Formal Language Presented by Ping Zhang Rohit J. Kate, Yuk Wah Wong, and Raymond J. Mooney.
A sentence (S) is composed of a noun phrase (NP) and a verb phrase (VP). A noun phrase may be composed of a determiner (D/DET) and a noun (N). A noun phrase.
Grammars CPSC 5135.
PART I: overview material
Learning Automata and Grammars Peter Černo.  The problem of learning or inferring automata and grammars has been studied for decades and has connections.
May 2006CLINT-LN Parsing1 Computational Linguistics Introduction Parsing with Context Free Grammars.
NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China.
Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro.
October 2005csa3180: Parsing Algorithms 11 CSA350: NLP Algorithms Sentence Parsing I The Parsing Problem Parsing as Search Top Down/Bottom Up Parsing Strategies.
1 Syntax In Text: Chapter 3. 2 Chapter 3: Syntax and Semantics Outline Syntax: Recognizer vs. generator BNF EBNF.
7. Parsing in functional unification grammar Han gi-deuc.
Copyright © Curt Hill Languages and Grammars This is not English Class. But there is a resemblance.
Chapter 5: Sequences, Mathematical Induction, and Recursion 5.9 General Recursive Definitions and Structural Induction 1 Erickson.
Parsing Introduction Syntactic Analysis I. Parsing Introduction 2 The Role of the Parser The Syntactic Analyzer, or Parser, is the heart of the front.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 4.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
What’s in a translation rule? Paper by Galley, Hopkins, Knight & Marcu Presentation By: Behrang Mohit.
Theory of Computation, Feodor F. Dragan, Kent State University 1 TheoryofComputation Spring, 2015 (Feodor F. Dragan) Department of Computer Science Kent.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Comp 311 Principles of Programming Languages Lecture 3 Parsing Corky Cartwright August 28, 2009.
Basic Parsing Algorithms: Earley Parser and Left Corner Parsing
CS412/413 Introduction to Compilers and Translators Spring ’99 Lecture 3: Introduction to Syntactic Analysis.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
Grammars Hopcroft, Motawi, Ullman, Chap 5. Grammars Describes underlying rules (syntax) of programming languages Compilers (parsers) are based on such.
Grammars CS 130: Theory of Computation HMU textbook, Chap 5.
1 A Simple Syntax-Directed Translator CS308 Compiler Theory.
11 CDT314 FABER Formal Languages, Automata and Models of Computation Lecture 7 School of Innovation, Design and Engineering Mälardalen University 2012.
CSC312 Automata Theory Lecture # 26 Chapter # 12 by Cohen Context Free Grammars.
Programming Languages and Design Lecture 2 Syntax Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Overview of Previous Lesson(s) Over View 3 Model of a Compiler Front End.
Copyright © Curt Hill Other Trees Applications of the Tree Structure.
Learning regular tree languages from correction and equivalence queries Cătălin Ionuţ Tîrnăucă Research Group on Mathematical Linguistics, Rovira i Virgili.
Donghyun (David) Kim Department of Mathematics and Physics North Carolina Central University 1 Chapter 2 Context-Free Languages Some slides are in courtesy.
NATURAL LANGUAGE PROCESSING
COMP 3438 – Part II-Lecture 6 Syntax Analysis III Dr. Zili Shao Department of Computing The Hong Kong Polytechnic Univ.
Formal grammars A formal grammar is a system for defining the syntax of a language by specifying sequences of symbols or sentences that are considered.
CS416 Compiler Design1. 2 Course Information Instructor : Dr. Ilyas Cicekli –Office: EA504, –Phone: , – Course Web.
Complexity and Computability Theory I Lecture #12 Instructor: Rina Zviel-Girshin Lea Epstein.
Syntax Analysis By Noor Dhia Syntax analysis:- Syntax analysis or parsing is the most important phase of a compiler. The syntax analyzer considers.
Natural Language Processing : Probabilistic Context Free Grammars Updated 8/07.
Chapter 1 INTRODUCTION TO THE THEORY OF COMPUTATION.
CSE202: Introduction to Formal Languages and Automata Theory
Context-Free Grammars: an overview
CS510 Compiler Lecture 4.
Training Tree Transducers
CHAPTER 2 Context-Free Languages
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Teori Bahasa dan Automata Lecture 9: Contex-Free Grammars
Dekai Wu Presented by David Goss-Grubbs
Discrete Maths 13. Grammars Objectives
Presentation transcript:

Tree Bimorphisms and Their Relevance in the Theory of Translations Cătălin-Ionuţ Tîrnăucă GRLMC, Rovira i Virgili University CLG 08, Madrid, 28/06/2008

Outline Motivation Tree transformations Ways to define tree transformations What is a tree bimorphism? What can we do with a bimorphism? Conclusions References CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Motivation: why using trees in NLP? Strings and finite-state machines are extremely useful in speech recognition, text recognition, etc. NOT suitable for translations between natural languages: don’t capture syntax-sensitive transformations; don’t execute certain reorderings of parts of sentences. Hence, researchers from computational linguistic community have shown an increasing interest in more powerful formalisms, ones that can model trees and tree transformations. The new field of syntax-based machine translation was established: (Knight and Graehl 2005). CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Motivation: formal properties required Accuracy is an important issue in machine translation, so the whole theory should rely on a solid mathematical background (formal languages). A perfect model: (Knight 2008), should have at least: expressiveness (reordering parts of sentences, i.e., local rotation), inclusiveness (generalizing the finite state machines), modularity (closure under composition, i.e., breaking the initial problem into smaller pieces easier to solve), and teachability (efficient training). CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Motivation: defining tree transformations Many of the formalisms proposed until now fail in at least one of the above criteria. We mention the weak and strong parts of some of them, as well as the relation between them. We will see (something about and types of): tree transducers (formal language theory), tree homomorphisms (formal language theory), synchronous grammars (linguistics), and TREE BIMORPHISMS (formal language theory). We speak more about modularity: think of what success finite-state machines had because of the noisy-channel concept! CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Tree transformations: first trees Ranked alphabet  : symbols of a given rank, i.e., number of branches (if not unique you can rename); represent parts of sentences (NP, VP) Leaf alphabet X: usual alphabets of “words” called variables (Spanish, Russian) T  (X): set of all trees Tree language: subset of T  (X) Yield: concatenated sequence of leaf variables read from left to right NP  ={S/2, NP/1, NP/3 N/0,VP/2,V/1} X=Spanish alphabet Yield = Vamos a la playa S NP N a VP VNP la playa Vamos CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Tree transformations and translations Tree transformation  : a collection of pairs of parse (syntax) trees of natural language sentences. Its translation  (  ): a set of pairs of words from the two natural languages considered: just take the yield of such a pair of trees. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations Input English S D N NP VP V s  T  (X) Thecastleburns Output Romanian S V VP NP N Ardecastelul t  T  (Y)  transforms s into t, i.e., (s,t)   X= English words  ={S/2,NP/2,VP/1,NP/1,D/1, N/1} T  (X)= English parse trees Y= Romanian words  ={S/2,NP/2,VP/1,NP/1, N/1} T  (Y)=Romanian parse trees yd(s) is translated into yd(t) 

Define tree transformations: tree transducers A tree transducer uses a set of states to encode information, and given an input tree over the input ranked alphabet, computes a (set of) output tree(s) over the output ranked alphabet in two ways: Top-down: starts at root, finishes at leaves; Bottom-up: starts at leaves, finishes at root. Many types introduced (in general, modularity doesn’t hold or cannot be proved), at least two look very promising: Extended top-down tree transducers: (Knight and Graehl 2005) Multi bottom-up tree transducers: (Maletti, Engelfriet and Lilin 2008), (Maletti 2007) Tree Transducer Input tree Output tree CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Define tree transformations: tree homomorphisms (I) A tree homomorphism  :T  (X)  T  (Y) is: a mapping which, based on certain rules defined for every input symbol, transforms recursively an input tree into a (totally or not) different output tree, or A top-down tree transducer with one state. Defines a simple tree transformation   ={(s,  (s))  s  T  (X)}; usually one just identifies   with  treating  as a relation. We need auxiliary variables  to indicate an occurrence of a subtree in a tree during the processing. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Define tree transformations: tree homomorphisms (II)  :T  (X)  T  (Y) is defined by the rules:  (x)=  X (x): each x in the input replaced by a tree  X (x) from output;  (c)=  0 (c): each constant c in the input replaced by a tree  0 (x);  (f(t 1,…,t m ))=  m (f)(  1   (t 1 ),…,  m   (t m )): each root f in the input tree replaced by a tree  m (f) from the output, where auxiliary variables appear as leaf symbols; CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations X={in, blue}  ={NP/3,ADJ/1,D/0} T  (X)= English parse trees Y= {the, red, pen, out, of}  ={VP/4,N/3,D/1} T  (Y)=English parse trees   3 (NP)=VP(  2,  3,  1, of)`  1 (ADJ)=N(big,  1, pen)  0 (D)=D(the)  X (in)=out  X (blue)=red NP in D ADJ blue VP Input tree s in T  (X) D ofN the out bigred pen Output tree  (t) in T  (Y)  3 (NP)= VP 22 of 33 11 D the 33 22 11  X (in)=out N big 11 pen  1 (ADJ)=  X (bleu)=red

Define tree transformations: tree homomorphisms (III) Several types of tree homomorphisms exist depending on the restrictions imposed on each tree  m (f) and implicitly on auxiliary variables: Linear: in each  m (f) no copy of a subtree is allowed (no  i appears twice or more); Non-deleting: in each  m (f) no subtree information is lost (each  i appears at least once); Strict: no  m (f) can be reduced to an auxiliary variable  i ; Quasi-alphabetic: linear, non-deleting, strict, each constant is mapped to a constant or a tree of height 1 (all leaves are symbols from the output leaf alphabet), and each  m (f) is a tree of height 1 in which the subtrees are reordered and symbols from the output leaf alphabet may appear as leaves. H: the class of all tree homomorphisms. lH, nH, sH, qH: the corresponding restricted class. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Define tree transformations: synchronous grammars (I) Use the idea of synchronous rewriting: two formal grammars work in parallel, productions being linked by some relation and applied synchronously. Pairs of recursively related words are generated simultaneously. Syntax-directed translators, first used in compilers, use the power of context-free rewriting and can be viewed as a three-step process one of which is a tree transformation: 1. For a given input sentence u, construct a parse tree for u. 2. Transform the parse tree into a tree in the output grammar. 3. Take the yield of the output tree as a translation for u. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations S  VP 1 P 2 you VP 3 ; P 2 VP 3 VP 1 P  before ; antes de VP  speak ; hablar VP  think ; pensar S VP S pensar antes de beforespeak VP P hablar VP, P you think

Define tree transformations: synchronous grammars (III) Many types of syntax-directed translations: Syntax Directed Translation Schemata: (Aho and Ullman 1972), Synchronous CFGs: (Satta and Peserico 2005), Inversion Transduction Grammars: (Wu 1997), Tree-to-string models: (Yamada and Knight 2001), Hierarchical Phrase-Based Models: (Chiang 2007), etc Beyond the generative capacity of CF rewriting (multi-level rules): Synchronous Tree Substitution Grammars: (Shieber 2004). Involve discontinuous constituents: Synchronous TAGs: (Shieber and Schabes 1990), Generalized Multitext Grammars: (Melamed et all 2004). CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Define tree transformations: synchronous grammars (III) Synchronous grammars derivations can be viewed as pairs of trees, just as usual derivations in formal grammars can be viewed as trees. From the way of defining them, we can see that they: easily capture syntax-sensitive transformations, execute local rotations, and are in general trainable. Unfortunately no modularity results were known or could be proved until recently: (Shieber 2004). What helped us improve the modularity? Answer: TREE BIMORPHISMS. Why? Don’t be impatient and don’ fell asleep. It follows right now. This is the best part and not too much mathematics. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

What is a tree bimorphism? (I) A mathematical, nice alternative way to define tree transformations. Excellent tool for proving mathematical properties of classes of tree transformations. Consists of two tree homomorphisms defined on the same (regular) tree language, i.e., domains of tree transducers that computes a partial identity function. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

What is a tree bimorphism? (II) B(H 1,Rec,H 2 ): a class of tree bimorphisms. B(H 1,Rec,H 2 ): the class of tree transformations defined by B. B=( ,R,  ): a tree bimorphism from the class B, with  H 1, R  Rec,  H 2. B(lnH,Rec,qH): the class of all tree bimorphisms in which the first tree homomorphism component is linear and non-deleting, the second is a quasi-alphabetic tree homomorphism. t  R  T  (Z)  T  (Y) T  (X)   (t)  (t)   Common abstract language: Interlingua Output language Input language transforms into yd(  (t)) yd(  (t)) is translated into CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

What can we do with a bimorphism? Concept very similar with synchronous grammars: from derivation trees to derived trees. This way, we can encode the derivations into the common tree language, and by using the tree homomorphisms to get the derivations in input/output grammars. In 2004, Shieber was the first one who linked synchronous grammars and tree bimorphisms; he was trying to improve modularity and other mathematical properties of classes of tree transformations defined by synchronous grammars. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Results obtained The tree bimorphisms B(lnH,Rec,lnH): not closed under composition: (Maletti 2007); define the same class of tree transformations as STSGs: (Shieber 2004). The tree bimorphisms B(lnsH,Rec,lnsH): not closed under composition: (Arnold and Dauchet 1982); define the same class of tree transformations as a special STAG: (Shieber 2006). The (quasi-alphabetic) tree bimorphisms B(qH,Rec,qH): closed under composition: (Steinby and Tîrnăucă 2007); define the same translations as STDSs: (Steinby and Tîrnăucă 2007), SCFGs and ITGs: (Tîrnăucă 2007); the tree transformations are encoded in their bimorphism counterpart. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

Conclusions; future work Properties of/ modeling tree transformations: a new challenge in machine translation. Modularity, or closure under composition, a big issue. There are several ways to define tree transformations. One connects all of them and helps us to prove modularity: tree bimorphisms. We saw some results obtained recently. What about other useful properties of synchronous grammars, other synchronous grammars and other tree bimorphism? Can we obtain more improvements? CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

References (I) Aho, Alfred and Jeffrey Ullman (1972). The theory of parsing, translation, and compiling, Volume I: Parsing. New Jersey: Prentice Hall Professional Technical Reference. Arnold, André and Max Dauchet (1982). “Morphismes et bimorphismes d’arbres”, Theoretical Computer Science 20: 33–93. Chiang, David (2007). “Hierarchical phrase-based translation”, Computational Linguistics 33(2):201–228. Engelfriet, Joost, Eric Lilin and Andreas Maletti (2008). “Extended multi bottom-up tree transducers”. Manuscript. URL: Knight, Kevin (2008). “Capturing practical natural language transformations”. Manuscript. URL: Knight, Kevin and Jonathan Graehl (2005). “An overview of probabilistic tree transducers for natural language processing”. In: Alexander F. Gelbukh, ed., Computational Linguistics and Intelligent Text Processing 6th International Conference, CICLing 2005, Proceedings, vol of LNCS. Berlin: Springer-Verlag, pp. 1–24. Maletti, Andreas (2007). “Compositions of extended top-down tree transducers”. In: Remco Loos, Szilárd Z. Fazekas and Carlos Martín Vide, eds., Proceedings of the 1st International Conference on Language and Automata Theory and Applications, LATA 2007, vol. 35/07 of GRLMC Reports. Tarragona: Universitat Rovira i Virgili, pp. 379–390. Melamed, I. Dan (2003). “Multitext grammars and synchronous parsers”. In: HLT-NAACL 2003, Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1. Morristown: Association for Computational Linguistics, pp. 79–86. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

References (II) Satta, Girogio and Enoch Peserico (2005). “Some computational complexity results for synchronous context-free grammars”. In: HLT/EMNLP 2005, Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. Morristown: Association for Computational Linguistics, pp. 803–810. Shieber, Stuart M. (2004). “Synchronous grammars as tree transducers”. In: Proceedings of the TAG+7: Seventh International Workshop on Tree Adjoining Grammar and Related Formalisms, 2004, pp. 88–95. Shieber, Stuart M. (2006). “Unifying synchronous tree-adjoining grammars and tree transducers via bimorphisms”. In: EACL 2006, 11st Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computer Linguistics, pp Steinby, Magnus and Cătălin I. Tîrnăucă (2007). “Syntax-directed translations and quasi- alphabetic tree bimorphisms”. In Jan Holub and Jan Zdárek, eds,, Implementation and Application of Automata, 12th International Conference, CIAA Berlin: Springer-Verlag, pp. 265–276. Tîrnăucă, Cătălin I. (2007). “Synchronous context-free grammars by means of tree bimorphisms”. In: Gemma Bel Enguix and Maria Dolores Jiménez Lopez, eds., Proceedings of the 1st International Workshop on Non-Classical Formal Languages in Linguistics (ForLing 2007), vol. 36/07 of GRLMC Reports. Tarragona: Universitat Rovira i Virgili, pp. 97–107. Yamada, Kenji and Kevin Knight (2001). “A syntax-based statistical translation model”. In: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics. Morristown: Asociation of Computational Linguistics, pp Wu, Dekai (1997). “Stochastic inversion transduction grammars and bilingual parsing of parallel corpora”, Computational Linguistics 23(3): 377–403. CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations

That’s all folks! CLG 08, Madrid, 28/06/2008: C. Tîrnăucă, Tree bimorphisms and their relevance in translations Thank you! Arigato Graçias! Moltes gràcies! Mulţumesc! Mila esker! Kiitos Grazie! Spasiva! 謝謝你。