Learning regular tree languages from correction and equivalence queries Cătălin Ionuţ Tîrnăucă Research Group on Mathematical Linguistics, Rovira i Virgili.

Slides:



Advertisements
Similar presentations
Context-Free and Noncontext-Free Languages
Advertisements

Characterization of state merging strategies which ensure identification in the limit from complete data Cristina Bibire.
Theory of Computation CS3102 – Spring 2014 A tale of computers, math, problem solving, life, love and tragic death Nathan Brunelle Department of Computer.
CS 345: Chapter 9 Algorithmic Universality and Its Robustness
Lecture # 8 Chapter # 4: Syntax Analysis. Practice Context Free Grammars a) CFG generating alternating sequence of 0’s and 1’s b) CFG in which no consecutive.
1 1 CDT314 FABER Formal Languages, Automata and Models of Computation Lecture 3 School of Innovation, Design and Engineering Mälardalen University 2012.
Pushdown Automata Chapter 12. Recognizing Context-Free Languages Two notions of recognition: (1) Say yes or no, just like with FSMs (2) Say yes or no,
The Power of Correction Queries Cristina Bibire Research Group on Mathematical Linguistics, Rovira i Virgili University Pl. Imperial Tarraco 1, 43005,
Simplifying CFGs There are several ways in which context-free grammars can be simplified. One natural way is to eliminate useless symbols those that cannot.
1 Polynomial Time Probabilistic Learning of a Subclass of Linear Languages with Queries Yasuhiro TAJIMA, Yoshiyuki KOTANI Tokyo Univ. of Agri. & Tech.
Pushdown Automata Part II: PDAs and CFG Chapter 12.
1 Introduction to Computability Theory Lecture12: Decidable Languages Prof. Amos Israeli.
1 Introduction to Computability Theory Lecture11: Variants of Turing Machines Prof. Amos Israeli.
Introduction to Computability Theory
1 Introduction to Computability Theory Lecture5: Context Free Languages Prof. Amos Israeli.
1 Introduction to Computability Theory Lecture13: Mapping Reductions Prof. Amos Israeli.
CS5371 Theory of Computation
Lecture Note of 12/22 jinnjy. Outline Chomsky Normal Form and CYK Algorithm Pumping Lemma for Context-Free Languages Closure Properties of CFL.
1 CSC 3130: Automata theory and formal languages Tutorial 4 KN Hung Office: SHB 1026 Department of Computer Science & Engineering.
Decidable and undecidable problems deciding regular languages and CFL’s Undecidable problems.
Weighted Tree Transducers A Short Introduction Cătălin Ionuţ Tîrnăucă Research Group on Mathematical Linguistics, Rovira i Virgili University Pl. Imperial.
CS Master – Introduction to the Theory of Computation Jan Maluszynski - HT Lecture 4 Context-free grammars Jan Maluszynski, IDA, 2007
104 Closure Properties of Regular Languages Regular languages are closed under many set operations. Let L 1 and L 2 be regular languages. (1) L 1  L 2.
1 Module 31 Closure Properties for CFL’s –Kleene Closure construction examples proof of correctness –Others covered less thoroughly in lecture union, concatenation.
Normal forms for Context-Free Grammars
CS5371 Theory of Computation Lecture 8: Automata Theory VI (PDA, PDA = CFG)
January 23, 2015CS21 Lecture 81 CS21 Decidability and Tractability Lecture 8 January 23, 2015.
Parsing SLP Chapter 13. 7/2/2015 Speech and Language Processing - Jurafsky and Martin 2 Outline  Parsing with CFGs  Bottom-up, top-down  CKY parsing.
January 15, 2014CS21 Lecture 61 CS21 Decidability and Tractability Lecture 6 January 16, 2015.
BİL744 Derleyici Gerçekleştirimi (Compiler Design)1.
Cs466(Prasad)L8Norm1 Normal Forms Chomsky Normal Form Griebach Normal Form.
Context-Free Grammars Chapter 3. 2 Context-Free Grammars and Languages n Defn A context-free grammar is a quadruple (V, , P, S), where  V is.
Context-free Grammars
Compilation 2007 Context-Free Languages Parsers and Scanners Michael I. Schwartzbach BRICS, University of Aarhus.
1 Introduction to Parsing Lecture 5. 2 Outline Regular languages revisited Parser overview Context-free grammars (CFG’s) Derivations.
Chapter 4 Context-Free Languages Copyright © 2011 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1.
Lecture 16 Oct 18 Context-Free Languages (CFL) - basic definitions Examples.
CSE 3813 Introduction to Formal Languages and Automata Chapter 8 Properties of Context-free Languages These class notes are based on material from our.
Learning DFA from corrections Leonor Becerra-Bonache, Cristina Bibire, Adrian Horia Dediu Research Group on Mathematical Linguistics, Rovira i Virgili.
Pushdown Automata (PDA) Intro
Context-free Grammars Example : S   Shortened notation : S  aSaS   | aSa | bSb S  bSb Which strings can be generated from S ? [Section 6.1]
Context-Free Grammars Normal Forms Chapter 11. Normal Forms A normal form F for a set C of data objects is a form, i.e., a set of syntactically valid.
Context Free Grammars CIS 361. Introduction Finite Automata accept all regular languages and only regular languages Many simple languages are non regular:
Learning Automata and Grammars Peter Černo.  The problem of learning or inferring automata and grammars has been studied for decades and has connections.
Context Free Grammars. Context Free Languages (CFL) The pumping lemma showed there are languages that are not regular –There are many classes “larger”
Computability Construct TMs. Decidability. Preview: next class: diagonalization and Halting theorem.
Inferring Finite Automata from queries and counter-examples Eggert Jón Magnússon.
Context-Free and Noncontext-Free Languages Chapter 13 1.
Context Free Grammars 1. Context Free Languages (CFL) The pumping lemma showed there are languages that are not regular –There are many classes “larger”
Closure Properties Lemma: Let A 1 and A 2 be two CF languages, then the union A 1  A 2 is context free as well. Proof: Assume that the two grammars are.
 2005 SDU Lecture13 Reducibility — A methodology for proving un- decidability.
Context-Free and Noncontext-Free Languages Chapter 13.
Grammars A grammar is a 4-tuple G = (V, T, P, S) where 1)V is a set of nonterminal symbols (also called variables or syntactic categories) 2)T is a finite.
Context-Free Languages
CSC312 Automata Theory Lecture # 26 Chapter # 12 by Cohen Context Free Grammars.
Computability Review homework. Video. Variations. Definitions. Enumerators. Hilbert's Problem. Algorithms. Summary Homework: Give formal definition of.
Chapter 8 Properties of Context-free Languages These class notes are based on material from our textbook, An Introduction to Formal Languages and Automata,
1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 3 Mälardalen University 2007.
CSCI 2670 Introduction to Theory of Computing October 13, 2005.
Lecture # 31 Theory Of Automata By Dr. MM Alam 1.
Donghyun (David) Kim Department of Mathematics and Physics North Carolina Central University 1 Chapter 2 Context-Free Languages Some slides are in courtesy.
CSCI 4325 / 6339 Theory of Computation Zhixiang Chen Department of Computer Science University of Texas-Pan American.
Pushdown Automata Chapter 12. Recognizing Context-Free Languages Two notions of recognition: (1) Say yes or no, just like with FSMs (2) Say yes or no,
CS416 Compiler Design1. 2 Course Information Instructor : Dr. Ilyas Cicekli –Office: EA504, –Phone: , – Course Web.
 2005 SDU Lecture11 Decidability.  2005 SDU 2 Topics Discuss the power of algorithms to solve problems. Demonstrate that some problems can be solved.
Week 14 - Friday.  What did we talk about last time?  Simplifying FSAs  Quotient automata.
CS21 Decidability and Tractability
Lec00-outline May 18, 2019 Compiler Design CS416 Compiler Design.
Normal Forms for Context-free Grammars
Presentation transcript:

Learning regular tree languages from correction and equivalence queries Cătălin Ionuţ Tîrnăucă Research Group on Mathematical Linguistics, Rovira i Virgili University Pl. Imperial Tarraco 1, 43005, Tarragona, Spain Pl. Imperial Tarraco 1, 43005, Tarragona, Spain 5 th of July 2006 Seminar II

Outline  Motivation  Preliminaries  Learning RTL from CQs  The class of injective languages  Concluding remarks  References

Motivation In 1987, Dana Angluin [1] proposes the learning model of exact identification using queries. The algorithm, called L *, allows the learner to ask questions about a language from an oracle (also called MAT - minimally adequate teacher) and halts in polynomial time with a correct description of the language. The first attempt to extend this inference method to CFG’s belongs to Angluin as well. In the same paper, she designs the algorithm L cf and shows that in this framework CFL’s are learnable. But the learnability is proved under powerful restrictions: - the grammar should be in Chomsky normal form - the set of non-terminal symbols, along with the start symbol, are assumed to be known by the learner.

Motivation In 1992, Sakakibara [6] extends the algorithm L * to skeletal regular tree automata which are a variation of finite automata that take skeletons as input. Intuitively, skeletons are derivation trees of strings in a grammar in which internal nodes are unlabeled. Such a tree reveals only the syntactic structure associated with a string with respect to a CFG but not the concrete rules generating it. The interest in learning RTL is justified by the fact that the yield of any RTL is a CFL. In 2003, Drewes and Högberg [4] improve Sakakibara's algorithm, generalizing L* to regular tree languages. The advantages are: avoiding the dead states and minimizing the observation table as much as possible (the trade-off is that the learner is assumed to know the number of states of the recognizer to be learned).

Motivation Inspired by how children learn, Becerra, Bibire and Dediu propose in [3] the algorithm LCA with an alternative to membership queries: instead of a yes/no answer, the teacher returns a correcting string. They choose as a correction for the string s, the smallest string s' (in the lex-length order) such that ss' belongs to the target language. Even if the worst case complexity of the two algorithms is the same, LCA runs more efficient on average, mainly because of the embedded information contained in the correction queries. We propose an extension of L* which generalizes the correction queries introduced in [3] from correcting strings to correcting contexts. Since our algorithm, restricted to skeletal tree languages, can be applied to CFL’s, we focused on RTL.

Preliminaries: Trees

Preliminaries: Knuth-Bendix Orders

Correcting Context The main idea is that a child can learn faster if he is helped in a proper manner, not only responding by yes or no to his questions, but also trying to correct him if he makes a mistake. In this way, he can add the new information to the previous knowledge in order to be able to infer the correct grammar more easily. For a tree, the frontier derivative language of T with respect to t is the set: where is any deterministic bottom-up tree recog- nizer accepting the tree language T. One can speak about the frontier derivative language of a state q as being, where is such that.

Correcting Context The correcting context of a tree with respect to the tree language T and the Knuth-Bendix order on, denoted, is the minimal context of the set. In case that no such context exists ( ), we say, where θ is a symbol which does not belong to the alphabet Σ. Hence, is a function from to. Note that if and only if t is in T, and for all, implies, but the converse does not hold.

The Learner

The Teacher

The class of injective languages

Concluding Remarks What is new? we introduced the notions of correcting contexts and complete observation table, we showed that regular tree languages can be learned from correction and equivalence queries, we described the teacher’s behaviour (this wasn’t present in any of the previous work on learning from queries), we proved that injective tree languages do not need equivalence queries in order to be learned in this framework. In the future, we would like to: study other types of correcting contexts (which would allow different subclasses of regular tree languages to be learnable without equivalence queries), to extend L RTL C for inferring weighted tree transducers.

References