Conjunctive Grammars and Synchronized Alternating Pushdown Automata October 2009 Tamar Aizikowitz Joint work with Michael Kaminski Technion – Israel Institute.

Slides:

Advertisements

Similar presentations

Simplifications of Context-Free Grammars

Advertisements

Variations of the Turing Machine

PDAs Accept Context-Free Languages

Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.

Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.

Chapter 4 An Introduction to Finite Automata

By John E. Hopcroft, Rajeev Motwani and Jeffrey D. Ullman

Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×

Programming Language Concepts

1 Outline relationship among topics secrets LP with upper bounds by Simplex method basic feasible solution (BFS) by Simplex method for bounded variables.

CSCI 3130: Formal Languages and Automata Theory Tutorial 5

Solve Multi-step Equations

Chapter 11: Models of Computation

Turing Machines.

Bellwork Do the following problem on a ½ sheet of paper and turn in.

Lexical Analysis Arial Font Family.

Cook-Levin Theorem Proof and Illustration

Hector Miguel Chavez Western Michigan University.

How to convert a left linear grammar to a right linear grammar

Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.

CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.

1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.

6.4 Best Approximation; Least Squares

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

Finite-state Recognizers

Conjunctive Grammars and Alternating Automata Tamar Aizikowitz and Michael Kaminski Technion – Israel Institute of Technology WoLLIC 2008 Heriot-Watt University.

Formal Language, chapter 10, slide 1Copyright © 2007 by Adam Webber Chapter Ten: Grammars.

1 Let’s Recapitulate. 2 Regular Languages DFAs NFAs Regular Expressions Regular Grammars.

Essential Cell Biology

PSSA Preparation.

Chapter Eleven: Non-Regular Languages

Essential Cell Biology

Energy Generation in Mitochondria and Chlorplasts

1 Decidability continued…. 2 Theorem: For a recursively enumerable language it is undecidable to determine whether is finite Proof: We will reduce the.

1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)

Distributed Computing 5. Snapshot Shmuel Zaks ©

The Pumping Lemma for CFL’s

Linear Conjunctive Grammars and One-turn Synchronized Alternating Pushdown Automata Formal Grammar 2009, Bordeaux Tamar Aizikowitz* and Michael Kaminski.

Theory of Computation CS3102 – Spring 2014 A tale of computers, math, problem solving, life, love and tragic death Nathan Brunelle Department of Computer.

Finite State Machines for Strings over Infinite Alphabets F. Neven, T. Schwentick and V. Vianu Automata Seminar - Spring 2007 Tamar Aizikowitz ACM Transactions.

C O N T E X T - F R E E LANGUAGES ( use a grammar to describe a language) 1.

Closure Properties of CFL's

1 Introduction to Computability Theory Lecture12: Decidable Languages Prof. Amos Israeli.

Introduction to Computability Theory

1 Introduction to Computability Theory Lecture7: PushDown Automata (Part 1) Prof. Amos Israeli.

CS Master – Introduction to the Theory of Computation Jan Maluszynski - HT Lecture 4 Context-free grammars Jan Maluszynski, IDA, 2007

Normal forms for Context-Free Grammars

CS5371 Theory of Computation Lecture 8: Automata Theory VI (PDA, PDA = CFG)

CSR 2011, St. Petersburg, Russia

Pushdown Automata (PDAs)

1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 11 Midterm Exam 2 -Context-Free Languages Mälardalen University 2005.

Closure Properties Lemma: Let A 1 and A 2 be two CF languages, then the union A 1  A 2 is context free as well. Proof: Assume that the two grammars are.

CS 208: Computing Theory Assoc. Prof. Dr. Brahim Hnich Faculty of Computer Sciences Izmir University of Economics.

1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.

Foundations of (Theoretical) Computer Science Chapter 2 Lecture Notes (Section 2.2: Pushdown Automata) Prof. Karen Daniels, Fall 2010 with acknowledgement.

Donghyun (David) Kim Department of Mathematics and Physics North Carolina Central University 1 Chapter 2 Context-Free Languages Some slides are in courtesy.

CSCI 4325 / 6339 Theory of Computation Zhixiang Chen Department of Computer Science University of Texas-Pan American.

Grammar Set of variables Set of terminal symbols Start variable Set of Production rules.

Formal Languages, Automata and Models of Computation

Pushdown Automata.

Properties of Context-Free Languages

Deterministic FA/ PDA Sequential Machine Theory Prof. K. J. Hintz

CSE 105 theory of computation

PDAs Accept Context-Free Languages

Deterministic PDAs - DPDAs

The Pumping Lemma for CFL’s

… NPDAs continued.

CSE 105 theory of computation

CSE 105 theory of computation

Presentation transcript:

Conjunctive Grammars and Synchronized Alternating Pushdown Automata October 2009 Tamar Aizikowitz Joint work with Michael Kaminski Technion – Israel Institute of Technology

Context-Free Languages Combine expressiveness with polynomial parsing appealing for practical applications. Possibly the most widely used language class in Computer Science. At the theoretical basis of Programming Languages, Computational Linguistics, Formal Verification, Computational Biology, and more. of 43 2

Extended Models of 43 3 Goal: Models of computation that generate a slightly stronger language class without sacrificing polynomial parsing. Why? Such models seem to have great potential for practical applications. In fact, several fields (e.g., Computational Linguistics) have already voiced their need for a stronger language class.

Conjunctive Grammars Conjunctive Grammars (CG) [Okhotin, 2001] are an extension of context-free grammars. Have explicit intersection rules S (A & B) (w & w) w Semantics: L(A & B) = L(A) L(B) Recall: Context-free languages are not closed under intersection stronger language class Retain polynomial parsing practical applications of 43 4

Synchronized Alternating PDA of 43 5 Synchronized Alternating Pushdown Automata (SAPDA) [Aizikowitz Kaminski, 2008] extend PDA. Stack modeled as tree all braches must accept Uses a limited form of synchronization to create localized parallel computations. First automaton counterpart shown for Conjunctive Grammars. One-turn SAPDA shown to be equivalent to Linear CG [Aizikowitz Kaminski, 2009], mirroring the classical equivalence between one-turn PDA and LG.

Outline Model Definitions Conjunctive Grammars Synchronized Alternating Pushdown Automata (SAPDA) Main Results Equivalence Results Linear Conjunctive Grammars and One-turn SAPDA Conjunctive Languages Characterization of Language Class A Simple Programming Language Mildly Context Sensitive Languages Summary and Future Directions of 43 6

Conjunctive Grammars Synchronized Alternating Pushdown Automata Model Definitions

Conjunctive Grammars A CG is a quadruple: G=(V, Σ, P, S ) Rules: A (α 1 & & α n ) s.t. A V, α i (V Σ) * Examples: A (aAB & Bc & aD) ; A abC Derivation steps: s 1 As 2 s 1 (α 1 & & α n )s 2 where A (α 1 & & α n ) P s 1 (w & & w)s 2 s 1 w s 2 where w Σ * non-terminals terminals derivation rules start symbol n=1 standard CFG of 43 8

Grammar Language of 43 9 Language: L(G) = {w Σ * | S * w} Informally: All terminal words w derivable from the start symbol S. Note: As (, ), and & are not terminal symbols, all conjunctions must be collapsed in order to derive a terminal word. Semantics: (A & B) * w iff A * w B * w Therefore, L(A & B) = L(A) L(B).

Example: Multiple Agreement Example: following is a CG for the multiple- agreement language {a n b n c n | n } : C Cc | D ; D aDb | ε L(C) = {a n b n c m | n,m } A aA | E ; E bEc | ε L(A) = {a m b n c n | n,m } S (C & A) L(S ) = L(C) L(A) S (C & A) (Cc & A) (Dc & A) (aDbc & A) (abc & A) (abc & abc) abc of 43 10

Synchronized Alternating Pushdown Automata Synchronized Alternating Pushdown Automata (SAPDA) are an extension of classical PDA. Transitions are made to conjunctions of ( state, stack-word ) pairs, e.g., δ( q, σ, X ) = { ( p 1, XX ) ( p 2,Y ), ( p 3, Z ) } Note: if all conjunctions are of one pair only, the automaton is a regular PDA. of non-deterministic model = many possible transitions

SAPDA Stack Tree of The stack of an SAPDA is a tree. A transition to n pairs splits the current branch into n branches. Branches are processed independently. Empty sibling branches can be collapsed if they are synchronized = are in the same state and have read the same portion of the input. B AC D q p B A q δ(q,σ,A)={(q,A) (p,DC)} B q collapse B εε q q

SAPDA Formal Definition of An SAPDA is a sextuple A = ( Q, Σ, Γ, q 0, δ, ) Transition function: δ(q,σ, X ) {(q 1,α 1 ) (q n,α n ) | q i Q, α i Γ *, n } Configuration: a labeled tree states terminalsinitial state transition function initial stack symbol stack symbols B AC D q p B ( q, ab, A ) ( p, b, DC ) remaining input

SAPDA Computation and Language of Computation: Each step, a transition is applied to one stack-branch If a stack-branch is empty, it cannot be selected Synchronous empty sibling branches are collapsed Initial Configuration: Accepting configuration: Language: L(A) = {w Σ * | q Q, (q 0,w, ) * (q,ε,ε)} Note: all branches must empty ~ must agree. q0q0 ( q 0, w, ) ε q ( q, ε, ε ) have the same state and remaining input

Reduplication with a Center Marker of The reduplication with a center marker language (RCM), { w$w | w Σ * }, describes structures in various fields, e.g., Copying phenomena in natural languages:deal or no deal, boys will be boys,is she beautiful or is she beautiful? Biology: microRNA patterns in DNA, tandem repeats We will construct an SAPDA for RCM. Note: it is not known whether reduplication without a center marker can be derived by a CG.

Example: SAPDA for RCM of We consider an SAPDA for { w$uw | w,u Σ * }, which can easily be modified to accept RCM. The SAPDA is especially interesting, as it utilizes recursive conjunctive transitions. Construction Idea: if σ in the n th letter before the $, check that the n th letter from the end is also σ. σσ$ n - 1 wuw

Construction of SAPDA for RCM of A=(Q,{a,b,$},{,#},q 0,δ, ) Q = {q 0,q e } {q σ i | σ {a, b}, i {1, 2}} δ(q 0,σ, ) = {(q σ 1, ) (q 0, )} δ(q σ 1,τ, X) = {(q σ 1,#X)} δ(q 0,$, ) = {(q e, ε)} δ(q σ 1,$, X) = {(q σ 2, X)} δ(q σ 2,τ σ, X) = {(q σ 2, X)} δ(q σ 2,σ, X) = {(q e, X), (q σ 2, X)} δ(q e,τ,#) = {(q e, ε)} δ(q e, ε, ) = {(q e, ε)} recursively open branch to verify σ is n th from the $ and from the end count symbols between σ and $ $ reached, stop recursion $ reached, stop counting symbols look for σ guess that this is the σ we are looking for, or keep looking count symbols from σ to the end all done: empty stack

Computation of SAPDA for RCM of q0q0 ε q0q0 qa1qa1 # ε qa1qa1 q0q0 qb1qb1 ε q0q0 # qa1qa1 # qb1qb1 qb2qb2 qa2qa2 qb2qb2 ε qeqe qeqe qeqe qeqe ε ε ε qeqe qeqe qeqe a b b $ b a b b word accepted!

Equivalence Results Linear CG and One-turn SAPDA Main Results

Equivalence Results of Theorem 1. A language is generated by an CG if and only if it is accepted by an SAPDA. The equivalence is very similar to the classical equivalence between CFG and PDA. The proofs of the equivalence are extended versions of the classical proofs.

only if Proof Sketch of Given an CG, we construct an single-state SAPDA using an extension of the classical construction. A simulation of the derivation is run in the stack: If the top stack symbol is a non-terminal, it is replaced with the r.h.s. of one of its rules. If the top stack symbol is a terminal, it is emptied while reading the same terminal symbol from the input. A correlation is achieved between the stack contents and the grammar sentential forms.

only if Proof Simulation of S * w A α w(uBβ & )α uxCγβ ux(v & & v)γβ * ux v γβ A α β B w u x v z u switch variable with r.h.s of rule empty terminals γ C x *

if Proof Sketch of Given an SAPDA we construct an CG. The proof is an extension of the classical one. However, due to the added complexity of the extended models, it is more involved. Therefore, we wont get into it now…

Single-state SAPDA of The if proof translates a general SAPDA into a Conjunctive Grammar. The only if proof translates a Conjunctive Grammar into a single-state SAPDA. Corollary: Single-state SAPDA and multi-state SAPDA are equivalent. This characterizes classical PDA as well.

Linear CG and One-turn SAPDA Linear Conjunctive Grammars (LCG) [Okhotin, 2001] are an interesting sub-class of CG. Have especially efficient parsing [Okhotin, 2003] Equivalent to Trellis Automata [Okhotin, 2004] A conjunctive grammar is linear if all conjuncts in all rules contain at most one variable. We define a sub-class of SAPDA, one-turn SAPDA, and prove equivalence to LCG. of 43 25

Motivation of Linear Conjunctive Grammars as a sub-family of CG are defined analogously to Linear Grammars as a sub-family of Context-free Grammars. It is a well known result [Ginsburg et.al, 1966] that Linear Grammars are equivalent to one-turn PDA. A turn is a computation step where the stack height changes from increasing to decreasing. A one-turn PDA is a PDA s.t. all accepting computations, have only one turn.

One-turn SAPDA of We introduce a sub-family of SAPDA, one-turn SAPDA, analogously to one-turn PDA. An SAPDA is one-turn if all stack-branches make exactly one turn in all accepting computations. Note: the requirement of a turn is not limiting as we are considering acceptance by empty stack.

Informal Definition of Assume all transitions on a stack-branch and its sub-tree are applied consecutively (reordering if needed). We refer to this segment of the computation as the relevant transitions w.r.t. the branch. An SAPDA is one-turn if for every branch, the relevant transitions can be split into three phases: (1) Increasing transitions applied to the stack-branch. (2) A conjunctive transition followed by transitions applied to the branches in the sub-tree and then a collapsing transition of the sub-tree. (3) Decreasing transitions on the stack-branch.

Informal Definition Continued… of Note: if the automaton is a classical PDA, then there is only one branch with no second phase (no conjunctive transitions), and therefore the automaton is a classical one-turn PDA. phase 1phase 3 phase 2

Equivalence Results of Theorem 2. A language is generated by an LCG if and only if it is accepted by a one-turn SAPDA. This result mirrors the classical equivalence between Linear Grammars and one-turn PDA, strengthening the claim of SAPDA as a natural automaton counterpart for CG. Corollary: One-turn SAPDA are equivalent to Trellis automata.

Characterization of Language Class A Toy Programming Language Mildly Context Sensitive Languages Conjunctive Languages

Generative Power of CG can derive all finite conjunctions of CF languages as well as some additional languages (e.g., RCM). Linear CG can derive all finite conjunctions of linearCF languages as well as some additional languages (e.g., RCM). However, there are some CF languages that cannot be derived by a Linear CG. LG CFG LCG CG

Closure Properties of Union, concatenation, intersection, Kleene star Proven quite easily using grammars Homomorphism Inverse homomorphism Well touch on the proof of this in the next slide… Linear CL are closed under complement. It is an open question whether general CG are closed under complement. ?

Inverse Homomorphism of ModelTechniqueLengthLinear CG CGNon-classical and complicated 13 pagesRequires separate proof SAPDAIntuitive extension of classical proof 1 pageSame proof works for one-turn SAPDA For the grammar based proof, see [Okhotin, 2003].

Decidability Problems of Linear CG Membership: O(n 2 ) time and O(n) space. General CG Membership: O(n 3 ) time and O(n 2 ) space. Emptiness, finiteness, equivalence, inclusion, regularity

A Toy Programming Language of A program in PrintVars has three parts: Definition of variables Assignment of values Printing of variable values to the screen Example: VARS a, b, c VALS b = 2, a = 1, c = 3 PRNT b a a c b Output:

PrintVars Specification of A PrintVars program is well-formed if: (1) It has the correct structure (2) All used variables are defined (3) All defined variables are used (4) All defined variables are assigned a value (5) All variables assigned a value are defined Item (1) is easily defined by a CF Grammar. However, items (2) – (5) amount to a language reducible to RCM, which is not CF.

A (partial) CG for PrintVars of S ( structure & defined_used & used_defined & defined_assigned & assigned_defined ) structure vars vals prnt vars VARS … ; vals VALS … ; prnt PRNT … defined_used VARS check_du check_du ( a X vals X a X & a check_du ) | ( b X vals X b X & b check_du ) | | vals X ) X a X | | z X | 0 X | | 9 X | = X | ε

Mildly Context Sensitive Languages of Computational Linguistics pursues a computational model which exactly describes natural languages. Originally, context-free models were considered. However, non-CF natural language structures led to interest in a slightly extended class of languages – Mildly Context-sensitive Languages (MSCL). Several formalisms (e.g., Tree Adjoining Grammars) are known to converge to MCSL. [Vijay-Shanker, 1994]

Conjunctive Languages and MCSL of We explore the correlation between Conjunctive Languages and MCSL. MCSL are loosely categorized as follows: (1)They contain the context-free languages (2)They contain multiple-agreement, cross-agreement and reduplication (3) They are polynomially parsable (4) They are semi-linear a CG exists for the language { ba 2 ba 4 ba 2n b | n } Not an exact characterization of natural languages, but still with applicative potential.

Summary Future Directions Concluding Remarks

Summary of Conjunctive Languages are an interesting language class because: They are a strong, rich class of languages. They are polynomially parsable. Their models of computation are intuitive and easy to understand; highly resemble classical CFG and PDA. SAPDA are the first automaton model presented for Conjunctive Languages. They are an natural extension of PDA. They lend new intuition on Conjunctive Languages.

Future Directions of Broadening the theory of SAPDA Deterministic SAPDA Possible implications on LR-Conjunctive Grammars Considering possible applications Formal verification …

Thank you.

References Aizikowitz, T., Kaminski, M.: Conjunctive grammars and alternating pushdown automata. WoLLIC09. LNAI 5110 (2008) 30 – 41 Aizikowitz, T., Kaminski, M.: Linear conjunctive grammars and one-turn synchronized alternating pushdown automata. Formal Grammars: Bordeaux LNAI To appear. Ginsburg, S., Spanier, E.h.: Finite-turn pushdown automata. SIAM Journal on Control. 4(3) (1966) 429 – 453 Okhotin, A.: Conjunctive grammars. Journal of Automata, Languages and Combinatorics. 6(4) (2001) 519 – 535 Okhotin, A.: Conjunctive languages are closed under inverse homomorphism. Technical Report School of Computing, Queens Univ., Kingston, Ontario, Canada. Okhotin, A.: On the equivalence of linear conjunctive grammars and trellis automata. RAIRO Theoretical Informatics and Applications. 38(1) (2004) 69 – 88 Vijay-Shanker, K., Weir, D.J.: The equivalence of four extensions of context-free grammars. Mathematical Systems Theory. 27(6) (1994) 511 – 546.