Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi and introduced in 1975. Tree-adjoining grammars are somewhat similar to context-free.

Slides:



Advertisements
Similar presentations
Computational language: week 10 Lexical Knowledge Representation concluded Syntax-based computational language Sentence structure: syntax Context free.
Advertisements

Lecture # 8 Chapter # 4: Syntax Analysis. Practice Context Free Grammars a) CFG generating alternating sequence of 0’s and 1’s b) CFG in which no consecutive.
Translator Architecture Code Generator ParserTokenizer string of characters (source code) string of tokens abstract program string of integers (object.
May 2006CLINT-LN Parsing1 Computational Linguistics Introduction Approaches to Parsing.
Chapter Chapter Summary Languages and Grammars Finite-State Machines with Output Finite-State Machines with No Output Language Recognition Turing.
ICE1341 Programming Languages Spring 2005 Lecture #4 Lecture #4 In-Young Ko iko.AT. icu.ac.kr iko.AT. icu.ac.kr Information and Communications University.
GRAMMAR & PARSING (Syntactic Analysis) NLP- WEEK 4.
LTAG Semantics on the Derivation Tree Presented by Maria I. Tchalakova.
ISBN Chapter 3 Describing Syntax and Semantics.
Chapter 3 Describing Syntax and Semantics Sections 1-3.
Starting With Complex Primitives Pays Off: Complicate Locally, Simplify Globally ARAVIND K. JOSHI Department of Computer and Information Science and Institute.
LEXICALIZATION AND CATEGORIAL GRAMMARS: ARAVIND K. JOSHI A STORY BAR-HILLEL MIGHT HAVE LIKED UNIVERSITY OF PENNSYLVANIA PHILADELPHIA, PA USA June.
1 Introduction: syntax and semantics Syntax: a formal description of the structure of programs in a given language. Semantics: a formal description of.
Chapter 3: Formal Translation Models
COP4020 Programming Languages
Probabilistic Parsing Ling 571 Fei Xia Week 5: 10/25-10/27/05.
(2.1) Grammars  Definitions  Grammars  Backus-Naur Form  Derivation – terminology – trees  Grammars and ambiguity  Simple example  Grammar hierarchies.
Chapter 2 Syntax A language that is simple to parse for the compiler is also simple to parse for the human programmer. N. Wirth.
1 Introduction to Parsing Lecture 5. 2 Outline Regular languages revisited Parser overview Context-free grammars (CFG’s) Derivations.
Introduction Syntax: form of a sentence (is it valid) Semantics: meaning of a sentence Valid: the frog writes neatly Invalid: swims quickly mathematics.
DIPLOMA THESIS Peter Černo Clearing Restarting Automata Supervised by RNDr. František Mráz, CSc.
Winter 2007SEG2101 Chapter 71 Chapter 7 Introduction to Languages and Compiler.
1 Chapter 3 Describing Syntax and Semantics. 3.1 Introduction Providing a concise yet understandable description of a programming language is difficult.
Grammars CPSC 5135.
PART I: overview material
LANGUAGE DESCRIPTION: SYNTACTIC STRUCTURE
Lecture # 9 Chap 4: Ambiguous Grammar. 2 Chomsky Hierarchy: Language Classification A grammar G is said to be – Regular if it is right linear where each.
3-1 Chapter 3: Describing Syntax and Semantics Introduction Terminology Formal Methods of Describing Syntax Attribute Grammars – Static Semantics Describing.
May 2006CLINT-LN Parsing1 Computational Linguistics Introduction Parsing with Context Free Grammars.
ProgrammingLanguages Programming Languages Language Syntax This lecture introduces the the lexical structure of programming languages; the context-free.
October 2005csa3180: Parsing Algorithms 11 CSA350: NLP Algorithms Sentence Parsing I The Parsing Problem Parsing as Search Top Down/Bottom Up Parsing Strategies.
TextBook Concepts of Programming Languages, Robert W. Sebesta, (10th edition), Addison-Wesley Publishing Company CSCI18 - Concepts of Programming languages.
Copyright © by Curt Hill Grammar Types The Chomsky Hierarchy BNF and Derivation Trees.
1 Syntax In Text: Chapter 3. 2 Chapter 3: Syntax and Semantics Outline Syntax: Recognizer vs. generator BNF EBNF.
Parsing Introduction Syntactic Analysis I. Parsing Introduction 2 The Role of the Parser The Syntactic Analyzer, or Parser, is the heart of the front.
Bernd Fischer RW713: Compiler and Software Language Engineering.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 4.
CSA2050 Introduction to Computational Linguistics Parsing I.
Basic Parsing Algorithms: Earley Parser and Left Corner Parsing
1 Context Free Grammars October Syntactic Grammaticality Doesn’t depend on Having heard the sentence before The sentence being true –I saw a unicorn.
Compiler Design Introduction 1. 2 Course Outline Introduction to Compiling Lexical Analysis Syntax Analysis –Context Free Grammars –Top-Down Parsing –Bottom-Up.
csa3050: Parsing Algorithms 11 CSA350: NLP Algorithms Parsing Algorithms 1 Top Down Bottom-Up Left Corner.
Supertagging CMSC Natural Language Processing January 31, 2006.
LING 6520: Comparative Topics in Linguistics (from a computational perspective) Martha Palmer Jan 15,
Syntax and Semantics Form and Meaning of Programming Languages Copyright © by Curt Hill.
Handling Unlike Coordinated Phrases in TAG by Mixing Syntactic Category and Grammatical Function Carlos A. Prolo Faculdade de Informática – PUCRS CELSUL,
CSC312 Automata Theory Lecture # 26 Chapter # 12 by Cohen Context Free Grammars.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Programming Languages and Design Lecture 2 Syntax Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Chapter 4: Syntax analysis Syntax analysis is done by the parser. –Detects whether the program is written following the grammar rules and reports syntax.
Chapter 3 – Describing Syntax CSCE 343. Syntax vs. Semantics Syntax: The form or structure of the expressions, statements, and program units. Semantics:
10/31/00 1 Introduction to Cognitive Science Linguistics Component Topic: Formal Grammars: Generating and Parsing Lecturer: Dr Bodomo.
Modeling Arithmetic, Computation, and Languages Mathematical Structures for Computer Science Chapter 8 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesAlgebraic.
Describing Syntax and Semantics Chapter 3: Describing Syntax and Semantics Lectures # 6.
Chapter 3 – Describing Syntax
An Introduction to the Government and Binding Theory
Programming Languages Translator
Basic Parsing with Context Free Grammars Chapter 13
What does it mean? Notes from Robert Sebesta Programming Languages
Automata and Languages What do these have in common?
Department of Software & Media Technology
Compiler Design 4. Language Grammars
Lexical and Syntax Analysis
TREE ADJOINING GRAMMAR
Chapter 2: A Simple One Pass Compiler
Lecture 7: Introduction to Parsing (Syntax Analysis)
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Lec00-outline May 18, 2019 Compiler Design CS416 Compiler Design.
COMPILER CONSTRUCTION
Presentation transcript:

Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi and introduced in Tree-adjoining grammars are somewhat similar to context-free grammars, but the elementary unit of rewriting is the tree rather than the symbol. Whereas context-free grammars have rules for rewriting symbols as strings of other symbols, tree-adjoining grammars have rules for rewriting the nodes of trees as other trees (see tree (graph theory) and tree (data structure) ).

A Tree Adjoining Grammar (TAG) is a 5-tuple G = {Σ,NT, I,A, S} such that Σ is a finite set of terminal symbols. NT is a finite set of non terminal symbols. I is a finite set of finite trees called initial trees. A is a finite set of finite trees called auxiliary trees. S is a distinguished non-terminal symbol. The trees in I ∪ A are called elementary trees. The trees of G are combined using adjunction and substitution.

TAG Elementary trees NP John S NP↓ VP runs VP VP ⋆ ADV quickly Initial trees are elementary trees whose leaves are labelled with non terminal or terminal categories. Leaf nodes labelled with non terminal are substitution nodes marked with ↓ Auxiliary trees are elementary trees whose with a designated foot node. The root and the foot nodes are labelles with the same category.

The rules in a TAG are trees with a special leaf node known as the foot node, which is anchored to a word. There are two types of basic trees in TAG: initial trees (often represented as '') and auxiliary trees (''). Initial trees represent basic valency relations, while auxiliary trees allow for recursion. Auxiliary trees have the root (top) node and foot node labeled with the same symbol. A derivation starts with an initial tree, combining via either substitution or adjunction. Substitution replaces a frontier node with another tree whose top node has the same label. Adjunction inserts an auxiliary tree into the center of another tree. The root/foot label of the auxiliary tree must match the label of the node at which it adjoins. Other variants of TAG allow multi-component trees, trees with multiple foot nodes, and other extensions.

S VP NP NP ↓ VP VP ⋆ ADV John runs quickly S NP VP VP ADV John runs quickly Adjunction inserts an auxiliary tree into a tree (Adjunction is not allowed on substitution nodes) Substitution inserts a derived or elementary tree at the substitution node of a TAG tree.

There are several extensions of tree adjoining grammars being explored, such as lexical tree adjoining grammars (LTAGs). LTAGs consist of sets of initial and auxiliary trees which can correspond to units in human language. A TAG is lexicalized if each elementary tree has at least one leaf with a terminal label. In lexicalized TAG, each elementary tree has at least one leaf with a terminal label. Computationally interesting: if the grammar is finite, the number of analyses for a sentence is finite. Linguistically interesting: each lexical item can be associated with the set of syntactic constructions it occurs in.

Tree Adjoined Grammars fits in between the Context Free and Context Sensitive classes in the Chomsky Hierarchy: they describe Mildly Context-sensitive languages. Where as Context Free grammars have the pleasant property of being parsable in polynomial space and time, they are not adequate for describing for example the natural languages

Vijay-Shanker and Weir (1994) demonstrates that Linear Indexed Grammars, Combinatory Categorial Grammars, Tree-adjoining Grammars, and Head Grammars are weakly equivalent formalisms, in that they all define the same string languages.

Two ways of grammar implementation with TAG 1)XTAG tools (UPenn) XTAG is an on-going project to develop a wide-coverage grammar for English using a lexicalized Tree Adjoining Grammar (TAG) formalism. XTAG also serves as an system for the development of TAGs and consists of a parser, an X-windows grammar development interface and a morphological analyzer. parser, editor, viewer,... 2) XMG + TuLiPA XMG: eXtensible MetaGrammar (Duchier et al, 2004) TuLiPA: T¨ubingen Linguistic Parsing Architecture (Parmentier et al, 2008)

Grammar/ linguistic theory: rules for well-formed structures of natural language Grammar formalism: mathmatically concise description language Tree Adjoining Grammar (TAG) Implementation: (the result of) a process to translate sth. into a specific grammar formalism into a specific input format for a parser

TAG departs from other linguistic framework by its linguistic and computational properties : extended domain of locality, factoring out of recursion, mildly context-sensitive. Semantic construction in TAG is still an open research area. Surface realisation benefits from the susbtitution/adjunction distinction. Tree Adjoining Grammar is a good formalism for describing natural language syntax. Exact type of TAG, and exact type of linguistic theory, are active areas of research

A. K. Joshi. (1985). An Introduction to Tree Adjoining Grammars. Philadelphia, Pennsylvania: Technical Report MS- CIS-86-64, Univ. of Pennsylvania. Claire Gardent, Tree Adjoining Grammars Theory and Practice, Bankok, Philippe de Groote, Tree Adjoining Grammars as Abstract Categorial Grammars, Campus Scientifique, B.P. 239, France. XTAG project: /slides/gimpl intro.pdf duesseldorf.de/~lichte/events/tag/slides/1-intro.pdf