Constituent Structure

Slides:



Advertisements
Similar presentations
Chapter 4 Syntax Part IV.
Advertisements

The Structure of Sentences Asian 401
Lecture #9 Syntax, Pragmatics, and Semantics © 2014 MARY RIGGS 1.
Chapter 4 Syntax.
Sub-constituents of NP in English September 12, 2007.
1 Introduction to Linguistics II Ling 2-121C, group b Lecture 4 Eleni Miltsakaki AUTH Spring 2006.
Statistical NLP: Lecture 3
Syntax (1) Dr. Ansa Hameed.
SYNTAX Introduction to Linguistics. BASIC IDEAS What is a sentence? A string of random words? If it is a sentence, does it have to be meaningful?
MORPHOLOGY - morphemes are the building blocks that make up words.
Morphology Chapter 7 Prepared by Alaa Al Mohammadi.
1 Words and the Lexicon September 10th 2009 Lecture #3.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
Matakuliah: G0922/Introduction to Linguistics Tahun: 2008 Session 10 Syntax 1.
1 CSC 594 Topics in AI – Applied Natural Language Processing Fall 2009/ Outline of English Syntax.
Lect. 11Phrase structure rules Learning objectives: To define phrase structure rules To learn the forms of phrase structure rules To compose new sentences.
Today  What is syntax?  Grammaticality  Ambiguity  Phrase structure Readings: 6.1 – 6.2.
Constituency Tests Phrase Structure Rules
THE PARTS OF SYNTAX Don’t worry, it’s just a phrase ELL113 Week 4.
Syntax The number of words in a language is finite
How are sentences are constructed?. The boys laughed. MorphemesWords Thethe Boyboys -s laughlaughed -ed.
Constituents  Sentence has internal structure  The structures are represented in our mind  Words in a sentence are grouped into units, and these units.
Meeting 3 Syntax Constituency, Trees, and Rules
Chapter 4 Syntax Part II.
Lecture Four Syntax.
Introduction to Linguistics
ASPECTS OF LINGUISTIC COMPETENCE 4 SEPT 09, 2013 – DAY 6 Brain & Language LING NSCI Harry Howard Tulane University.
Syntax.
Dr. Monira Al-Mohizea MORPHOLOGY & SYNTAX WEEK 12.
Dr. Monira Al-Mohizea MORPHOLOGY & SYNTAX WEEK 11.
Natural Language Processing Lecture 6 : Revision.
SYNTAX Lecture -1 SMRITI SINGH.
CS : Language Technology for the Web/Natural Language Processing Pushpak Bhattacharyya CSE Dept., IIT Bombay Constituent Parsing and Algorithms (with.
NLP. Introduction to NLP Is language more than just a “bag of words”? Grammatical rules apply to categories and groups of words, not individual words.
BY HELEN LORENA SOLANO ALEXANDER ARANDA. is a group of words without both a subject and predicate. Phrases combine words into a larger unit that can function.
GrammaticalHierarchy in Information Flow Translation Grammatical Hierarchy in Information Flow Translation CAO Zhixi School of Foreign Studies, Lingnan.
Notes on Pinker ch.7 Grammar, parsing, meaning. What is a grammar? A grammar is a code or function that is a database specifying what kind of sounds correspond.
Linguistic Essentials
Parsing with Context-Free Grammars for ASR Julia Hirschberg CS 4706 Slides with contributions from Owen Rambow, Kathy McKeown, Dan Jurafsky and James Martin.
CPE 480 Natural Language Processing Lecture 4: Syntax Adapted from Owen Rambow’s slides for CSc Fall 2006.
Rules, Movement, Ambiguity
Making it stick together…
WORDS The term word is much more difficult to define in a technical sense, and like many other linguistic terms, there are often arguments about what exactly.
Syntax II “I really do not know that anything has ever been more exciting than diagramming sentences.” --Gertrude Stein.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Spring 2006-Lecture 2.
SYNTAX.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
SYNTAX 1 NOV 9, 2015 – DAY 31 Brain & Language LING NSCI Fall 2015.
TYPES OF PHRASES REPRESENTING THE INTERNAL STRUCTURE OF PHRASES 12/5/2016.
Basic Syntactic Structures of English CSCI-GA.2590 – Lecture 2B Ralph Grishman NYU.
X-Bar Theory. The part of the grammar regulating the structure of phrases has come to be known as X'-theory (X’-bar theory'). X-bar theory brings out.
Language Structure Lecture 1: Introduction & Overview Helena Frännhag Spring 2013.
Week 3. Clauses and Trees English Syntax. Trees and constituency A sentence has a hierarchical structure Constituents can have constituents of their own.
College of Science and Humanity Studies, Al-Kharj.
SYNTAX.
Structure, Constituency & Movement
An Introduction to the Government and Binding Theory
Statistical NLP: Lecture 3
BBI 3212 ENGLISH SYNTAX AND MORPHOLOGY
Syntax Word order, constituency
SYNTAX.
Part I: Basics and Constituency
Structural relations Carnie 2013, chapter 4 Kofi K. Saah.
BBI 3212 ENGLISH SYNTAX AND MORPHOLOGY
Introduction to Linguistics
Natural Language Processing
סמינר בבלשנות חישובית מבנה מרכיבי המשפט הפקולטה למדעי המחשב - הטכניון
Língua Inglesa - Aspectos Morfossintáticos
Structure of a Lexicon Debasri Chakrabarti 13-May-19.
Presentation transcript:

Constituent Structure

Syntactic Categories Parts of speech: Noun, Verb, Adjective, Adverb, etc. Evidence for syntactic categories: child language. The Wug Test (Jean Berko Gleason, 1958) was designed to understand children’s understanding of inflection.

Tongue slips of adult native speakers (Spoonerisms) “Sir, you’ve hissed my mystery class”. Intended: “Sir you’ve missed my history class.”

Ambiguity (Ambiguous sentences) Lexical ambiguity John drove his car to the bank. The hunter went home with five bucks in his pocket.

The [tall bishop]’s hat (The bishop is tall) The tall [bishop’s hat] Structural ambiguity This type of ambiguity is caused by grouping words together in different ways. The [tall bishop]’s hat (The bishop is tall) The tall [bishop’s hat] (The hat is tall)

We can assign different grammatical structures to the same string of words. This is evidence showing that words form sub-groups (or CONSTITUENTS) within a phrase or sentence. These groupings are often crucial in determining the meaning of a sentence.

Words belonging to different syntactic categories Mistrust wounds. “Suspicion hurts people.” “We should mistrust injuries.” Can you interpret the following sentence? Time flies.

Amusing newpaper headlines Which word causes the ambiguity? Reagan Wins On Budget, But More Lies Ahead Squad Helps Dog Bite Victim

How do we identify a constituent? She is crying. The little girl wearing a red hat with a blue ribbon is crying. (1) Strings of words replacing a single word must be units (constituents.)

(ii) İkan besar itu saya makan fish big that I eat Malay (i) Saya makan ikan besar itu I eat fish big that ‘I ate/am eating that big fish.’ (ii) İkan besar itu saya makan fish big that I eat ‘That big fish I ate/am eating.’ (2) When a group of words can be moved as a unit, we can assume that the group froms a syntactic unit.

Orang tua itu makan ikan besar itu person old that eat fish big that Malay Orang tua itu makan ikan besar itu person old that eat fish big that ‘That old person ate the big fish.’ ikan besar itu di-makan oleh anjing saya fish big that PASS-eat by dog my ‘That big fish was eaten by my dog.’ (3) The same string of words can occur in a variety of positions within the sentence, e.g. as subject and object.

Orang tua itu makan ikan besar itu person old that eat fish big that Malay Orang tua itu makan ikan besar itu person old that eat fish big that ‘That old person ate the big fish.’ Siapa makan ikan besar itu Who ate fish big that ‘Who ate that big fish?’ (4) When a group of words are replaced by a question word to form a content question, we can assume the group of words forms a unit.

Siapa makan ikan besar itu Who ate fish big that ‘Who ate that big fish?’ Answer1: Orang tua itu person old that ‘that old person Answer2: *tua itu ‘old that’ old that (5) Constituents can form the answer to a content question, whereas a string of words which is not a syntactic unit is not a possible answer.

Hierarchy Each constituent of a larger unit may itself be composed of smaller constituents. The CLAUSE is the smallest grammatical unit which can express a complete proposition.

A sentence may consist of several clauses. Can you identify the clauses in the following lines? Foxes have holes and birds of the air have nests, But the Son of Man has no place to lay his head.

PHRASE A single clause may contain several phrases. The coach’s wife introduced her little sister to the captain of the football team. [to [the captain [of [the [football team]]]]].

The captain of the football team To the captain of the football team

A single word may contain several morphemes. Dis-taste-ful Read-abil-ity Dis-en-tangle

This is important in morphology, syntax, and phonology. This kind of structural organization is called a PART-WHOLE HIERARCHY: Each unit is entirely composed of smaller units belonging to a limited set of types. This is important in morphology, syntax, and phonology.

Identifying syntactic categories Traditional definitions of parts of speech are based on semantic properties. A NOUN is a word than names a person, place, or thing. A VERB is a word that names an action or event. An ADJECTIVE is a word that describes a state.

They cannot distinguish the noun fool from the adjective foolish. Traditional definitions fail to identify nouns like happiness, love, destruction, etc. They cannot distinguish between the noun love and the adjective fond of. They cannot distinguish the noun fool from the adjective foolish.

In Jabberwocky, we were able to distinguish most parts of speech even though they were mostly nonsense words. Also, children are able to form the plurals of nonsense words or words they’ve never heard before.

We need to address the following problems separately: The identification of syntactic categories cannot be based on semantic factors. We need to address the following problems separately: Which words belong together in the same class? What name (or label) should we assign to a given class?

Answering Question 1 Words that share a number of grammatical characteristics are assumed to belong to the same class. Words that have distinct grammatical characteristics are assumed to belont to differen classes.

Identifying grammatical characteristics Fool vs foolish Modification by degree adverb vs adjective They are utter fools. *They are utter foolish. They are fools. They are very foolish. Inflection for number Fool, fools Foolish, *foolishes Comparative forms Fool-*fooler/*more fool Foolish-more foolish As subject of a clause Fools rush in where angels fear to tread. *Fools rush in where angels fear to tread.

Answering Question 2 Once the word classes in a particular language have been identified in this way, they can be assigned a label (Noun, Verb, etc) based on universal notional patterns. If there is a class whose prototypical members include most of the basic terms for concrete objects (dog, book,house), we would label that class NOUN.

If there is a class whose prototypical members include most of the basic terms for volitional actions (run, dance, eat), we would label that class VERB. The grammatical criteria used to determine word classes are diagnostic features rather than definitions. E.g. In English, not all adjectives can take the comparative and superlative suffixes.

Almost all languages have the lexical categories Noun and Verb, but there is a significant range of difference among languages.

PHRASES and PHRASAL CATEGORIES A phrase must be a group of words which form a constituent. A phrase is lower in the hierarchy than clauses. 1. Which phrases belong together in the same class? 2. What name (or label) should we assign to a given class?

Answering Question 1 Internal structure of phrases e.g. An English noun phrase often begins with a DETERMINER (a, the, that, this) Mutual substitutability: two phrases of the same category could potentially occur in the same positions. e.g. Phrases occuring in Object and Subject positions are NOUN PHRASES.

In most phrases, there is a core word, called the HEAD of the phrase. Answering Question 2 In most phrases, there is a core word, called the HEAD of the phrase. We name a phrase by the category of the head. e.g. That big fish is a NOUN PHRASE because its head is a noun (fish). e.g. very beautiful is an ADJECTIVE PHRASE because its head is an adjective (beautiful).

How do we know which word in the phrase is the head? How do we distnguish the head from the DEPENDENTS (i.e all the other elements in the phrase)?

The head is important because it determines the grammatical features of the phrase as a whole. it may determine the number and type of other elements in the phrase. it is more likely to be obligatory than the modifiers or other non-head elements in the phrase.

The head determines the grammatical features of the phrase as a whole The new rice is in the barn. The new kittens are in the barn.

Prepositional phrases are complements of the adjective phrase The head may determine the number and type of other elements in the phrase. Prepositional phrases are complements of the adjective phrase I am [very grateful to you] John felt [sorry for his actions.] angry at someone, proud of someone, worried about something Objects are complements of the verb phrase Mary is [reading a book]. James [showed his photo album to us]. Mary [runs] every morning.

The head is often obligatory in a phrase. [The little girl wearing a red hat with a blue ribbon] was crying her eyes out. [The little girl] was crying her eyes out. [The girl] was crying her eyes out.

The head may be omitted in certain contexts The third little girl was smarter than the second ___. The good, the bad, and the ugly The rich get richer and the poor get childen.

Major categories (can function as heads of phrases) Noun, verb, adjective, adverb, preposition Minor categories Conjunctions, interjections, determiners (includes articles, demonostratives, and quantifiers)

Tree diagrams representing the constituents of a clause In analyzing grammatical structure, we need to identify The constituent parts which the sentence is formed. The order in which these constitutents occur. The vertical lines inserted between the constitutents are helpful to describe grammatical structure.

Tree diagrams A Mother node C Daughter nodes B

A mother IMMEDIATELY DOMINATES its own daughters. A DOMINATES all of its daughter nodes; i.e. The daughters of daughters, daughters of its grand-daughters, etc. A mother IMMEDIATELY DOMINATES its own daughters. A CONSTITUENT is a string of words which is exhaustively dominated by some node.

PP P NP Det N on the beach

N Noun A Adjective V Verb P Preposition Adv Adverb Det Determiner Conj Conjunction NP Noun Phras AP ADjective Phrase VP Verb Phrase PP Prepositional Phrase S Sentence or Clause

The top-most node in any tree diagram is called the ROOT NODE. The terminal nodes at the bottom are sometimes called LEAVES. The No Crossing Constraint: lines from mother to daughter must not cross. The Single Mother Constraint: each node after the root node must be the daughter of exactly one other node.

The motivation for imposing these constraints is that by allowing crossing lines or multiple parenthood, we would end up with potentially complex structures which are never found in real human languages.

Phrase Structure Rules The task of the linguist is to find out the rules which allow the speakers of a language to construct and comprehend novel sentences. The rules needed to produce Phrase Structure Trees are known as Phrase Structure Rules and have the following form: A B C

A B C This rule says that a node labelled A may immediately dominate two daughters labelled B and C in that order. This is a CONTEXT FREE rule, i.e. there is no conditioning environment stated in this rule.

Each node of a Phrase Structure tree must be permitted (or LICENSED) by a phrase structure rule in order to be legal. To license (or, to generate) the prepositional phrase “on the beach” (slide 44), we would need these rules:

PP P NP NP Det N We also need rules to insert the terminal elements (lexical elements), i.e. to hang leaves on the tree. P {on, in, at, under, over ...} N {beach, house, boy, girl, cat ...} Det {the, a, an, this, that, ...}

The LEXICON (the speaker’s mental dictionary) The lexicon includes much more than a simple list of words. The lexical entry for each word must include phonological, semantic, morphological, and syntactic information. Instead of having lexical rules like the ones in the previous slide, we can simply assume that there is a general rule of LEXICAL INSERTION which will licence a word of any given category to appear as the only daughter of a node which bears the corresponding category label.

Lexical Insertion Rule Any lexical category (N, V, etc) may have a sinlge daughter node which is a specific lexical item of the same category.

Notational devices to combine two or more Phrase Structure Rules a) A B (C) b) A B c) A B C a) X Y Z b) X Y c) X Z

Pronouns and proper names In traditional grammar, pronouns and proper names are not considered as “phrases” in the sense we use them in linguistics. I collapsed. (pronoun) John collapsed. (proper name) The old school collapsed. (noun phrase)

pronoun S proper name V noun phrase The subject of a clause may be expressed as a pronoun, a proper name, or a common noun phrase. pronoun S proper name V noun phrase

The object of a preposition can be a pronoun, a proper name, or a common noun phrase. behind me behind John behind the old school house pronoun PP P proper name noun phrase

Notice that the material inside the braces in PS rules in slides 56 and 57 are exactly the same. The same set of alternatives may show up in other PS rules as well, i.e., in almost every position where a name can occur, we can substitute a pronoun or a common noun phrase.

If we had to list all of these alternatives in every rule that mentions one of these positions, there would be a large amount of redundancy in the rules. We would be missing an important generalization. In order to avoid this massive redundancy, we will use the term NP to refer to any unit which can appear in a name-like position in the phrase structure.

Two New Phrase Structure Rules S NP V PP P NP Traditional grammars state that a pronoun “takes the place of a noun”, but in fact pronouns replace whole NPs.

Pronouns are never modified by adjectives (but common nouns are) The quick red [fox] jumped over the lazy brown dog. *The quick red [she] jumped over the lazy brown dog. She jumped over him.

Proper nouns are not modified by determiners or adjectives either. Some unusal cases exist: You are the first Emily I’ve ever met. We will assume that pronouns and proper names are lexical items whose lexical entry specifies that they belong to category NP, rather than N. They may appear in tree diagrams as immediate daughters of an NP node.

This is the end of the lecture on constituency This is the end of the lecture on constituency. You can now do the exercises in Kroeger, pp. 47-50.