Elicitation Corpus April 12, 2003. Agenda Tagging with feature vectors or feature structures Combinatorics Extensions.

Slides:



Advertisements
Similar presentations
Language and Grammar Unit
Advertisements

Lecture 3a Clause functions Adapted from Mary Laughren.
APA Style Grammar. Verbs  Use active rather than passive voice, select tense and mood carefully  Poor: The survey was conducted in a controlled setting.
ASPECTS OF LINGUISTIC COMPETENCE 5 SEPT 11, 2013 – DAY 7 Brain & Language LING NSCI Harry Howard Tulane University.
Greenberg 1963 Some Universals of Grammar with Particular Reference to the Order of Meaningful Elements.
Projecting Grammatical Features in Nominals: 23 March 2010 Jerry T. Ball Senior Research Psychologist 711 th HPW / RHAC Air Force Research Laboratory DISTRIBUTION.
Grammatical Relations and Lexical Functional Grammar Grammar Formalisms Spring Term 2004.
Statistical NLP: Lecture 3
Chapter 4 Basics of English Grammar
October 8, : Grammars and Lexicons Lori Levin (Examples from Kroeger)
The Eight Parts of Speech
 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Word Classes and English Grammar.
Present Day English How we use and label verbs. Principal Parts Verbs in all Germanic languages have few inflections. PDE is a Germanic language. –all.
NLP and Speech 2004 English Grammar
Week 2a. Morphosyntactic features, part II. Ch. 2, 4.2- CAS LX 522 Syntax I.
1 CSC 594 Topics in AI – Applied Natural Language Processing Fall 2009/ Outline of English Syntax.
Getting started with Sanskrit grammar. Inflectional form: Root + Affix = Stem Stem + Inflectional ending = Word.
Grammar Nuha Alwadaani.
Phrases and Sentences: Grammar
11 CS 388: Natural Language Processing: Syntactic Parsing Raymond J. Mooney University of Texas at Austin.
Latin Grammar: Singular and Plural Magister Henderson Latin I.
Embedded Clauses in TAG
Grammar Skills Workshop
Chapter 4 Basics of English Grammar Business Communication Copyright 2010 South-Western Cengage Learning.
Introduction to English Syntax Level 1 Course Ron Kuzar Department of English Language and Literature University of Haifa Chapter 2 Sentences: From Lexicon.
Morphology An Introduction to the Structure of Words By Christian Monson.
ASPECTS OF LINGUISTIC COMPETENCE 4 SEPT 09, 2013 – DAY 6 Brain & Language LING NSCI Harry Howard Tulane University.
Daily Grammar Practice
© 2006 SOUTH-WESTERN EDUCATIONAL PUBLISHING 11th Edition Hulbert & Miller Effective English for Colleges Chapter 2 PRONOUNS.
Minor Parts of Speech  Pronouns  Auxiliaries  Conjunctions  Determiners  Qualifiers  Prepositions  Isolates.
Chapter 5 Syntax English Linguistics: An Introduction.
Grammar Fix Part 1. Pronouns What are they? Words that take the place of a noun How many can you think of? There are many, but they fall in to Five main.
Morphology An Introduction to the Structure of Words Lori Levin and Christian Monson Grammars and Lexicons Fall Term, 2004.
Ideas for 100K Word Data Set for Human and Machine Learning Lori Levin Alon Lavie Jaime Carbonell Language Technologies Institute Carnegie Mellon University.
By: Jeremy Pagnotti.  Phonetic language (no silent letters)  No particular word order  Grammatical function of nouns and verbs displayed by endings.
CPE 480 Natural Language Processing Lecture 4: Syntax Adapted from Owen Rambow’s slides for CSc Fall 2006.
Parsing and Translating
LING 388: Language and Computers Sandiway Fong Lecture 21.
Are Verbs important? Why/ why not? TRUE or FALSE? The English verb has only 2 forms. Right answer: It has 3 forms: The Infinitive, the Gerund & the.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
SYNTAX 1 NOV 9, 2015 – DAY 31 Brain & Language LING NSCI Fall 2015.
1 Some English Constructions Transformational Framework October 2, 2012 Lecture 7.
Basic Syntactic Structures of English CSCI-GA.2590 – Lecture 2B Ralph Grishman NYU.
Eliciting a corpus of word- aligned phrases for MT Lori Levin, Alon Lavie, Erik Peterson Language Technologies Institute Carnegie Mellon University.
Unit 1 Language Parts of Speech. Nouns A noun is a word that names a person, place, thing, or idea Common noun - general name Proper noun – specific name.
Chapter 11: Parsing with Unification Grammars Heshaam Faili University of Tehran.
Grammar and Sentence Writing ENG 111 Al-Huqail, Eman.
Inflection. Inflection refers to word formation that does not change category and does not create new lexemes, but rather changes the form of lexemes.
ENGLISH LANGUAGE – 2° YEAR A HISTORY OF THE ENGLISH LANGUAGE Annalisa Federici, Ph.D. Textbook: J. Culpeper, History of English, Routledge (unit.
System and the axis of Choice  Systems are list of choices which are available in the grammar of a language.  It could be a list of things b/w which.
LECT. 11 DR. AMAL ALSAIKHAN Government and Case Theories.
Lecture 1 Sentences Verbs.
Descriptive Grammar – 2S, 2016 Mrs. Belén Berríos Droguett
Ms. Rasha Ali Inflection.
Focus on Fluency Nouns, Adjectives, and pronouns
Eliciting a corpus of word-aligned phrases for MT
The World of Verbs.
Lecture 3: Functional Phrases
Statistical NLP: Lecture 3
Lecture 4b: Verb Processes
Grammar: Issues with Agreement
Getting started with Sanskrit grammar
NOUNS person, place, thing, or idea
Chapter 4 Basics of English Grammar
Conversational Russian I. Day 5.
GRAMMAR قواعد اللغــــــــــة الإنجليزية
Chapter 4 Basics of English Grammar
The 7Cs: A Pedagogical Framework for Grammar Teaching and Learning
ALI139 – Arabic Grammar I Week 2.
Presentation transcript:

Elicitation Corpus April 12, 2003

Agenda Tagging with feature vectors or feature structures Combinatorics Extensions

Annotating the corpus Feature Vectors: –Maria saw the girls. –Snum-s, stype-prop, sanim-an, scount-na, sdef-def, vtype-perc, vtime-past, onum-pl, odef-def, etc. Feature Structures: –((SUBJ ((num sg) (type prop) (anim an) (count na) (def def))) (vtype perc) (vtime past) (OBJ ((etc. –These are easy: they come right out of the parser.

Adapting parser output Do we need to filter out irrelevant features? –E.g., features about “have” and “be” to make the English auxiliary system work. E.g., (AUX- TYPE) = have

Not covered by the parser Derived features: –does the subject outrank the object in animacy? Constructional features: –Counterfactual conditional: If I had gone, I would have seen him. –Do we want to extend the parsing grammar to label these automatically? Discourse/semantic/context features: –Context: Who saw John? Elicitation sentence: Bill saw John. Feature: subject is new information. –Elicitation sentence: He must see it. Feature: evidential or deontic (obligation) Features that aren’t used in English –Context: we=you and me (inclusive ‘we’) –Elicitation sentence: We are tall.

Example of Combinatorics: subject verb agreement five numbers (singular, plural, dual, trial, paucal) three genders (masculine, feminine, and neuter, and more for Bantu languages) four persons (first, second, third, and fourth), several levels of animacy (animate, inanimate, first and second person, third person) two levels of definiteness (definite and indefinite) huge number of tenses and aspects (present, past, future, non-past, non-future, near past, remote past, near future, remote future, continuous, perfective, etc.). Two steps? (1) Which features are involved? (2) Which values are involved?

Example of combinatorics: determiners and possessive pronouns See handout.

Current Coverage of the Elicitation Corpus Basic word order: intransitive verb and subject; transitive verb with subject and object; noun phrase with determiners, adjectives, an possessors. Definiteness and animacy: special treatment of indefinite subjects, inanimate subjects, definite direct objects, animate direct objects, and sentences where the object outranks the subject in definiteness or animacy. Agreeement (in number, gender, person, etc.): subject and verb; object and verb; determiner and noun; adjective and noun; possessor and noun; relative pronoun and noun. Possessive NPs: with inalienable possession (body parts); kinship terms; alienable possession; pronominal possessors; full NP possessors. Inflectional Features: gender, number, person, case, tense.

Not covered by the elicitation corpus Subcategorization frames for major verb classes: stative, change of state, change of location, change of possession, creation, filling and covering, experience, cognition, perception, saying and telling, causatives, etc. Voice: active, passive, and oblique voices. Negation: sentences and noun phrases Relative clauses: inflectional features of the relative pronoun; possible locations of the gap; headed or unheaded, etc. Embedded clauses: argument clauses; adjunct clauses; nominalized clauses.

Not Covered Coordination: sentences (switch reference and same subject), noun phrases, and other constituents. Questions: Yes-no questions (positive answer expected and negative answer expected); Open questions (possible locations of gaps). Other constructions: comparatives, conditionals, causatives, desideratives, imperatives, possessor ascension, quantifier float, noun incorporation (polysynthesis). –Each of these has a few parameters to check: e.g., does the causee come out in dative or accusative case; can the incorporated noun take an unincorporated modifier; which NPs can possessors ascend from/quantifiers float from, etc. Further coverage of tense, aspect, and modality: present, past, and future time; ongoing and completed actions; punctual and non-punctual activities; habituality; iteration; realized and non-realized. –Cross product of these with lexical aspect: state, activity, accomplishment, punctual.

Not Covered Information structure: treatment of topic (given information) and focus (new information), including clefted and topicalized sentences. Other meanings that are typically grammaticalized: yet, still, only, distributive (each), etc. Other noun phrase phenomena: quantification, deictic determiners, classifiers, etc.