Markéta Lopatková, Institute of Formal and Applied Linguistics, MFF UK
PDT – Tectogrammatical Layer: Introduction and T-lemma


PDT: t-layer: intro Lopatková

PDT: t-layer
  Intro
  Relation between t-layer and a-layer
  T-lemma
  documentation:

PDT: t-layer
  Goal: to describe the deep, semantic structure of a sentence ~ the sentence meaning
    disambiguated
    dependency 'tree'
  information on
    lexical items: t-lemma (primarily)
    relations between lexical words ~ deep structure: functors + subfunctors
    grammatemes
    coreferential links
    topic-focus articulation: linear order + tfa attribute (cz: aktuální členění)
  documentation:


PDT: relation between t-layer and a-layer
  each t-node … PML reference to the a-layer
    technical t-root … atree.rf: id of the root of the corresponding analytical tree
    a non-root t-node … attribute a, consisting of 2 attributes:
      lex.rf: id of the a-node from which the t-node got its lexical meaning
      aux.rf: list of ids of all other a-nodes related to the t-node
  t-node with no analytical counterpart: lex.rf and aux.rf empty
    e.g. Dovolil mu odejít. [He allowed him to leave.] {#Cor.ACT}
  copied nodes: lexical items with several occurrences at the t-layer but expressed only once in the surface sentence
    e.g. červené a bílé víno = červené víno a bílé víno [red and white wine = red wine and white wine]
    all attributes a/lex.rf, a/aux.rf … id(s) of the corresponding a-node(s)
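The links from t-nodes to a-nodes can be pictured with a small data structure. The following is only an illustrative sketch, not the PML schema itself: the class name TNode, its methods, and the a-node ids ("a3", "w4") are hypothetical, and only a simplified subset of the nodes in each example sentence is modelled.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TNode:
    """Hypothetical, simplified model of a non-root t-node."""
    t_lemma: str
    lex_rf: Optional[str] = None                      # a/lex.rf: a-node giving the lexical meaning
    aux_rf: List[str] = field(default_factory=list)   # a/aux.rf: all other related a-nodes

    def has_analytical_counterpart(self) -> bool:
        # A t-node without a counterpart has both references empty.
        return self.lex_rf is not None or bool(self.aux_rf)

    def a_node_ids(self) -> List[str]:
        # All a-node ids this t-node points to, lexical one first.
        ids = [self.lex_rf] if self.lex_rf is not None else []
        return ids + self.aux_rf


# "Dovolil mu odejít." with invented a-node ids
odejit = TNode("odejít", lex_rf="a3")
cor = TNode("#Cor")                    # restored node: nothing on the surface

# "červené a bílé víno": the copied node points to the same a-node
vino = TNode("víno", lex_rf="w4")
vino_copy = TNode("víno", lex_rf="w4")

print(cor.has_analytical_counterpart())   # False
print(vino.lex_rf == vino_copy.lex_rf)    # True
```

In this sketch a copied node is simply a second t-node whose references repeat those of the original, which matches the slide's statement that all a/lex.rf and a/aux.rf attributes hold the id(s) of the corresponding a-node(s).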


PDT: t-lemma
  two types of nodes wrt t-lemma
    individual lexical units (present at the surface or 'restored')
      prototypically t-lemma = m-lemma (suffixes are ignored)
      BUT
        lexical and syntactic derivation
        multi-word expressions
        frozen verbal forms (e.g. myslím [I think], soudě [judging]) and foreign-language expressions: t-lemma = m-form
    t-lemma substitutes … starting with #
      personal and possessive pronouns: #PersPron
      newly established words (not copied): #Gen, #Rcp, #Cor … (different types of ellipsis)
      #Forn, #Idph, …
      negation: #Neg
      punctuation: #Comma, #Dash, #Slash, #Bracket, …
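The two kinds of t-lemma can be told apart by the leading #. A minimal sketch, assuming only the substitutes listed on the slide; the function name describe_t_lemma and the group labels are illustrative, not part of PDT.

```python
# Groups of t-lemma substitutes as listed on the slide (the list is not exhaustive).
SUBSTITUTE_GROUPS = {
    "#PersPron": "personal or possessive pronoun",
    "#Gen": "newly established (ellipsis)",
    "#Rcp": "newly established (ellipsis)",
    "#Cor": "newly established (ellipsis)",
    "#Forn": "other substitute",   # listed on the slide without a description
    "#Idph": "other substitute",   # listed on the slide without a description
    "#Neg": "negation",
    "#Comma": "punctuation",
    "#Dash": "punctuation",
    "#Slash": "punctuation",
    "#Bracket": "punctuation",
}


def describe_t_lemma(t_lemma: str) -> str:
    """Return a coarse label for a t-lemma (hypothetical helper)."""
    if not t_lemma.startswith("#"):
        return "lexical unit"
    # Substitutes not in the table are still recognisable by the leading '#'.
    return SUBSTITUTE_GROUPS.get(t_lemma, "unlisted substitute")


print(describe_t_lemma("#Neg"))    # negation
print(describe_t_lemma("víno"))    # lexical unit
```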

Syntactic and lexical derivation
  "traditional" part of speech classification (PoS)
    morphological tag
    10 basic classes: nouns, adjectives, pronouns, numerals, adverbs, verbs, prepositions, conjunctions, particles, interjections
  syntactic part of speech classification
    syntactic nouns, adjectives, adverbs, verbs
    e.g., Šmilauer "skladebné podstatné jméno" [syntactic noun]
  semantic part of speech classification
    syntactic vs. lexical derivation (Kuryłowicz)
    sempos attribute
    semantic nouns, semantic adjectives, semantic adverbs, semantic verbs

Syntactic and lexical derivation (cont.)
  syntactic derivation
    new syntactic function (change of PoS)
    the same semantics
    e.g. přicházet → přicházení; to arrive → arriving (not in PDT)
         přicházení → příchod; to arrive → arrival (not in PDT)
         pěkně [nicely] → pěkný [nice]
  lexical derivation
    new syntactic function (change of PoS)
    change in semantics
    e.g. učit → učitel; to teach → teacher
         učit → učebna [classroom]
  FGD theory: derived words are represented by the t-lemma of the original word

PDT: t-lemma for derived words
  personal and possessive pronouns
    e.g. já, mi, tobě, sebe, je [I, me, you, myself, them] → #PersPron
         tvé, jejich, svoje [your, their, refl] → #PersPron
  possessive adjectives
    e.g. matčin [mother's] → matka [mother]
         Pavlova [Pavel's] → Pavel
  deadjectival adverbs
    e.g. pěkně [nicely] → pěkný [nice]
  directional adverbs (→ locative)
    e.g. tudy [this way] → tady [here]; kudy [which way] → kde [where]
  temporal adverbs
    e.g. doteď [until now] → teď [now]; dokdy [till when] → kdy [when]
  short forms of adjectives
    e.g. zklamán [disappointed] → zklamaný
    NOT for passive participles: pozván [invited] → pozvat [to invite]
  syntactic derivation: m-lemma → t-lemma + functor
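The mappings on this slide can be read as a lookup from a surface word to its t-lemma. The sketch below is a toy table built only from the slide's examples; the names PERS_PRON_FORMS, DERIVED and t_lemma_for are hypothetical, and a real annotation would also need the functor and morphological disambiguation, which are left out here.

```python
# Pronoun forms from the slide that are all represented by #PersPron.
# "je" is left out on purpose: without disambiguation it could also be the verb "is".
PERS_PRON_FORMS = {"já", "mi", "tobě", "sebe", "tvé", "jejich", "svoje"}

# Derived word -> t-lemma of the original word (examples from the slide only).
DERIVED = {
    "matčin": "matka",      # possessive adjective
    "Pavlova": "Pavel",     # possessive adjective
    "pěkně": "pěkný",       # deadjectival adverb
    "tudy": "tady",         # directional adverb -> locative
    "kudy": "kde",
    "doteď": "teď",         # temporal adverb
    "dokdy": "kdy",
    "zklamán": "zklamaný",  # short form of an adjective
    "pozván": "pozvat",     # passive participle: goes to the verb instead
}


def t_lemma_for(word: str) -> str:
    """Toy lookup: return the t-lemma for a word, or the word itself if it is not listed."""
    if word in PERS_PRON_FORMS:
        return "#PersPron"
    return DERIVED.get(word, word)


print(t_lemma_for("matčin"))   # matka
print(t_lemma_for("tobě"))     # #PersPron
```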

PDT: t-lemma for derived words
  numerals
    ordinal, sort/kind, set and fraction numerals
      derived from the cardinal numerals
      t-lemma of the cardinal number
      grammateme numertype
        ord (cz řadové) … e.g. třetí [the_third] → tři [three]
        kind (cz druhové) … trojí [three_kinds_of] → tři
        set (cz souborové) … troje [three_sets/pairs/…_of] → tři
        frac (cz dílové) … třetina [(one) third] → tři
  lexical derivation: m-lemma → t-lemma + numertype
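For numerals the derived word keeps the t-lemma of the cardinal number and the type of numeral moves into the numertype grammateme. A minimal sketch using only the four examples above; the table NUMERALS and the function analyze_numeral are hypothetical names, and words outside the table are returned with no numertype because the slide does not say what value they would carry.

```python
from typing import Optional, Tuple

# m-lemma -> (t-lemma of the cardinal number, numertype), from the slide's examples.
NUMERALS = {
    "třetí": ("tři", "ord"),     # ordinal
    "trojí": ("tři", "kind"),    # sort/kind
    "troje": ("tři", "set"),     # set
    "třetina": ("tři", "frac"),  # fraction
}


def analyze_numeral(m_lemma: str) -> Tuple[str, Optional[str]]:
    """Return (t_lemma, numertype); unknown words come back unchanged with None."""
    return NUMERALS.get(m_lemma, (m_lemma, None))


print(analyze_numeral("třetina"))  # ('tři', 'frac')
```

All four derived numerals share the t-lemma tři; only the grammateme value tells them apart.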

PDT: t-lemma for derived words
  pronouns
    pronouns, pronominal numerals and pronominal adverbs
      relative, indefinite, interrogative, negative and totalizing
      derived from the corresponding interrogative or relative pronoun / numeral / adverb
        t-lemma
        grammateme indeftype
    e.g. někdo [somebody] → kdo [who]
         nikdo [nobody] → kdo
         kdokoliv [anybody] → kdo
         nic [nothing] → co [what]
         několik [several] → kolik [how many]
         všechen [all] → co [what]
         žádný [no] → který [which]
  lexical derivation: m-lemma → t-lemma + indeftype


PDT: t-lemma for multi-word expressions
  reflexiva tantum
    e.g. smát se [to laugh Refl] → smát_se
         setkat se [to meet] → setkat_se
  complex conjunctions and conjunction pairs, operators
    e.g. buď … nebo [either … or] → buď_nebo
         od … přes … do [from … via … to] → od_přes_do
         a nebo [or] → a_nebo
  numeral expressions
    e.g. → 278_11
         41 letý [forty-one_years_old] → 41_letý
  idioms
    e.g. nohy na ramena [legs on shoulders] → nohy_na_ramena
  etc.
  grammatemes
    e.g. chtít přijít [to want to come] → přijít [to come] + volitive (deontic modality)
  special functors
    e.g. CPHR: mít dojem [to have the impression]
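In the examples above, the t-lemma of a multi-word expression is formed by joining its parts with underscores and dropping any intervening material. A small sketch of that convention; the function name mwe_t_lemma is hypothetical, and deciding which words belong to one expression is assumed to have been done beforehand.

```python
from typing import Sequence


def mwe_t_lemma(parts: Sequence[str]) -> str:
    """Join the parts of a multi-word expression into one underscore-separated t-lemma."""
    return "_".join(parts)


print(mwe_t_lemma(["smát", "se"]))             # smát_se
print(mwe_t_lemma(["buď", "nebo"]))            # buď_nebo
print(mwe_t_lemma(["od", "přes", "do"]))       # od_přes_do
print(mwe_t_lemma(["nohy", "na", "ramena"]))   # nohy_na_ramena
```

The parts of a conjunction pair such as buď … nebo need not be adjacent in the sentence, so the input here is the list of parts rather than a contiguous string.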

PDT: node types
  eight node types … attribute nodetype
  defined on the basis of a t-lemma and/or a functor

References
  Manual for Tectogrammatical Annotation
  Kuryłowicz, J. (1936). Dérivation lexicale et dérivation syntaxique. Bulletin de la Société de linguistique de Paris, 37, pp. 79–92. Czech translation in: Principy strukturní syntaxe I. Praha, Univerzita Karlova, pp. 87–94.