Building the Valency Lexicon of Arabic Verbs Viktor Bielický Otakar Smrž LREC 2008, Marrakech, Morocco.



“Valency”
potential of a lexical unit, esp. verb, to bind some other specific syntactic element(s)
valency information is lexicalized – unpredictable by grammatical rules
valency behavior of a verb has to be stated explicitly in a lexicon
a linguistic task – to create a valency lexicon of the most frequent Arabic verbs reusable by both humans and computational systems

Theoretical framework
valency theory in Functional Generative Description (FGD)
  – a multi-stratal dependency-oriented description
  – elaborated on Czech
inspiration – VALLEX and PDT-Vallex for Czech
resources – Prague Arabic Dependency Treebank (PADT), Corpus Linguae Arabicae (CLARA), Arabic Gigaword, Arabic-Czech dictionary, printed dictionaries
valency associated with the underlying tectogrammatical layer of language representation – it describes meaning
valency information defined by the valency frame – filled by various types of valency complements described by functors

Types of complements
(1) inner participants (actants) – ACTor, PATient, ADDRessee, ORIGin, EFFect
(2) free modifications (adjuncts) – e.g. time, location, direction, manner, aim, cause…
obligatory vs. optional complements – decided by the dialog test

Complements ACT, PAT, ORIG, EFF

Structured frame ACT, PAT, ORIG, EFF

Functional Generative Description
shifting of cognitive roles
  – application of primarily syntactic criteria for assigning the first two inner participants (ACT, PAT)
  – semantic criteria for the remaining inner participants and all free modifications
shifting: ACT ← PAT ← EFF/ADDR/ORIG
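The shifting rule can be read as a small procedure: the first two inner participants that are present always receive ACT and PAT, and only the remaining ones keep a semantically motivated functor. The following is a minimal Python sketch of that reading, not the authors' implementation. The function name `shift_roles` is hypothetical, and the tie-break order EFF, ADDR, ORIG is an assumption taken from the order in which the slide lists them.

```python
# Hypothetical illustration of the shifting ACT <- PAT <- EFF/ADDR/ORIG.
# Assumption: candidates are promoted in this priority order; the slide
# lists EFF/ADDR/ORIG together and does not rank them explicitly.
PRIORITY = ["ACT", "PAT", "EFF", "ADDR", "ORIG"]


def shift_roles(cognitive_roles):
    """Map each cognitive role of a verb to the functor it receives."""
    unknown = set(cognitive_roles) - set(PRIORITY)
    if unknown:
        raise ValueError(f"not an inner participant: {sorted(unknown)}")
    # Order the roles that are actually present by priority.
    present = [role for role in PRIORITY if role in cognitive_roles]
    functors = {}
    # The first two participants are assigned syntactically: ACT, then PAT.
    for role, functor in zip(present, ["ACT", "PAT"]):
        functors[role] = functor
    # Any further participants keep their semantically motivated functor.
    for role in present[2:]:
        functors[role] = role
    return functors


# A verb with a cognitive patient and effect but no agent:
print(shift_roles({"PAT", "EFF"}))          # {'PAT': 'ACT', 'EFF': 'PAT'}
# A verb with three participants: nothing needs to shift.
print(shift_roles({"ACT", "PAT", "ADDR"}))  # {'ACT': 'ACT', 'PAT': 'PAT', 'ADDR': 'ADDR'}
```

With this reading, a frame never contains PAT without ACT, nor a third inner participant without both ACT and PAT.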

The shift of EFF to PAT

Problems and solutions
problem: distinguishing between an inner participant and some free modification
solution: criterion of the priority of verbal government (direct transitivity)
  – affecting case inflection in the direct object (the accusative case) → inner participant
  – surface morphemic realization with a preposition → free modification
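As a rough illustration, the criterion for borderline complements can be written as a two-way decision on the surface form. This is only a sketch under stated assumptions: the `SurfaceForm` type, the function name `classify_borderline`, and the "undecided" fallback for forms the slide does not mention are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SurfaceForm:
    """Surface morphemic realization of one complement (hypothetical type)."""
    case: Optional[str] = None         # e.g. "accusative"
    preposition: Optional[str] = None  # None when no preposition is used


def classify_borderline(form: SurfaceForm) -> str:
    """Apply the priority-of-verbal-government criterion to a borderline case."""
    # Realization with a preposition counts as a free modification.
    if form.preposition is not None:
        return "free modification"
    # Direct government by the verb (accusative object) counts as an inner participant.
    if form.case == "accusative":
        return "inner participant"
    # Assumption: anything else is left to the annotator.
    return "undecided"


print(classify_borderline(SurfaceForm(case="accusative")))
print(classify_borderline(SurfaceForm(case="genitive", preposition="fī")))
```

The function is meant only for complements whose status is unclear on semantic grounds; it does not claim that every prepositional complement is a free modification.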

Inner participant vs. free modification

Problems and solutions II
problem: (i) literal, (ii) figurative, and (iii) idiomatic meanings of a verb – emphasis on consistency
solution: for all these meanings separate valency frames are distinguished (with possible surface morphemic variants)

Valency of verbonominal derivatives
a participle (active and passive) and a verbal noun preserve some valency properties of the original verb
valency frame of verb – ACT, PAT, ORIG
verbal noun can preserve all valency slots
active participle – ACT is absorbed
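The two derivations stated on this slide can be sketched as a transformation of the verb's frame. The function name `derive_frame` and the string labels are hypothetical, and the verbal-noun case returns the maximal frame, since the slide only says the slots can be preserved. The passive participle is left out because the slide does not state its behavior.

```python
def derive_frame(verb_frame, derivative):
    """Return the valency slots a verbonominal derivative may inherit."""
    frame = list(verb_frame)
    if derivative == "verbal noun":
        # A verbal noun can preserve all slots of the verb (upper bound).
        return frame
    if derivative == "active participle":
        # The participle itself stands for the actor, so ACT is absorbed.
        return [functor for functor in frame if functor != "ACT"]
    raise ValueError(f"behavior not described for: {derivative}")


verb_frame = ["ACT", "PAT", "ORIG"]
print(derive_frame(verb_frame, "verbal noun"))        # ['ACT', 'PAT', 'ORIG']
print(derive_frame(verb_frame, "active participle"))  # ['PAT', 'ORIG']
```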

Information stored in the valency lexicon
lexeme – represented by a lemma (citation form)
lexical unit – particular meaning of a given verb described by its valency frame
valency frame – sequence of frame slots containing both obligatory and optional inner participants and only obligatory free modifications (adjuncts) and providing information about their surface morphemic representation (a case, a particular preposition)
diathesis – possible passivisation of a verb
additional information – morphological information, glosses, examples from corpora, frequency and rank of occurrence, syntactic-semantic verb class, etc.
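One possible way to model this structure is sketched below. It is not the lexicon's actual storage format: the class and field names are assumptions, and the lemma, gloss, and surface forms in the example entry are placeholders rather than real lexicon data. The one rule enforced in code comes from the slide: a frame stores free modifications only when they are obligatory.

```python
from dataclasses import dataclass, field
from typing import List

INNER_PARTICIPANTS = {"ACT", "PAT", "ADDR", "ORIG", "EFF"}


@dataclass
class FrameSlot:
    functor: str                 # e.g. "ACT", "PAT", "DIR3"
    obligatory: bool
    forms: List[str] = field(default_factory=list)  # case or preposition

    def __post_init__(self):
        # Free modifications enter a frame only if they are obligatory.
        if self.functor not in INNER_PARTICIPANTS and not self.obligatory:
            raise ValueError("optional free modifications are not stored")


@dataclass
class LexicalUnit:
    gloss: str                   # one particular meaning of the verb
    frame: List[FrameSlot]
    passivisable: bool = False   # diathesis information
    examples: List[str] = field(default_factory=list)


@dataclass
class Lexeme:
    lemma: str                   # citation form
    units: List[LexicalUnit] = field(default_factory=list)


# Placeholder entry; none of these values come from the real lexicon.
entry = Lexeme(
    lemma="LEMMA",
    units=[
        LexicalUnit(
            gloss="GLOSS",
            frame=[
                FrameSlot("ACT", True, ["nominative"]),
                FrameSlot("PAT", True, ["accusative"]),
                FrameSlot("DIR3", True, ["PREP + genitive"]),
            ],
            passivisable=True,
        )
    ],
)
print([slot.functor for slot in entry.units[0].frame])  # ['ACT', 'PAT', 'DIR3']
```

Keeping one `LexicalUnit` per meaning mirrors the decision to give literal, figurative, and idiomatic meanings their own frames.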

Thank you!

Complements ACT, PAT, TWHEN

Complements ACT, PAT, ADDR

Complements ACT, DIR1, DIR3

Figurative meaning of verb → one meaning, one frame (with morphemic variants)

Idiomatic meaning → a separate frame e.g. one particular meaning of verb “to set one’s hopes to”