North Saami Verb Profiling

Slides:



Advertisements
Similar presentations
Does it Swap Or drop ?. Main Auxiliaries DROP : Do Does Did SWAP : To be + Ing Have/Has+ Participle Will + Verb Would + Verb.
Advertisements

1 Reproduced by kind permission of Erik Smitterberg (PhD, Docent), Department of English, Uppsala University A-level Grammar 4: Verbs and Verb Phrases.
On Two Russian Constructions: What Else If Not Synonyms? Nezrin Samedova, Azerbaijan University of Languages.
Natural Language Processing Syntax. Syntactic structure John likes Mary PN VtVt NP VP S DetPNVtVt NP VP S Every man likes Mary Noun.
VERBS.
Lecture # 7 Chapter 4: Syntax Analysis. What is the job of Syntax Analysis? Syntax Analysis is also called Parsing or Hierarchical Analysis. A Parser.
Modeling the Data: Conceptual and Logical Data Modeling
Dependency Parsing with Reference to Slovene, Spanish and Swedish Simon Corston-Oliver Anthony Aue Microsoft Research.
Helping Verbs.
VERBS Verb is a part of speech that shows:  ACTION  STATE OF BEING (NON-ACTION) State of being –be Feelings - love Senses - see Mental activity or state-
Day 38 – Intro to Fiction INSTRUCTOR: KYLE BRITT.
Thinking about inflections How to find verb inflections (Part of Dick Hudson's web tutorial on Word Grammar)web tutorial.
File and Database Design SYS364. Today’s Agenda WHTSA DBMS, RDBMS, SQL A place for everything and everything in its place. Entity Relationship Diagrams.
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
+ Sentence Structure Creating sophisticated, age-appropriate sentences.
Unit 10. Verb Patterns There are four verb patterns in English 1- verb + to + infinitive I want to go there I’d like to visit him. The following is a.
EFL 084 Grammar 4 Modal Auxiliaries –Meaning Probability Necessity Advisability Ability –Time Present/future structure Past structure.
Chapter 15 Section 15.6 Limits and Continuity; Equality of Mixed Partials.
Improving Subcategorization Acquisition using Word Sense Disambiguation Anna Korhonen and Judith Preiss University of Cambridge, Computer Laboratory 15.
Wellcome to ENGLISH 2 class 3 rd Meeting. Passive vs active Subject of A Sentence Performs the Action of the Verb VS Subject of A Sentence Receives the.
Verb tenses.
Parsing Introduction Syntactic Analysis I. Parsing Introduction 2 The Role of the Parser The Syntactic Analyzer, or Parser, is the heart of the front.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
§1.2 Differential Calculus
§1.2 Differential Calculus Christopher Crawford PHY 416G
Verb phrases Main reference: Randolph Quirk and Sidney Greenbaum, A University Grammar of English, Longman: London, (3.23 – 3.55)
Are Verbs important? Why/ why not? TRUE or FALSE? The English verb has only 2 forms. Right answer: It has 3 forms: The Infinitive, the Gerund & the.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
Subject Predicate Subject Main verb (Nominative structure) Auxiliary (link) verb.
1.¿Which is the use of Present Progressive? 2.¿Which is the structure of Present Progressive in a positive form ? 3.¿ What way should be the verb in.
NATURAL LANGUAGE PROCESSING
McGraw-Hill/Irwin Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Modeling the Data: Conceptual and Logical Data Modeling.
Auxiliaries in simple past How to work with “did” and “was-were”
3 Unit 1.
How to use the Internet to Speak and Write in Italian 1.Write simple sentences in English. 2.Identify the verb 3.Translate the verb correctlyhttp://dictionary.reverso.net/english-
CSC 594 Topics in AI – Natural Language Processing
Communication The different categories of communication include:
Cognitive Language Processing for Rosie
SENSEVAL: Evaluating WSD Systems
Parsing & Context-Free Grammars
Programming Languages Translator
Textbook:Modern Compiler Design
Lecture 5 (Modality) Ling 442.
VERBS.
Urdu-to-English Stat-XFER system for NIST MT Eval 2008
Regression.
DGP – Sentence 1 Sentence Parts.
Chapter Eight Syntax.
A field guide to North American grammar
Laura A. Janda UiT The Arctic University of Norway Francis M. Tyers
The /a/  /aj/ Shift in Russian Verbs and Cognitive Linguistics
Lecture 15: Text Classification & Naive Bayes
Statistical NLP: Lecture 13
Verbs.
A Statistical Model for Parsing Czech
Regular Grammar - Finite Automaton
Which of the following graphs is the graph of the derivative of the function {image} . 1. {applet}
Chapter Eight Syntax.
Rules for Constructing Isolines
Primary key Introduction Introduction: A primary key, also called a primary keyword, is a key in a relational database that is unique for each record.
Four Languages Verbs from the Bottom up
Lexico-grammar: From simple counts to complex models
Russian multiplication
Test corrections (SPN IB lección 4 test) ~Correct only the ones you got wrong! ESCUCHAR (#1-#5) = Listen to the selection, write the words in Spanish.
David Kauchak CS159 – Spring 2019
Meaning Based Grammar: The Argument for the Auxiliary
NLP.
WRITING / SPEAKING IDEAS ORGANIZATION: INTRODUCTION
Presentation transcript:

North Saami Verb Profiling Jussi Ylikoski, Laura A. Janda, Ciprian Gerstenberger, Hanne Eckhoff

Size and Source of database Note: disambiguation is a major problem a given form can easily have over 10 interpretations About 80,000 words, comprised of: North Saami “Gold” corpus, disambiguated by hand, about 30,000 words Johan Turi’s (1910) Muitalus sámiid birra (An account of the Saami). Automatically disambiguated, plus correction of verb forms, about 50,000 words

Protocol for North Saami Derived verbs: Treat all derived verbs as separate lexemes. This includes verbs derived in -goahtit “start X-ing” and also passive verbs (roughly parallel to Russian -ся verbs).   Auxiliary verbs: Ignore/remove all forms of “ii” (negation verb) as aux. Ignore/remove all forms of “leat” as aux. Include all other auxiliary verbs as verbs on equal footing with main verbs. [Assumption: We have the following relevant types of constructions: “ii” aux + negation forms of main verbs “leat” aux + non-finite forms of main verbs other aux + non-finite forms of main verbs For 1. and 2. we will take only the forms of the main verbs, but for 3. we will take both the aux and the main verb.] Threshold: Retain/focus on only lemmas that have 20+ forms in the combined corpus.

Form of data COLUMN 1: PASSPORT Gold vs. Turi COLUMN 2: FORM_ID Unique line # and word ID COLUMN 3: FORM The actual form of the verb found (boahtit, boahtime, gorrojuvvon, jáhkkigoahtit) COLUMN 3: LEMMA The infinitive form of the verb (boahtit, gorrojuvvot, jáhkkigoahtit) COLUMN 5: ANALYSIS Full parse of form COLUMN 5: SUBPARADIGM Ind.Prs, Ind.Prt, Imprt, Inf, PrfPrc, etc. COLUMN 6: CONTEXT The entire sentence that the example form is found in.

Example from dataset PASSPORT;FORM_ID;FORM;LEMMA;ANALYSIS;SUBPARADIGM;CONTEXT Gold;g_3_1;Sávan;sávvat;V.TV.Ind.Prs.Sg1;Ind.Prs;"Sávan didjiide lihku , vástidii Evo Morales . " Translation: I wish you luck, answered Evo Morales.

Results Almost 20,000 verb forms About 120 verbs cross the threshold (>20 forms) Partially annnotated for auxiliary verbs semantic classes following Russian National Corpus