Ferenc HAVAS, Budapest Congressus Undecimus Internationalis Fenno-Ugristarum 12. 8. 2010 Piliscsaba.

2005: Initiative launched at the Tenth International Congress of Finno-Ugrists held in Yoshkar-Ola, Mari El
2008: Closer delineation of the project at a dedicated international conference at Vienna University
2008–2009: Presentation of the project at various conferences (Bratislava, Khanty-Mansiysk, Moscow, Tallinn, Szeged)
2010: CIFU: Establishment of a steering committee

General outline of its structure and purpose

A database is a grid (table) consisting of columns and rows that yield cells at their intersections.

The columns represent Uralic languages…

… whereas the rows stand for parameters, i.e. typologically salient features of languages.

The content of a cell depicts the way the given parameter materializes in a given language.
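
The grid described on these slides can be sketched as a small data structure. This is only an illustration under stated assumptions, not the project's actual implementation: the class name `TypologyGrid`, its methods, the dialect labels `dialect_1` and `dialect_2`, and the cell contents are hypothetical placeholders rather than data from the database.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class TypologyGrid:
    """Rows are parameters, columns are dialects, cells hold value codes."""

    dialects: List[str] = field(default_factory=list)    # column headings
    parameters: List[str] = field(default_factory=list)  # row headings
    cells: Dict[Tuple[str, str], str] = field(default_factory=dict)

    def set_value(self, parameter: str, dialect: str, code: str) -> None:
        """Fill one cell, registering the row and column if they are new."""
        if parameter not in self.parameters:
            self.parameters.append(parameter)
        if dialect not in self.dialects:
            self.dialects.append(dialect)
        self.cells[(parameter, dialect)] = code

    def get_value(self, parameter: str, dialect: str) -> Optional[str]:
        """Return the cell content, or None if the cell is still void."""
        return self.cells.get((parameter, dialect))


# Placeholder entries, invented purely for illustration.
grid = TypologyGrid()
grid.set_value("Placement of possessive pronouns", "dialect_1", "Poss(NB)N")
grid.set_value("Placement of possessive pronouns", "dialect_2", "NoPoss(NB)")
print(grid.get_value("Placement of possessive pronouns", "dialect_1"))
```

Keying the cells by a (parameter, dialect) pair keeps void cells implicit: a cell that has not been specified is simply absent.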

Displaying a given parameter’s specification for several languages:

Displaying a given parameter’s specification for a whole subgroup of the language family:

Displaying several parameter specifications for a given language:

How many columns do we need?
The number of living Uralic languages: at least 19 items
Some items commonly considered as collective designations of several dialects
Some items considered sometimes as languages, sometimes as dialects
Often significant differences between dialects of the same language
Conclusion: as many items (separate columns) should be taken into account as there are typologically different dialects
All these items can equally be called and regarded as (representing) dialects in our typological database

Karelian is a language closely related to Finnish, with which it is not necessarily mutually intelligible. Karelian is spoken mainly in the Republic of Karelia, Russia. Dialects spoken in Finnish Karelia (North Karelia and South Karelia) are not considered Karelian but Savonian dialects or Southeastern dialects of Finnish. Karelian is spoken by about 100,000 people, mainly in the Republic of Karelia, Russia, but notable Karelian-speaking communities can also be found in the Tver region. Karelian is also spoken in Finland, where the number of Karelian speakers is estimated at around 5,000. Karelian belongs to the Finno-Ugric languages and is distinguished from Finnish by some important extensions to the phonology and the lack of influence from modern 19th and 20th century Finnish. It cannot merely be classified as a Finnish dialect with Russian influences, because it has original innovations and may differ considerably from Finnish. In the Republic of Karelia, Karelian has official status as a minority language. Since the late 1990s there have been moves to pass special language legislation that would give Karelian an official status on a par with Russian. In Finland, Karelian has official status as a non-regional national minority language. There is no standard Karelian language, although the Republic of Karelia's authorities have recently begun to attempt standardization. Each writer writes in Karelian according to his own dialectal form. The script is the Latin alphabet as used for Finnish, with letters added.

Parameter sources:
WALS = World Atlas of Language Structures (or search for WALS)
Matthew Dryer's Typological Database (or search for Matthew Dryer)
My own improvements and supplements

Parameter specification:
Set of parameters: a revised and enhanced inventory of typologically salient grammatical features
Definition of parameters: precise but not overly sophisticated definitions (indicating alternative terminology if necessary)

Parameter value specification:
Set of parameter values: as full a set as possible of the different attested patterns of realizing the given parameter in the world’s languages
Definition of parameter values: precise and unequivocal definitions
Format of parameter values: a code that is as transparent as possible (a string of abbreviations)
Coding mixed situations in a given dialect: combining two value codes using a restricted set of linking symbols (such as & for “equally present” and / for “both present, with the first one being dominant”)
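
The linking symbols can be read mechanically. The following is a minimal sketch, assuming that a cell holds either one value code or exactly two codes joined by `&` or `/`. The names `CellValue`, `LINKS` and `parse_cell` are hypothetical and not part of the project; the sample codes are value abbreviations used elsewhere in this presentation.

```python
from typing import NamedTuple, Optional

# Linking symbols named on the slide and the relation each one expresses.
LINKS = {"&": "equal", "/": "dominant"}


class CellValue(NamedTuple):
    first: str             # the only code, or the first of two
    second: Optional[str]  # the second code, if the situation is mixed
    relation: str          # "single", "equal" (&) or "dominant" (/)


def parse_cell(code: str) -> CellValue:
    """Split a cell code into its value codes and their relation."""
    text = code.strip()
    found = [symbol for symbol in LINKS if symbol in text]
    if not found:
        return CellValue(text, None, "single")
    # Assumption: only two codes are ever combined, with one linking symbol.
    if len(found) > 1 or text.count(found[0]) > 1:
        raise ValueError(f"more than two value codes combined: {code!r}")
    symbol = found[0]
    first, second = (part.strip() for part in text.split(symbol))
    if not first or not second:
        raise ValueError(f"incomplete combination: {code!r}")
    return CellValue(first, second, LINKS[symbol])


print(parse_cell("AdjAgr&AdjNumAgr"))
print(parse_cell("AdjAgr / AdjNumAgr"))
print(parse_cell("NoAgr"))
```

Whitespace around the linking symbol is ignored, so `AdjAgr/AdjNumAgr` and `AdjAgr / AdjNumAgr` are treated as the same code.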

The placement of possessive pronouns
Parameter: Placement of possessive pronouns
Values: NoPoss(NB), Poss(NB)N, NPoss(NB)
Comments
Hints: D 31b

Comments:
explanation and definition of the meaning of a given parameter
assistance in identifying the relevant phenomenon
explanation of each possible value of the given parameter

The placement of possessive pronouns
Possessive pronouns are representatives of a specific part of speech: they are non-bound (NB) grammatical words that mark the person and/or number (occasionally also the class or gender) of the possessor next to a noun or noun phrase denoting something possessed (the possessee). They can be placed before or after the possessed noun (noun phrase).
The existence of possessive pronouns as an independent part of speech can be stated for a language only if the grammatical words marking the person and/or number of the possessor are not identical to (some form of) the personal pronouns. For example, an ordinary genitive form of a personal pronoun representing the possessor cannot be considered a possessive pronoun. Similarly, an ordinary personal pronoun obligatorily extended with some other grammatical element in the possessor function (like the definite article in Hungarian az én…) is not a possessive pronoun either.
Values:
NoPoss(NB): there are no possessive pronouns as an independent part of speech in the given dialect.
Poss(NB)N: there are possessive pronouns in the dialect and they are placed before the possessed nouns (noun phrases).
NPoss(NB): there are possessive pronouns in the dialect and they are placed after the possessed nouns (noun phrases).
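
The choice among the three values can be summarized as a two-step decision. The sketch below only restates the definitions above; the function name and its two boolean inputs are hypothetical, and establishing those inputs for a real dialect remains a matter of linguistic analysis that no code can replace.

```python
def classify_possessive_placement(
    has_distinct_possessive_words: bool,
    precedes_possessed_noun: bool = True,
) -> str:
    """Map two descriptive findings to a value code of the parameter.

    has_distinct_possessive_words: True only if the dialect has non-bound
        words marking the possessor that are NOT identical to (some form of)
        the personal pronouns, e.g. not merely their genitive forms.
    precedes_possessed_noun: position relative to the possessed noun;
        ignored when there are no possessive pronouns at all.
    """
    if not has_distinct_possessive_words:
        return "NoPoss(NB)"
    return "Poss(NB)N" if precedes_possessed_noun else "NPoss(NB)"


print(classify_possessive_placement(False))
print(classify_possessive_placement(True, precedes_possessed_noun=True))
print(classify_possessive_placement(True, precedes_possessed_noun=False))
```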

The agreement of adnominal adjectives with their head nouns
Parameter: Agreement of adnominal adjectives
Values: NoAgr, NoAdjAgr, AdjAgr, AdjNumAgr, AdjCaseAgr, AdjClassAgr, AdjNumAdjCaseAgr
Comments
Hints:

Agreement of adnominal adjectives: comments (1)
Agreement of adnominal adjectives is an obligatory marking, within the morphological shape of the adjectives themselves, of the class/gender, number and case (or at least one of these features) of the nouns that syntactically govern, and are semantically modified by, adnominal (i.e. non-predicative) adjectives. We can consider the agreement of adjectives denoting primary properties like size, shape, colour etc. as prototypical, and we should examine their agreement with head nouns in non-nominative (oblique) case forms, if available. [1]

Agreement of adnominal adjectives: comments (2)
Values:
NoAgr: There is no marking of either class (gender) or number or case in the given dialect, so no agreement between an adnominal adjective and noun can take place.
NoAdjAgr: No agreement in adjectival phrases, though adnominal agreement in other types of phrases occurs. [2]
AdjAgr: Agreement of adjectives takes place relating to all grammatical features of the head noun in attributive phrases.
AdjNumAgr: Though head nouns have several grammatical features, agreement of adnominal adjectives takes place only relating to number. [3]
AdjCaseAgr: Though head nouns have several grammatical features, agreement of adnominal adjectives takes place only relating to case form.
AdjClassAgr: Though head nouns have several grammatical features, agreement of adnominal adjectives takes place only relating to class/gender.
AdjNumAdjCaseAgr: Though head nouns have more than two grammatical features, agreement of adnominal adjectives takes place only relating to number and case. [4]

Agreement of adnominal adjectives: comments (3)
[1] If several values of the parameter are characteristic of the given dialect, we can link the different values with the symbol “&” if they occur (as types) evenly, or with the symbol “/” if the first value is dominant but the second one occurs as well. For example, AdjAgr&AdjNumAgr would mean a dialect in which agreement in all respects and agreement relating only to number occur with the same frequency, whereas AdjAgr/AdjNumAgr is a dialect in which agreement in all respects is the general rule but there is a limited yet considerable number of cases (in the paradigm) where agreement in number only (and not in case, for example) takes place.
[2] This means that agreement relating to class/gender, number and case (or at least one of them) takes place in other, non-adjectival attributive structures, e.g. with adnominal determiners.
[3] Phenomena such as the adjective obligatorily taking some special non-nominative (oblique) shape next to the modified noun, displaying a form which nonetheless only marks the number (and not, for example, the case form) of the head, should also be considered here.
[4] Following this pattern, we could set further values as well, e.g. AdjNumAdjClassAgr, representing a dialect in which the adnominal adjective would only agree with its head in number and class/gender but not in case form.

Background information
Cf.
a fekete kutyá-val
ART.DEF black-Ø dog-INSTR/COMIT
‘with the black dog’
but
ez-zel a kutyá-val
this-INSTR/COMIT ART.DEF dog-INSTR/COMIT
‘with this dog’

Creating the IT apparatus of the database and implementing it technically as an online tool
Presenting the core stock of criteria (inventory of dialects, parameters and parameter values) for the database in English and Russian
Researchers of the different dialects specify the parameter values for the particular dialects and prepare the background information for them
Inserting new dialects or parameters if necessary

Adding a new dialect:

Adding a new parameter:

Supervisory Board: invitation of authors, call for offers, peer review, approval, translation, insertion

The database is an online tool accessible to any researcher. Using the database means collecting parameter values for a particular set of typological parameters in a particular set of Uralic dialects. The set of dialects may consist of one item, several items, or all of them, whereas the number of selected parameters should range from one to a reasonable limit.

Displaying the search results in the form of a printable and storable table (grid)
The resulting table is a subset of the hidden (virtual) comprehensive database. It is also equipped with the relevant hyperlinks:
in the column headings: textual identification and, where possible, closer characterization of the related dialects;
in the row headings: parameter specifications;
in the cells: background information relating to the specified parameter values.
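
A query of this kind amounts to cutting a sub-grid out of the full table. The sketch below assumes the cells are stored in a mapping keyed by (parameter, dialect). The function names, the limit `MAX_PARAMETERS`, the dialect labels and the cell contents are all hypothetical; the presentation speaks only of "a reasonable limit" and does not describe the implementation.

```python
from typing import Dict, List, Optional, Tuple

Cells = Dict[Tuple[str, str], str]
SubTable = Dict[str, Dict[str, Optional[str]]]

MAX_PARAMETERS = 20  # hypothetical stand-in for "a reasonable limit"


def select_subtable(cells: Cells, parameters: List[str],
                    dialects: List[str]) -> SubTable:
    """Return the requested rows (parameters) and columns (dialects)."""
    if not dialects:
        raise ValueError("select at least one dialect")
    if not 1 <= len(parameters) <= MAX_PARAMETERS:
        raise ValueError(f"select between 1 and {MAX_PARAMETERS} parameters")
    # Cells that have not been specified yet appear as None.
    return {p: {d: cells.get((p, d)) for d in dialects} for p in parameters}


def void_cells(table: SubTable) -> List[Tuple[str, str]]:
    """List the (parameter, dialect) pairs that still have no value."""
    return [(p, d) for p, row in table.items()
            for d, value in row.items() if value is None]


# Placeholder data, invented purely for illustration.
cells: Cells = {
    ("Placement of possessive pronouns", "dialect_1"): "Poss(NB)N",
    ("Agreement of adnominal adjectives", "dialect_1"): "AdjAgr/AdjNumAgr",
    ("Agreement of adnominal adjectives", "dialect_2"): "NoAdjAgr",
}
table = select_subtable(
    cells,
    ["Placement of possessive pronouns", "Agreement of adnominal adjectives"],
    ["dialect_1", "dialect_2"],
)
for parameter, row in table.items():
    print(parameter, row)
print(void_cells(table))
```

Returning unspecified cells as `None` rather than omitting them keeps the result rectangular, which suits a printable table and makes the remaining gaps easy to list.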

Contribution to the improvement of the grammatical description of the Uralic languages
Unforeseen gaps in the research on certain dialects → motivation for in-depth inquiries into the dialects themselves; supply of topics for conference talks, articles, monographs
Void cells to be filled in → supply of topics for graduate students’ assignments, theses, essays, PhD dissertations

Pilot project: Typological Database of the Ugric Languages
Yugra University, Khanty-Mansiysk
ELTE University, Budapest
A dedicated point in the workplan within the valid contract between the two universities
A pending application for a National Scientific Research Foundation grant in Hungary

Setting up further pilot projects? Typological Database of the Ugric Languages…

Setting up further pilot projects? Typological Database of the Permic Languages…

Setting up further pilot projects? Typological Database of the Cheremis and Mordvin Languages…

Setting up further pilot projects? Typological Database of the Finnic Languages…

Setting up further pilot projects? Typological Database of the Samoyedic Languages…

Setting up further pilot projects? Seeking investigators and research institutes/departments…

Introduction to the project: Uralic Typology Database Project website: (or search for “urtypol” in your browser)
In Russian: Uralic Typology Pages: (or search for “uralictypology” in your browser)
Article: «Проект типологической базы данных уральских языков» [“Project of a typological database of the Uralic languages”]. Финно-угорский мир, 2009/4, 42–46., 2010.

The PowerPoint presentation you have just seen will soon be accessible in both English and Russian on the Uralic Typology Database Project website: (or search for “urtypol” in your browser)

Thank you for your attention. Questions? Comments?