Marijke Mooijaart INL Leiden Bullay Deutschland april 2010 Fifteen centuries Dutch vocabulary Four historical dictionaries in one internet application.

Slides:



Advertisements
Similar presentations
DICTIONARY Get to know your.
Advertisements

Keyboarding Objective Apply language skills in keyed documents
In the Format section, we have activated the Bibliographic style drop down menu. From this page, you can choose a specific journal or format (e.g. BMC.
THE STEPS OF SEARCH You have opened a new veterinary clinic in a small town, and want people in the vicinity to know about it. You need some new ideas.
The Sketch Engine for Dutch with the ANW corpus Carole Tiberius.
Example queries for Federated search Jan Odijk CLARIN Federated Search Workshop Copenhagen, 24 Apr
Compiled by Helene van der Sandt. Is a search engine that searches for scholarly literature Can search across many disciplines Searches for articles,
English Lexicography.
Dr. Jun Wang Goleman Library An Introduction to. What Is OED? An online dictionary like no other with – 600,000 words from the English speaking world.
Example Database English-German Dictionary
Macrostructure  Front matter  Body  Appendices Jackson, Howard Lexicography: An Introduction. London: Routledge, p. 25.
1 Words and the Lexicon September 10th 2009 Lecture #3.
June 12, 2015 ©2005 Ovid Technologies Jörn Hope Ovid.
Stemming, tagging and chunking Text analysis short of parsing.
Morphology I. Basic concepts and terms Derivational processes
WISER: History Advanced OLIS searches Isabel Holowaty, History Librarian Kate Petherbridge, Upper Camera Superintendent.
Using LegalTrac To Find Law Review Articles. What Is LegalTrac? A commercial service UW Libraries subscribe Indexes law reviews, other legal periodicals.
Dictionary.
Keyboarding Objective 3.01 Interpret Proofreader Marks
Dr. Denison MacPherson, MacIsaac, Gowans Quotations Purpose and Integration.
1. 2 Content WSK Online is a new online database of specialized dictionaries covering all the major areas of linguistics and communication science: Biannual.
Deny A. Kwary Internal Structures of Dictionary Entries.
Literary Research Modern World Literature LIT 4113 Frederic Murray M.L.I.S. Reference Librarian
EMELD Workshop on Digitizing Lexical Information Modeling Lexical Entries in Bilingual Dictionaries —Or— Exegeting the UML Model Mike Maxwell Linguistic.
Chapter Four Morphology
9. Microstructure of Bilingual Dictionaries. The microstructure of the dictionary specifies the way the lemma articles are composed. The lemma article.
Chapter 1: By: Ms. Ola Al-arjani
Please copy your agenda into your planner! Welcome Back!
Lecture Four: Steps 3 and 4 INST 250/4.  Does one look for facts, or opinions, or both when conducting a literature search?  What is the difference.
NooJ international Conference, Komotini, May 2010 Portability of Armenian Corpus by Nooj Anaid Donabedian & Victoria Khurshudian Institut National des.
Moodle (Course Management Systems). Glossaries Moodle has a tool to help you and your students develop glossaries of terms and embed them in your course.
Dr. Kristin Bakken, NO 2014 Oddrun Grønvik, NO 2014 Dr. Daniel Ridings, DOK Sept. 7th 2004.
Medline on OvidSP. Medline Facts Extensive MeSH thesaurus structure with many synonyms used in mapping and multidatabase searching with Embase Thesaurus.
Online Dutch-Frisian Dictionary Pieter Duijff, Frits van der Kuip, Hindrik Sijens Fryske Akademy Introduction The most recent bilingual Dutch-Frisian dictionary.
1. 2 Content Based on over 20,000 articles, the database provides comprehensive information about literary authors in the German-speaking world from the.
1 DATABASE ARTISTS OF THE WORLD (AKL) ONLINE. 2 Content AKL Online is the world’s most contemporary, reliable, and extensive reference work on artists.
Library databases. database NOUN:also data base Computer Science A collection of data arranged for ease and speed of search and retrieval. Also called.
Chapter 3 Monolingual Dictionaries II Arabic Dictionaries.
VOCABULARY STRATEGIES -Awareness of words -Wide reading & extensive writing -Independent strategies.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
Eighth International Congress Of Hittitology Warsaw, 5-9 September 2011 The electronic Chicago Hittite Dictionary:
WISER Finding stuff: Articles Kerry Webb, Deputy Librarian, English Faculty Library Isabel McMann, Academic Liaison Services, Radcliffe Science Library.
CITATION / BIBLIOGRAPHY Moazzam Ali. CITATION DEFINED  A citation is a reference to a published or unpublished source (not always the original source)
UoS Libraries 2011 EndNote X5 - basic graduate session.
1. 2 Content The Historisches Wörterbuch der Rhetorik [Historical Dictionary of Rhetoric] is the only comprehensive academic reference work in the field.
AACR 2 –Rules for Descriptive Cataloguing
1 DATABASE INTERNATIONAL BIBLIOGRAPHY OF PERIODICAL LITERATURE IN THE HUMANITIES AND SOCIAL SCIENCES ONLINE.
GOOGLE SCHOLAR Compiled by Helene van der Sandt. WHAT IS GOOGLE SCHOLAR?
Lexicography Lexicon has two different meanings:
MORPHOLOGY definition; variability among languages.
Levels of Linguistic Analysis
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
Slang. Informal verbal communication that is generally unacceptable for formal writing.
1. 2 Content BTL Online provides electronic access to all print editions of the Bibliotheca Teubnerina Latina: Bibliotheca Teubneriana Latina is the world’s.
Bibliographic Record Description of a book or other library material.
Welcome to All S. Course Code: EL 120 Course Name English Phonetics and Linguistics Lecture 1 Introducing the Course (p.2-8) Unit 1: Introducing Phonetics.
WISER: Resources for Research in Classics and Ancient History Tuesday 22 May 2007 Charlotte Goodall.
Brill Over three centuries of scholarly publishing.
Chapter 3 Word Formation I This chapter aims to analyze the morphological structures of words and gain a working knowledge of the different word forming.
Research Skills and Strategies Using Sources Summarizing, Paraphrasing, and Quoting.
A Bibliographic Management Software NORSHUHADA SAIDIN REFERENCE & RESEARCH DIVISION PERPUSTAKAAN KEJURUTERAAN UNIVERSITI SAINS MALAYSIA.
WISER Finding stuff: Journals and Journal Articles Kerry Webb, Deputy Librarian, English Faculty Library & Angela Carritt, Bodleian Libraries’ User Education.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
عمادة التعلم الإلكتروني والتعليم عن بعد
Information in Monolingual Dictionaries
Levels of Linguistic Analysis
PubMed.
Finding Full-Text Journal Articles
A Comprehensive Index for Classical Studies
Using Dictionaries in Translation (223 TRAJ)
Presentation transcript:

Marijke Mooijaart INL Leiden Bullay Deutschland april 2010 Fifteen centuries Dutch vocabulary Four historical dictionaries in one internet application

Historical Dictionaries of Dutch Oudnederlands woordenboek - ONW Dictionary of Old Dutch, 2009 (online) coverage Vroegmiddelnederlands woordenboek – VMNW Dictionary of Early Middle Dutch (hard copy and online) coverage / Middelnederlands Woordenboek – MNW Dictionary of Middle Dutch (hard copy, CD-ROM, online) coverage /2001 Woordenboek der Nederlandsche Taal - WNT Dictionary of the Dutch Language (hard copy, CD-ROM, online) coverage

Differences source material open or closed corpus, size grammatical information spelling and sound variants; etymology regional information origin of texts and authors quantitative data spelling and sound variants dating first attestation, dated citations definition style; sense structure analytical or synonym definition; sense hierachy

WNT ( /1976 ) 3 meters of WNT volumes

I WNT ( ) source material: open corpus, expanding from titles; no proper names grammatical information: often detailed etymological information regional information : no, or only very global information quantitative data: none dating: originally only in recent volumes definition style; sense structure: analytical definitions; hierarchical structure

MNW ( ) Jakob Verdam *22 January † 19 July 1919

II MNW ( ) source material: open but restricted, mainly literary corpus; basis often 19th-century, critical editions; no proper names grammatical information: spelling and sound variants are included, often as reference entries regional information: information attached to source information (separate volume) quantitative data: none dating: originally only attached to source information (separate volume) definition style and sense structure synonym definitions; more linear sense structure

VMNW ( ) Jacob van Maerlant author and scribe ca – ca. 1300

III VMNW ( ) source material: closed corpus of official and literary texts; diplomatic editions; proper names included grammatical information: succinct etymology regional information: information attached to citations quantitative data on spelling forms and sound variants dating of all citations definition; sense development: synonym definitions; more linear sense structure

ONW ( ) Love song or probatio pennae? Manuscript 11th c. Oldest sentence: maltho, thi afrio, letu ‘I declare, I let you free, half-free man’ Lex Salica, 6th century

IV ONW ( ) source material: closed corpus of three longer literary religious texts; proper names included; fragments and glosses grammatical information: succinct etymology regional information: information in secondary literature about sources quantitative data on spelling forms and sound variants dating of all citations sense development: synonym definitions; more linear sense structure

Integrated search: dictionaries in the Language Bank INL Dutch Language Bank: Electronic database of Dutch from 6th-21st century includes Texts Dictionaries Computational lexica Searches are combined: one search for the results in the various works in the three components

Dictionary data model The four dictionaries are different in many aspects, but are similar as to the macrostructure of the electronic entries: headword grammatical information semantic analysis, including illustrative citations related entries (subentries: compounds, derivatives)

Encoding All data was converted to TEI-XML; various information categories were uniformly encoded For some dictionaries additional data development was necessary: modern Dutch headword for all entries uniform indication of part of speech citations: separate encoding of citation text, source indication and dating abbreviated forms of compounds (as subentries) were completed, e.g. apen-, burger-, dominees-, varkensbrood > apenbrood, burgerbrood, domineesbrood, varkensbrood

Modern Dutch lemma as standard headword head fish deal/ ‘to divide’ Haupt Fisch teilen ONW Old Dutch standard hōvitfiskdēlon (10th c.) VMNWMiddle Dutch standardhovetviscdelen MNWMiddle Dutch standardhovetvischdelen WNTSpelling De Vries-te Winkelhoofdvischdeelen 2009 modern spelling formhoofdvisdelen

Headwords problems modern headword form for lost words or misreadings (MNW) e.g. ONW swēneri ‘swineherd’ MNW swiner WNT zwener (not after 17th c.) modern headword zwijner or zwener? homonyms are usually numbered differently in the four dictionaries: MNW wel III = ‘well’, adverb WNT wel III = ‘bubble’, noun wel V = ‘well’, adverb

wel ‘well’ 1, ONW1 – VMNW2 – MNW3

wel ‘well’ 2, MNW-3

wel ‘well’ 3, MNW-3, WNT4

Internet application: search options various combinations of dictionaries entries (search key: modern headword) words in sense section (definitions) words in citations words in full text sources external links

Internet application: results full article text OR display of article structure (sense numbers and definitions) display with OR without citations all results in articles OR in concordances with number of attestations results in different sections (see advanced search) highlighting of key words in results search results can be exported external links to other reference works (etymology, dialect) and pictures

Further developments: lexicon GiGant project of Dutch Language Bank computational lexicon 6th – 21th century each entry consists minimally of standard form – word form – pos – attestation, with additional semantic, morphological, syntactic information provides access to texts and dictionaries historical variants are retrieved from historical texts AND from historical dictionary citations; e.g. uiterlijk in WNT

uiterlijk – utterly – äusserlich ‘exterior’ WNT UITERLIJK adj. regular, predictable variants: anlautvowel: u – uu – ui – uj – uij – uy – w suffix -lijk: ij - i – y – ie (?) / c – k – ck endings; -e – -st – -ste – -er – -en – -s uytterlijcste uyterlijkste d'uyterlijke uiterlyke uyterlijcke uiterlijke uyterlijck uiterlyken uiterlijkste uiterlicke wterlicke wterlijcke ulterlijk uiterlyk uiterlijk uyterlick wterlicken d'uyterlijcke uiterlijken uiterlijks wterlijck uytterlicke uitterlijke ujterlijke uytterlijk uyterlycke uyterlicken uijterlicke d'uiterlijcke wtterlijcke wterlyke wtterlijk uiterlijke uuterlick uuterlic uyterlijke uyterlijcken uyterlicke d'uiterlyke wterlijke vuyterlijcke uuterlycke uuterlicke wterlijken uyterlijcksten uuyterlicke uuyterlick uuyterlycke uytterl uytterlijcke uytterlycke uytterlick vuytterlicke uiterlijker uyterlyck uterliek wterlijcken uiterlijkst uitterlijk uytterlijcken uyterlyk uiterlijk-net wterlick uutterlijck uuyterlicken uyttelijck uijterlijk uytterlijck uuterlijck uiterlick uitterlyk uuyterlic uuyterlyck uuyterlijck uiterlijck uytterlyck uterlyc wterlijk

Further research: sense development from Old Dutch through Modern Dutch tuin, old germanic tūn (german zaun, english town) ONW: ‘fence, palings, enclosure(?)’. 1 citation from the Wachtendonck Psalms (10th c.) VMNW: ‘fence, palings’. Oldest attestation Much onomastic material. MNW:1. ‘fence, palings; wattle’. 2. ‘by the fence enclosed area; yard’. Citations from the whole Middle Dutch period. WNT: 1. ‘fence’. Obsolete; current in the 16th c., after that less common; still to be found in dialects. 2. ‘separate piece of land, among other things for growing vegetables or flowers; (engl. ‘garden’)’. Since the 17th c., common from the 19th c. onwards.