D-square (D-kwadraat): Digital Databases and Tools for Dutch Dialect Dictionaries
Jos Swanenberg, Folkert de Vriend & Roeland van Hout
Topics
- Historical background
- Overview of project phases
- Conversion procedures
- New encoding for data
- End user access to the data
Macro structure WBD & WLD
Volumes:
1. Agricultural terminology
2. Other technical or craft terminologies
3. Common vocabulary
Micro structure WBD & WLD
Constituents:
- Lexical meaning (title, description of the concept)
- Lexical form (‘dutchified’ entry)
- Phonetic form
- Sources
- Geographical code (+ map)
WBD & WLD Example of WLD, volume 1:
History of automation
- Filing cards
- Word processor, Genoveva
- Databases + word processor
- 2002: Online database WBD
- D-square
WBD & WLD Filing cards:
WBD & WLD Example of WLD, volume 1:
Online database WBD
Example from database: “Meikever” (Eng: “maybug”)
Example of WBD, volume 3
Online database, query
Online database, query result
[Diagram: flow of dictionary material from analog to digital, for editors/management and for users. Labels shown: Questionnaires Nijmegen and Leuven; Questionnaires (chiefly) Meertens; Filing cards; Raw data (Vol. I+II, Vol. III); Edited data (Vol. I+II, Vol. III); MacWrite; FileM Pro; MS-Word; Online DB WBD (Polderland); SGV on CD (Polderland); Raw data XML; Edited data XML; Enriched data XML; Website WBD/WLD with tools for searching and cartography; Specialized print editions (dialect atlas or local dictionary).]
Overview phases D-square
1. Conversion to a new format
2. End user access to data
3. Enrichment of data
4. Data management
Phase 1: Conversion to a new format
Reasoning behind new encoding
- XML, not relational database
- Tailored to WBD and WLD
- Flexible enough to be used for other dialect dictionaries
- Based on standard: LMF (ISO TC 37/SC 4)
Example from WBD, meikever
Example from database: “Meikever” (Eng: “maybug”)
Example XML-encoding … Meikever Bakkertje bakkerke bakkərkə K 178 …
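Since the markup of the slide example is not preserved here, the following is only a minimal sketch of how one entry with the constituents of the micro structure could be written as XML. The element and attribute names (`entry`, `concept`, `form`, `lexical`, `dialect`, `phonetic`, `location`, `code`) are hypothetical and are not the project's actual LMF-based schema; the assignment of "Bakkertje", "bakkerke" and "bakkərkə" to those fields is likewise an assumption.

```python
import xml.etree.ElementTree as ET


def build_entry(concept, lexical_form, dialect_form, phonetic_form, location_code):
    """Build one dictionary entry as an XML element.

    All element and attribute names are hypothetical placeholders,
    not the D-square encoding itself.
    """
    entry = ET.Element("entry")
    # Lexical meaning: the concept title.
    ET.SubElement(entry, "concept").text = concept
    # Word forms attested for this concept.
    form = ET.SubElement(entry, "form")
    ET.SubElement(form, "lexical").text = lexical_form    # 'dutchified' entry (assumed)
    ET.SubElement(form, "dialect").text = dialect_form    # dialect spelling (assumed)
    ET.SubElement(form, "phonetic").text = phonetic_form  # phonetic form
    # Geographical code of the place where the form was recorded.
    ET.SubElement(entry, "location", code=location_code)
    return entry


entry = build_entry("Meikever", "Bakkertje", "bakkerke", "bakkərkə", "K 178")
xml_text = ET.tostring(entry, encoding="unicode")
print(xml_text)
```

Keeping the geographical code as an attribute rather than as element text is one possible choice; it makes it easy to select all forms for a given location when drawing a map.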
Example from WALD
Example from dictionary of the dialects of Zeeland
Phase 2: End user access to data
Small scale survey
- Tools: search engine, cartographic tool, format conversions
- Enrichment: POS, morphemes (syllables)
- Links to other resources: other dictionaries, questionnaires, FAND, MAND
Difficulties to overcome
- Search engine: getting from question to query (coaching needed). Is SmartMatch (fuzzy matching) helpful in this regard? Speed of XML searching.
- Cartography: availability of base maps.
- Links to other resources: differences in interpretation.
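To illustrate what fuzzy matching could contribute to getting from question to query, here is a small sketch. It does not use SmartMatch; Python's standard `difflib` module serves as a stand-in. The headword list, the function name `fuzzy_lookup` and the cutoff value are assumptions made for illustration only, not project data or settings.

```python
import difflib

# Illustrative headword list (not taken from the dictionary databases).
HEADWORDS = ["bakkertje", "meikever", "kever"]


def fuzzy_lookup(query, headwords=HEADWORDS, cutoff=0.7, limit=3):
    """Return up to `limit` headwords that resemble the query.

    Similarity is difflib's sequence-matching ratio (0.0 to 1.0);
    candidates scoring below `cutoff` are dropped.
    """
    return difflib.get_close_matches(query.lower(), headwords, n=limit, cutoff=cutoff)


# A user typing a dialect spelling or a typo still reaches a headword.
print(fuzzy_lookup("bakkerke"))
print(fuzzy_lookup("meikevr"))
```

A lower cutoff returns more candidates but also more noise, so end users would probably still need some guidance in choosing among the suggestions.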
Information about D-square
Questions?