Www.isocat.org ISOcat: known issues 19 June 20121CLARIN-NL ISOcat workshop.

Slides:



Advertisements
Similar presentations
Topics to be covered: Paragraph construction * general to specific/identify the problem-provide a solution Paragraph issues * using definitions * internal.
Advertisements

EHR-S Reconciliation Worksheet Instructions. The spreadsheet is an extract from the EHR-S Database. Each column is Filterable by click- ing on the header.
The Library of the University of Konstanz Fostering Open Access by a new international information platform Anja Kersting, Karlheinz Pappenberger Library.
Chapter Two The Scope of Semantics.
© Paradigm Publishing, Inc Word 2010 Level 2 Unit 1Formatting and Customizing Documents Chapter 2Proofing Documents.
ISOcat introduction 19 June 20121CLARIN-NL ISOcat workshop.
Data Category specifications 19 June 20121CLARIN-NL 2012 ISOcat tutorial.
DR. AZAD H. FATAH Definitions of word. Definitions A word is a sound or combination of sounds that has a meaning and is spoken or written. A word is a.
CLARIN-NL/VL procedure 20 June 20131CLARIN-NL ISOcat workshop.
11 CLARIN? ISOCAT! Ineke Schuurman ISOcat content coördinator CLARIN-NL Amsterdam
© ISIS How do we talk to zoo keepers? How do we talk to zoo keepers? Michele Peters University of St. Thomas.
Albert Gatt LIN3021 Formal Semantics Lecture 5. In this lecture Modification: How adjectives modify nouns The problem of vagueness Different types of.
The current state of Metadata - as far as we understand it - Peter Wittenburg The Language Archive - Max Planck Institute CLARIN Research Infrastructure.
Technical Writing S03 Providence University 1 Introduction - Establishing a Context PROVIDENCE UNIVERSITY College of Management Wu-Lin Chen
Semantics Heasley and Hurford Universe of discourse and definiteness Objective: –Students will be able to apply the concept of universe of discourse and.
Its Grammatical Categories
Preparing for the Peer Review Next Class: Definite and Indefinite Articles How do native speakers use them?
E-Government and interoperability : the role of Machine Translation Francisco García Morán Chief IT Advisor European Commission e-Government powered.
ISOcat: known issues 10 May /20111CLARIN-NL ISOcat workshop.
Development of WIPOSTAD Progress report by the International Bureau Committee on WIPO Standards Geneva May Anna Graschenkova Industrial Property.
Reporting on Policies and Measures Introductory presentation by the UNFCCC secretariat Workshop on the preparation of fourth national communications from.
Matti Vartiala, a young civil engineer, has a problem: He is reading an English instruction sheet on surveying, which says he should ’…take a test sample.
Data Category specifications 20 March 20121CLARIN-NL ISOcat workshop.
CLARIN-NL: Dealing with ISOcat Ineke Schuurman. ISOcat and CLARIN Projects call 1 CLARIN-NL Joint Flemish/Dutch pilot Whenever relevant, elements are.
Principles of the GOLD Ontology & Conversion of GOLD to DCIF Presenters: Anthony Aristar, Evelyn Richter.
CLARIN-NL ISOcat workshop 2011 part 2 Ineke Schuurman Menzo Windhouwer.
2002 October 10SFWR ENG 4G030 Translating from English into Mathematics SFWR ENG 4G Robert L. Baber.
The ISO-DCR 17 January /20111CMDI tutorial Marc Kemps-Snijders a, Menzo Windhouwer b, Sue Ellen Wright c a Meertens Institute, b MPI for.
ISOcat demo and providing RELcat input Menzo Windhouwer The Language Archive tla.mpi.nl Data Archiving and Networked Solutions
Trends in Concept Modelling Turning Issues into Solutions How to Discipline a Cat Sue Ellen Wright, Kent State University.
MY E-PORTFOLIO (WHAT I’VE LEARNED DURING THESE MEETINGS, WHAT IS NOT SO CLEAR, WHAT I DON’T GET AT ALL)
CLARIN-NL Call 3 ISOcat follow-up 10/10/20121CLARIN-NL ISOcat Call 3 follow-up.
DC specifications or “Do’s and don’ts” when creating a DC.
Content of the Data Category Registry 10 May /20111CLARIN-NL ISOcat workshop.
ISOcat: known issues 20 June 20131CLARIN-NL ISOcat workshop.
Junior english CARMAN. Monday August 19 On your notecard, write the following: 1.Your name 2.Something you do when you’re not doing schoolwork.
CLARIN-NL Call 4 ISOcat follow-up 2/10/20131CLARIN-NL Call 4 ISOcat follow-up.
Collecting primary data: use of questionnaires Lecture 20 th.
ISOcat introduction 20 June 20131CLARIN-NL ISOcat workshop.
ISOcat introduction 20 March 20121CLARIN-NL ISOcat workshop.
Content analysis and CERN Roman Chyla. Artificial intelligence Natural language processing Web of data Content analysis.
CLARIN-NL ISOcat workshop 2012 part 2 ( ) Ineke Schuurman Menzo Windhouwer.
11 CMDI/ISOcat And Semantic Operability Ineke Schuurman ISOcat content coördinator CLARIN-NL Menzo Windhouwer ISOcat system administrator Utrecht
ISOcat: How to create a DC (including “do’s and don’ts”) 19 June 20121CLARIN-NL ISOcat tutorial.
CLARIN-NL Requirements and Desiderata Jan Odijk CLARIN-NL Call 3 Info-session Utrecht, 25 Aug 2011.
Beyond ISOcat 20 June 2013CLARIN-NL ISOcat tutorial1.
An Iterative Approach to Extract Dictionaries from Wikipedia for Under-resourced Languages G. Rohit Bharadwaj Niket Tandon Vasudeva Varma Search and Information.
Christian Groves Describing Captures in CLUE and relation to multipoint conferencing draft-groves-clue-multi-content-00 CLUE Interim meeting (09/13)
CLARIN Concept Registry: the new semantic registry Ineke Schuurman, Menzo Windhouwer, Oddrun Ohren, Daniel Zeman
ISOcat status
CLARIN Requirements for a Semantic Registry Daan Broeder The Language Archive – MPI Ineke Schuurman CLARIN-NL/VL – KU Leuven & Utrecht.
1 ISOCAT Proposed solutions for Problems encountered in DUELME-LMF Jan Odijk Nijmegen 21 Sep 2010.
1 CLARIN? ISOCAT! Ineke Schuurman Hilversum,
Creating & Testing CLARIN Metadata Components A CLARIN-NL project Folkert de Vriend Meertens Institute, Amsterdam 18/05/2010.
ARTICLE What is an article in English language? An article is a word that combines with a noun to indicate the type of reference being made by the noun.
ISO TC 37/CLARIN DISCUSSION UTRECHT, DECEMBER 9/ Thinning Down a Bloated Cat SUE ELLEN WRIGHT DECEMBER 2013.
ISOcat: How to create a DC (including “do’s and don’ts”) 20 June 20131CLARIN-NL ISOcat tutorial.
Group work and standardization features in ISOcat Menzo Windhouwer 8/14/20101Standardizing Data Categories in ISOcat - Implementing Group.
ARTICLE A,AN,THE. DEFINITION :- AN ARTICLE IS A WORD THAT IS USED WITH A NOUN TO INDICATE THE TYPE OF REFERENCE BEING MADE BY THE NOUN. ARTICLES SPECIFY.
CMD and TEI CMDI interoperability workshop Utrecht Matej Ďurčo, ICLTT, Vienna.
© Paradigm Publishing, Inc. 1 2 Word 2013 Level 2 Unit 1Formatting and Customizing Documents Chapter 2 Proofing Documents.
Co-Financed by European Regional Development Fund and made possible by the INTERREG IVC MITKE Overall progress 5th Steering Group Meeting Rzeszów, 30th.
ISOcat introduction 10 May /20111CLARIN-NL ISOcat workshop.
Reviewing your Program
Resource Sharing Locate
Q2 MARK SCHEME LEVEL DESCRIPTORS Level 4 Perceptive, detailed
ISOCAT ISOCAT Problems
The INTERACT Website: Important source of information for the ETC Community Karen Vandeweghe, Communications Manager, IS Bratislava 27 January 2010.
PHARM Library Orientation
Numbers 0-60 Leçon 1B.1.
Presentation transcript:

ISOcat: known issues 19 June 20121CLARIN-NL ISOcat workshop

Known issues ISOcat: ongoing effort => there are still a series of ‘loose ends’ – RELcat and SCHEMAcat – Linking / Adopting – Searching – Definitions

RELcat Essential for several Dutch tagsets N(soort, ….) comes with 2 DCs: 1.Noun 2.Common How to relate this with one of the DCs for ‘common noun’, even in case we would find the definition perfect? Good news: in progress!

RELcat Linking DC is not just a ‘nice’ feature – Proper noun – Common noun – Mass noun – Count noun are all instances of ‘noun’ (i.e. have an IsA relation with it)

Searching How to detect which DCs are Standardized? Or have a Dutch language section? How to search using the search options? – Be aware: Profile, Match method How to detect which DCs ‘belong together’ (unless one mentions the tag set in the definition) i.e. which ones can be combined?

Consequences of adopting Suppose, you adopt a specific DC, and some essential changes are made to that DC – You may no longer want to map, but how do you know? Suppose there are several relevant DCs, you adopt one and just that one doesn’t get standardized – You have to redo your work (but you first are to be aware that …)

Ill-defined DCs Profile: morphosyntax – Definition: semantic ‘concept’ in definition not defined in ISOcat, or that concept comes with several DCs (which one was meant?)

Too many DCs There are too many ‘almost the same’ DCs, even within the same profile Too vague DCs There are many DCs with rather ‘empty’ definitions – Proper noun: a noun that represents a unique thing or person – Determiner: determiner – Indefinite article: article that is indefinite – Mother tongue: Specifies whether the language is a speakers mother tongue

Too specific DCs Quite a number of DCs are too specific, mostly Polish ones, this makes it difficult to map with them – i.e. stuff that belongs in the Polish language section is in the general, English one Other DCs are too project/tagset related – Mentioning the name in de definition – Or in the name, – Or …

Therefore, while for some technical issues solutions will come up YOU should also be very careful yourself, especially wrt the ‘soundness’ of the DCs, in particular as far as definitions, profile, and translation are concerned! Only in that case ISOcat can become a success story!

Follow-up Contact – Menzo for technical problems – Ineke for content problems Next workshop – September or October – Before 15 August share a first substantial selection for your project with the CLARIN-NL group, and a spreadsheet for RELcat 10 May /2011CLARIN-NL ISOcat workshop11

Thanks !