ICS-FORTH, March 10, 2001. Institute of Computer Science, Foundation for Research and Technology - Hellas. Ontologies and Thesauri - Tools for Effective Information Access.

Presentation transcript:

ICS-FORTH, March 10, 2001
Institute of Computer Science, Foundation for Research and Technology - Hellas
Ontologies and Thesauri - Tools for Effective Information Access
Martin Doerr
Workshop of the Human Network for Cultural Informatics, Heraklion, Crete

Problem Statement
- Explanation of a term:
  - What is an ushebti, what is a shawabty?
  - What did it mean, and when?
  - What was it made for?
  - How was it made?
  - Where was it used?
- Ideas and concepts, rather than words
- Multiple aspects of interest!

Problem Statement
- Searching for comparative studies
- How do I spell it? Ushabti, ushabty, ushebti, shawabty? Will it be written the same everywhere?
- Should I call it "grave goods" (AAT), "burial figurines", "dolls", "afterlife helpers", "personality surrogate", "burial ritual"?
- And what about "χαρώνειο, δανάκη"?
- Should I call it "toll", "cheap coin", "afterlife helper", "corpse equipment", "burial gift", "burial rites"?
- Would "grave goods" be distinctive enough?

Problem Statement
- How to find the characteristic term itself?
- How to discover related literature?
  - Relevant abstractions are not standardized
- How to make statistics even about the same item?
  - The same items can be referred to in a thousand ways
- How to do comparative studies by features?
  - Implicit features are not declared; explicit features need systematic documentation

Problem Statement
- Find well-defined concepts
  - uniquely identifiable without dialogue
  - with wide agreement
  - for reproducible agreement between classification and retrieval
- Co-operative work on shared knowledge bases (Knowledge Organisation Systems, KOS):
  - knowledge elicitation from experts
  - many small agreements and data integration
  - structural evolution
  - publication and incorporation at user sites

Usage Environment?
[Diagram. Labels: User's Authority; Target Authorities; CMS; Collections; old version; specialised; foreign language; Distributed Retrieval; Local Term; Agreed-on Term]

About Thesauri
- Thesauri: find good terms by associations
  - Peter Mark Roget, 1852, "Thesaurus of English Words and Phrases"
- Linguistic thesauri
  - TEI, FDIS ISO 12620, MARTIF, VHG
  - Dictionary editing, term based, presentation oriented
- Conceptual thesauri
  - From library science, subject classification
  - Ranganathan: priority of concept. Confusing the idea plane, the verbal plane and the notational plane hinders analysis and problem solution
  - ISO 2788, ISO 5964, ISO 2709, e.g. AAT

About Thesauri
Intrathesaurus relations (ISO 2788)
- Hierarchical relations (from descriptor to descriptor)
  - BT (Broader Term)
  - BTP (Broader Term Partitive)
  - BTG (Broader Term Generic) = actual BT
- Associative relations (from descriptor to descriptor)
  - RT (Related Term)
- Equivalence relations (from descriptor to term)
  - ALT (Alternative Term)
  - UF (Used For Term)
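The relation types listed on this slide can be pictured as a small data structure. The following Python sketch is only an illustration under stated assumptions: the class and method names (`Descriptor`, `Thesaurus`, `resolve`, `narrower`) are hypothetical, and placing "ushabti" under "grave goods" is an example choice, not a statement about any published thesaurus.

```python
from dataclasses import dataclass, field


@dataclass
class Descriptor:
    """A preferred term with ISO 2788-style links (hypothetical model)."""
    label: str
    broader: set = field(default_factory=set)   # BT: labels of broader descriptors
    related: set = field(default_factory=set)   # RT: labels of related descriptors
    used_for: set = field(default_factory=set)  # UF: non-preferred entry terms


class Thesaurus:
    def __init__(self):
        self.descriptors = {}  # descriptor label -> Descriptor
        self.entry_terms = {}  # lower-cased term -> descriptor label

    def add(self, label, broader=(), related=(), used_for=()):
        d = Descriptor(label, set(broader), set(related), set(used_for))
        self.descriptors[label] = d
        # Both the descriptor itself and its UF terms lead to the descriptor.
        for term in (label, *used_for):
            self.entry_terms[term.lower()] = label
        return d

    def resolve(self, term):
        """Map any known term (descriptor or UF term) to its descriptor label."""
        return self.entry_terms.get(term.lower())

    def narrower(self, label):
        """Narrower terms are derived as the inverse of the stored BT links."""
        return sorted(d.label for d in self.descriptors.values()
                      if label in d.broader)


# Illustrative content only (assumed, not taken from a real thesaurus).
t = Thesaurus()
t.add("grave goods")
t.add("ushabti", broader=["grave goods"],
      used_for=["ushebti", "shawabty", "ushabty"])
```

With this structure, `t.resolve("Shawabty")` leads to the descriptor "ushabti" regardless of spelling variant, which is the kind of equivalence handling the problem statement asks for.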

Broader Term Hierarchies (figure)

About Thesauri
- Concepts identify sets of real-world objects
- Concepts are identified by scope notes, literature references, examples and images, NOT by terms!
- Terms (noun phrases) are used
  - by social groups to refer to concepts
- Links express opinions and differences
  - about set relations between concepts (subsumption, disjointness, etc.)
  - about term usage

Example: Problems of Monohierarchy (figure)

Concepts are Organized in Facets
- Fundamental category, major facet, basic facet:
  - Ranganathan: Personality, Matter, Energy, Space, Time
  - CIDOC CRM: Period, Physical Entity, Conceptual Object, Actor, Place, Time-Span, Type, Material, Language
  - AAT: Objects, Agents, Activities, Styles and Periods, Materials, Physical Attributes, Associated Concepts
- Syntactic element of an indexing expression: e.g. subdivision by period, geography, genre (MARC): "history of painting in 19th century Greece", or AAT: "fencing + swords".

About Minor Facets
- "Minor facets" provide explicit context criteria:
  - E.g. MDA Archeological Thesaurus:
    - armour by construction: scale armour
    - armour by form: cuirass
    - armour by function: parade armour
- A striking example of the explicit use of aspect: SHIC (Social, Historical and Industrial Classification), a "pure", homogeneous thesaurus of human activities, used by British museums to classify artifacts!
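One way to read the MDA example is that each narrower term is filed under its broader concept together with the criterion that distinguishes it. The sketch below is a hypothetical representation of that idea; the function names `narrower_by` and `criteria_for` are assumptions for illustration, and only the three armour entries come from the slide.

```python
from collections import defaultdict

# (broader concept, criterion) -> narrower terms, using the slide's MDA example.
minor_facets = defaultdict(list)
minor_facets[("armour", "construction")].append("scale armour")
minor_facets[("armour", "form")].append("cuirass")
minor_facets[("armour", "function")].append("parade armour")


def narrower_by(concept, criterion):
    """Narrower terms of a concept under one explicit criterion."""
    return list(minor_facets.get((concept, criterion), []))


def criteria_for(term):
    """Criteria under which a term has been filed."""
    return sorted(criterion
                  for (_, criterion), terms in minor_facets.items()
                  if term in terms)
```

Keeping the criterion as part of the key makes the context of each subdivision explicit, so "cuirass" is known to be a subdivision of armour by form and not by function.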

Polydeykes
- The Directorate of Monuments Record and Publications of the Greek Ministry of Culture is developing "Polydeykes" in collaboration with ICS-FORTH.
- Basic facets:
  - Kosmos, the world as subject
  - Living Nature, as historical subject
  - Culture and Civilization
  - Space
  - Time
  - Creations, the man-made world
    - Immobile objects
    - Mobile objects
    - Conceptual works
    - Associative concepts: stylistic, physical and technical characteristics

Polydeykes
- Example: aspects of immobile objects:
  - "Είδος" (kind), the "design models" of the past (form dominated)
  - "Ενότητα" (unit), units with respect to social or functional role
  - "Στοιχεία" (elements), constructive and morphological characteristics:
    - "τμήματα", segments/sections
    - composition: dependent and independent parts
    - styles
    - shapes
- Pre-combined in the upper abstraction levels into a complete grid for the classification of characteristic terms and for object classification: consistent but heavy.

Polyhierarchies instead of Minor Facets
[Diagram. Labels: objects; swords; sword-like objects; foils (swords); weapons; sword-like; fighting and hunting; cutting and thrusting; fencing; cutting and thrusting weapons; fencing swords; wooden swords; wooden. Legend: term specialization; criteria assignment]
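A polyhierarchy lets a term have more than one broader term, so the structure is a directed acyclic graph instead of a tree. The sketch below shows how all broader concepts of a term could be collected in such a graph. The edges are assumptions that reuse labels from the diagram; the transcript does not preserve the actual links, so they should not be read as the diagram's content.

```python
# term -> set of directly broader terms (hypothetical edges, labels from the slide).
broader = {
    "weapons": {"objects"},
    "sword-like objects": {"objects"},
    "cutting and thrusting weapons": {"weapons"},
    # Two parents: this is what a monohierarchy cannot express.
    "swords": {"cutting and thrusting weapons", "sword-like objects"},
    "fencing swords": {"swords"},
    "foils (swords)": {"fencing swords"},
    "wooden swords": {"sword-like objects"},
}


def ancestors(term):
    """All broader terms reachable from `term`, following every parent."""
    seen = set()
    stack = [term]
    while stack:
        for parent in broader.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen
```

Under these assumed edges, a search for "weapons" that expands to narrower terms would retrieve foils but not wooden swords, while a search for "sword-like objects" would retrieve both.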

Ontologies
- Formal ontologies: mathematical models for thesaurus relationships
  - Concepts are correlated with sets of objects
  - BT/NT => IsA/subsumption
  - RT => open number of "roles"/properties/attributes (like "produces", "used by", "made for")
- Allow for machine-processable definitions:
  - Fencing sword = sword used for: fencing
  - Weapon = object used for: fighting or hunting
  - Mother = human & female & which has borne: human
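The definitions on this slide can be evaluated mechanically once IsA links and role values are available. The following is a minimal sketch and not a description logic reasoner: the dictionary layout, the `is_a` mini-hierarchy and the function names are assumptions made for illustration, and only the two definitions for "fencing sword" and "weapon" are taken from the slide.

```python
# IsA links (BT/NT read as subsumption); a hypothetical mini-hierarchy.
is_a = {"sword": "object", "foil": "sword", "knife": "object"}


def subsumed_by(kind, concept):
    """True if `kind` equals `concept` or lies below it in the IsA chain."""
    while kind is not None:
        if kind == concept:
            return True
        kind = is_a.get(kind)
    return False


# defined concept -> (base concept, role, acceptable role values)
definitions = {
    "fencing sword": ("sword", "used for", {"fencing"}),
    "weapon": ("object", "used for", {"fighting", "hunting"}),
}


def classify(item):
    """Return the defined concepts whose definition the item satisfies."""
    matches = []
    for name, (base, role, allowed) in definitions.items():
        if subsumed_by(item["kind"], base) and allowed & set(item.get(role, ())):
            matches.append(name)
    return sorted(matches)


foil = {"kind": "foil", "used for": ["fencing"]}
hunting_knife = {"kind": "knife", "used for": ["hunting"]}
```

Here `classify(foil)` yields "fencing sword" because a foil is a sword used for fencing, and `classify(hunting_knife)` yields "weapon". No indexer had to assign those concepts by hand, which is the point of a machine-processable definition.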

Ontologies
- Formal ontologies are the natural extension of thesauri
- Allow for dynamic, unambiguous concept formation => multiplication of available vocabulary (in contrast to post-coordination like "grinding + factory")
- Allow for machine-based inferencing => multiplication of manageable amounts
- Allow for interpretation of data structures (tables, fields, tags, classes, attributes etc.) and terms => help data interoperability

Conclusions
- Thesauri and ontologies for information systems are retrieval tools, not terminology dictionaries (concepts often differ from expert terminology).
- Thesaurus structure must be functional and polyhierarchical.
- Thesaurus concepts are a matter of agreement.
- Indexing data records is different from scholarly classification.
- Try to correlate different (foreign) thesauri!
- Formal ontologies are the next step. Thesaurus editors: preserve as much knowledge as possible!