Vocabularies in the VO
Alasdair J G Gray, Norman Gray, Iadh Ounis


What is a Vocabulary?
5 November 2008, ADASS - Vocabularies in the VO

What is a Controlled Vocabulary?
A set of terms, each with:
– Label
– Synonyms
– Definition
– Relationships to other terms:
    Broader term (BT)
    Narrower term (NT)
    Related term (RT)
Example:
– Label: Spiral galaxy
– Synonym: Spiral nebula
– Definition: A galaxy having a spiral structure
– Relationships:
    BT: Galaxy
    NT: Barred spiral galaxy
    RT: Spiral arm
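The term structure above can be sketched as a small record type. This is a minimal illustration only: the `Term` class and its field names are assumptions made for this example, not part of any IVOA or SKOS tooling.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    """One controlled-vocabulary entry (hypothetical structure for illustration)."""
    label: str
    synonyms: list[str] = field(default_factory=list)
    definition: str = ""
    broader: list[str] = field(default_factory=list)   # BT
    narrower: list[str] = field(default_factory=list)  # NT
    related: list[str] = field(default_factory=list)   # RT


# The "Spiral galaxy" example from the slide.
spiral_galaxy = Term(
    label="Spiral galaxy",
    synonyms=["Spiral nebula"],
    definition="A galaxy having a spiral structure",
    broader=["Galaxy"],
    narrower=["Barred spiral galaxy"],
    related=["Spiral arm"],
)


def all_labels(term: Term) -> list[str]:
    """Return the preferred label followed by its synonyms."""
    return [term.label, *term.synonyms]
```

Keeping the relationships as lists allows a term to have more than one broader term, which matters for the poly-hierarchy requirement discussed later.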

Why do we need vocabularies?
Limit and define terminology
Improve search
– Retrieve papers from ADS using keywords
    All papers about “Binary eclipsing stars”
– Process all VOEvents about “supernovae”
    Should also process “1a supernovae”
– Locate relevant resources in the registry/VizieR
Removes ambiguity
Improves precision

Analysis of Registry Keywords
Problems:
– Plural/singular
– Case
– Abbreviations
– Different tags
Thanks to Sébastien Derriere for this data.

Count  Keyword
   75  Star
   52  Galaxy
   37  Stars
   36  Galaxies
   16  AGN
   12  Cluster of Galaxies
   12  Nebulae
   11  Planets
   10  GRB
   10  Globular Clusters
    8  Star Cluster
    7  Nebula
    6  Variable stars
    5  Hot stars
    5  Pulsar
    4  supernova
    3  Clusters of Galaxies
    3  Infrared:stars
    3  Quasars: general
    3  Supernova
    3  White dwarfs
    3  galaxies
    2  Comets
    2  Cool stars
    2  Extragalactic Source
    2  Extragalactic objects
    2  Infrared: stars
    2  Interstellar medium
    2  QSO
    2  QSOs
    2  SNR
    2  Variable Star
    2  White Dwarf
    2  clusters of galaxies
    2  stars
    1  Asteroids
    1  BL Lac
    1  Be/X-ray binary stars
    1  Binary stars
  ...
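To make the plural/singular and case problems concrete, the sketch below folds a few of the keyword counts listed above with a deliberately crude rule (lower-casing plus naive plural stripping). The `fold` function is an assumption for illustration, not something the registry does. It does nothing for abbreviations such as "AGN" or for genuinely different tags, which is part of the argument for identifiers over strings.

```python
from collections import Counter

# A subset of the registry keyword counts from the slide.
raw_counts = {
    "Star": 75, "Stars": 37, "stars": 2,
    "Galaxy": 52, "Galaxies": 36, "galaxies": 3,
    "supernova": 4, "Supernova": 3,
    "QSO": 2, "QSOs": 2,
}


def fold(keyword: str) -> str:
    """Crude normalisation: lower-case, then strip a simple English plural."""
    k = keyword.strip().lower()
    if k.endswith("ies"):
        return k[:-3] + "y"
    if k.endswith("s") and not k.endswith("ss"):
        return k[:-1]
    return k


merged = Counter()
for keyword, count in raw_counts.items():
    merged[fold(keyword)] += count

for keyword, count in merged.most_common():
    print(f"{count:4d}  {keyword}")
```

Ten distinct strings collapse to four keys here, but a rule like this is fragile (it would mangle a tag such as "Mars"), so it is a diagnostic aid rather than a substitute for a controlled vocabulary.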

Common Vocabulary Format
Requirements:
– Provide term identifiers
    Unambiguous tagging
– Capture semantic relationships
    Poly-hierarchy structure
– Machine processable
    Allows inter-operability
“Machine intelligence”
– Avoids problems of:
    Spelling
    Case
    Plurality
    Tags
– Automated reasoning:
    Interested in all “Supernova”
    Items tagged as “1a Supernova” also returned
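The automated-reasoning requirement can be sketched as a walk down the narrower-term links: a search for a concept also returns items tagged with any of its narrower concepts. The identifiers and the `tags` table below are hypothetical. Treating the hierarchy as transitive is an application-level choice made for this sketch.

```python
# Hypothetical concept identifiers and narrower-term links.
narrower = {
    "#supernova": ["#1aSupernova"],
    "#1aSupernova": [],
    "#gammaRayBurst": [],
}

# Hypothetical items, each tagged with one concept identifier.
tags = {
    "event-1": "#supernova",
    "event-2": "#1aSupernova",
    "event-3": "#gammaRayBurst",
}


def with_narrower(concept: str, narrower: dict[str, list[str]]) -> set[str]:
    """Return the concept plus everything reachable through narrower links."""
    seen: set[str] = set()
    stack = [concept]
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # a poly-hierarchy can reach the same concept twice
        seen.add(current)
        stack.extend(narrower.get(current, []))
    return seen


def search(concept: str) -> list[str]:
    """Items tagged with the concept or with any narrower concept."""
    wanted = with_narrower(concept, narrower)
    return sorted(item for item, tag in tags.items() if tag in wanted)
```

With these tables, `search("#supernova")` returns both `event-1` and `event-2`, while `search("#1aSupernova")` returns only `event-2`.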

SKOS
– W3C standard for sharing vocabularies
– Based on RDF
    Semantic model for describing resources
– Provides URI for each term
– Captures properties of terms
– Encodes relationships between terms
    Enables automated reasoning
    Standard serialisations
    “Looser” semantics than OWL
– Adopted by IVOA Vocabularies standard

Example SKOS Vocabulary Term
Example
– Label: “Spiral galaxy”
– Synonym: “Spiral nebula”
– Definition: “A galaxy having a spiral structure”
– Relationships:
    BT: “Galaxy”
    NT: “Barred spiral galaxy”
    RT: “Spiral arm”
In Turtle notation:
    @prefix skos: <http://www.w3.org/2004/02/skos/core#> .

    <#spiralGalaxy> a skos:Concept ;
        skos:prefLabel "Spiral galaxy" ;
        skos:altLabel "Spiral nebula" ;
        skos:definition "A galaxy having a spiral structure" ;
        skos:broader <#galaxy> ;
        skos:narrower <#barredSpiralGalaxy> ;
        skos:related <#spiralArm> .

Terminology Aside
Folksonomies
– Keyword tags, freely chosen
– e.g. VizieR subjects
Vocabulary
– Controlled list of words with definitions
Taxonomy
– Relationships: Broader/Narrower/Related
Thesaurus
– Synonyms, antonyms, see also
Ontology
– Formal specification of a shared conceptualisation
– OWL
“Vocabulary” is used in the IVOA to cover vocabularies, taxonomies, and thesauri.

Existing Astronomical Vocabularies
Journal Keywords
– Developed for tagging papers
– 311 terms
– Actively used
Astronomy Visualization Metadata (AVM)
– Tagging images
– 217 terms
– Actively used
IAU Thesaurus
– Developed for libraries in 1993
– 2,551 terms
– Never really used
Unified Content Descriptor (UCD)
– Tagging resource data
– 473 terms
– Actively used
Published as SKOS vocabularies by the IVOA

Inter-operable Vocabularies
Which vocabulary should I use?
    Closest match to your needs
Relate vocabulary terms using mappings
– Part of the SKOS standard
– One mapping file per pair of vocabularies
– Chains of mappings
    A → B → C gives A → C
Inter-vocabulary mappings
– Broad match: more general term
– Narrow match: more specific term
– Related match: associated term
– Exact match: equivalent term
– Close match: similar but not equivalent term
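The "chains of mappings" idea can be sketched as composing two pairwise mapping tables. As a cautious assumption, the sketch chains only exact matches: SKOS defines exact match as transitive, whereas close match is not, and chaining the other match types is best reviewed by a person. The vocabulary prefixes `A:`, `B:`, `C:`, the term names, and the one-mapping-per-term table layout are all hypothetical.

```python
# Each table maps a source term to (match type, target term).
a_to_b = {
    "A:spiralGalaxy": ("exactMatch", "B:galaxySpiral"),
    "A:quasar": ("closeMatch", "B:activeGalaxy"),
}
b_to_c = {
    "B:galaxySpiral": ("exactMatch", "C:spiral"),
    "B:activeGalaxy": ("exactMatch", "C:agn"),
}

Mapping = dict[str, tuple[str, str]]


def compose_exact(first: Mapping, second: Mapping) -> Mapping:
    """Chain A->B and B->C into A->C, keeping only exact-match links."""
    result: Mapping = {}
    for source, (kind, middle) in first.items():
        if kind != "exactMatch":
            continue  # do not propagate weaker match types
        hit = second.get(middle)
        if hit is not None and hit[0] == "exactMatch":
            result[source] = ("exactMatch", hit[1])
    return result


a_to_c = compose_exact(a_to_b, b_to_c)
```

Here only `A:spiralGalaxy` gains a derived mapping to `C:spiral`. The `A:quasar` link is dropped because its first hop is a close match.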

Mapping Editor

Putting it all together
Use vocabulary concepts for
– Tagging (using URI)
    Resources in the registry
    VOEvent packets
– Searching by vocabulary concept
    User keyword search converted to vocabulary URI
Provides semantic advantages
– Reasoning about terms
    Relationships
    Mappings
Requires mechanism to convert string to concept
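A very simple form of the string-to-concept mechanism is an index from normalised labels (preferred and alternative) to concept identifiers. This sketch assumes exact label lookup after case and whitespace folding, which is far simpler than ranked, information-retrieval based matching. The identifiers and function names are hypothetical.

```python
from typing import Optional

# Hypothetical concepts with their preferred and alternative labels.
labels_by_uri = {
    "#spiralGalaxy": ["Spiral galaxy", "Spiral nebula"],
    "#galaxy": ["Galaxy"],
}


def normalise(text: str) -> str:
    """Lower-case and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def build_index(labels: dict[str, list[str]]) -> dict[str, str]:
    """Map every normalised label to its concept; the first concept seen wins."""
    index: dict[str, str] = {}
    for uri, names in labels.items():
        for name in names:
            index.setdefault(normalise(name), uri)
    return index


def to_concept(keyword: str, index: dict[str, str]) -> Optional[str]:
    """Return the concept identifier for a user keyword, or None if unknown."""
    return index.get(normalise(keyword))


index = build_index(labels_by_uri)
```

A user typing `"  spiral   NEBULA "` is resolved to `#spiralGalaxy` through the alternative label, while an unknown keyword such as `"quasar"` yields `None` and would need a fuzzier matcher.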

Vocabulary Explorer
Search and browse vocabularies
– Configure
    Vocabularies
    Mappings
Based on Information Retrieval techniques
    Matching mechanisms
    Ranking results

Vocabulary Explorer Screenshot

Conclusions
Vocabularies improve search
– Remove ambiguity
– Increase precision and recall
– Enable
    Reasoning about relevance
    Faceted browsing
Provided tools for working with vocabularies
– Reliable search from keyword string to vocabulary term
– Exploration of vocabularies
– Mapping terms across vocabularies
Future Work
– Provide a string to concept transformation service
– Improve multi-vocabulary ranking results

Practical Semantic Astronomy March 2009 Glasgow, UK