Vocabulary Services “Huuh - what is it good for…” (in WDTS anyway…) 4 th September 2009 Jonathan Yu CSIRO Land and Water.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Semantic Web Thanks to folks at LAIT lab Sources include :
Event dashboard: Capturing user-defined semantics for event detection over real-time sensor data CSIRO LAND AND WATER Jonathan Yu | Research engineer Environmental.
SRDC Ltd. 1. Problem  Solutions  Various standardization efforts ◦ Document models addressing a broad range of requirements vs Industry Specific Document.
Ontology Notes are from:
Descriptions Robert Grimm New York University. The Final Assignment…  Your own application  Discussion board  Think: Paper summaries  Web cam proxy.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
A Registry for controlled vocabularies at the Library of Congress
1 1 Roadmap to an IEPD What do developers need to do?
Modernizing the Data Documentation Initiative (DDI-4) Dan Gillman, Bureau of Labor Statistics Arofan Gregory, Open Data Foundation WICS, 5-7 May 2015.
A Really Brief Crash Course in Semantic Web Technologies Rocky Dunlap Spencer Rugaber Georgia Tech.
The NERC DataGrid Vocabulary Server Roy Lowry British Oceanographic Data Centre Ontology Registry Meeting.
The NERC DataGrid Vocabulary Server: an operational system with distributed ontology potential Roy Lowry British Oceanographic Data Centre GO-ESSP 2008,
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Towards validating observation data in WaterML 2.0 WATER FOR A HEALTHY COUNTRY You can change this image to be appropriate for your topic by inserting.
Practical RDF Chapter 1. RDF: An Introduction
Using Vocabulary Services in Validation of Water Data May 2010 Simon Cox, JRC Jonathan Yu & David Ratcliffe, CSIRO.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Metadata IN Smart Grid Group Name: REQ
The MMI Tools Carlos Rueda Monterey Bay Aquarium Research Institute OOS Semantic Interoperability Workshop Marine Metadata Interoperability Project Boulder,
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
A centre of expertise in digital information management The MEG Metadata Schemas Registry Pete Johnston, Research Officer (Interoperability),
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
INLS 520 – Erik Mitchell INLS 520 Information Organization.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
What is Information Modelling (and why do we need it in NEII…)? Dominic Lowe, Bureau of Meteorology, 29 October 2013.
DDI-RDF Leveraging the DDI Model for the Linked Data Web.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
NERC DataGrid NERC DataGrid Vocabulary Server Use Cases Vocabulary Workshop, RAL, February 25, 2009.
Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Semantic Web - an introduction By Daniel Wu (danielwujr)
Publications Office Metadata Registry (MDR) INSPIRE Registry and Registers Workshop Willem van Gemert Publications Office of the EU Dissemniation and Reuse.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
Web Technologies for Bioinformatics Ken Baclawski.
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
1 Ontolog OOR-BioPortal Comparative Analysis Todd Schneider 15 October 2009.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
® Using (testing?) the HY_Features model, 95th OGC Technical Committee Boulder, Colorado USA Rob Atkinson 3 June 2015 Copyright © 2015 Open Geospatial.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Ontology Technology applied to Catalogues Paul Kopp.
An Introduction to the MEDIN Discovery Metadata Standard MEDIN Workshop BGS, Edinburgh, June 2015.
© Tata Consultancy Services ltd.12 June Metadata and Data Standards Levels of Metadata C. Anantaram Innovation Lab.
Validating water resources described in WaterML 2.0 Jonathan Yu, CSIRO Land and Water Peter Taylor, Gavin Walker (CSIRO ICT Centre), Simon Cox (CSIRO ESRE)
SysML v2 Model Interoperability & Standard API Requirements Axel Reichwein Consultant, Koneksys December 10, 2015.
U.S. Department of the Interior U.S. Geological Survey WaterML Presentation to FGDC SWG Nate Booth January 30, 2013.
ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES Debashis Naskar 1 and Biswanath Dutta 2 DSIC, Universitat Politècnica de València.
M&CML: A Monitoring & Control Specification Modeling Language
Components.
The Re3gistry software and the INSPIRE Registry
PREMIS Tools and Services
LOD reference architecture
Presentation transcript:

Vocabulary Services “Huuh - what is it good for…” (in WDTS anyway…) 4 th September 2009 Jonathan Yu CSIRO Land and Water

CSIRO. Talk outline Water Data Transfer Standards What are vocab services? How are they used? How are we are using it in WDTS? Future work

CSIRO. Talk outline What are vocab services? Set of services for querying and managing vocabularies Vocabularies themselves (water regulation codes, units of measure, pizza classifications, wine ontologies) Metadata about a domain: concepts, properties, relationships, assertions RDF-based vocabulary languages: RDFS, SKOS, OWL Embedding RDF into xHTML (RDFa) Interfaces SPARQL Protocol and RDF Query Language (SPARQL) queries HTTP Get/Post, REST Html forms How are they used? Dictionary lookup: What does this term mean? What is a margherita pizza? What is the German equivalent that pizza?? marguerite ?marguerite Discovery and analysis: What is it related to? What pizza has similar toppings to a margherita pizza? Where does the concept “Gold” occur in Geological surveys in Australia? Interoperability and shared definitions: Oh, this concept in my Business model maps to this other concept in your Business model! Oh, fruitti de mare pizza is actually a seafood pizza! Data validation: Is this a valid pizza order? Is my XML data consistent with an International standard ? Config. management and Generating code: Fill templates or Spit out some code based on concepts, properties, or conceptual structure in the vocabulary (pizza ordering website, sitemaps, Java code, Schematron rules) How are we are using it in WDTS? Validation services Validate potentially lots and lots of XML data in WDTF format > 200 data providers transferring their water data to BoM Need to ensure format is followed Hang on, can’t we just use XML Schema to enforce validation rules? XML Schema not sufficient enough and can’t capture a lot of the semantics in business rules Specific cardinality constraints and vocabulary checking Examples: Using HTTP get queries like: Is this a valid vocabulary definition? Is this URN valid? Does this Water regulation code parameter have the right measurement unit associated with it? Generating Schematron code To check cardinality between one element and another Example: your HydroCollection xml data may have as many nodes but must only have one node Future work Validation using complex business rules Currently unknown – suspect, continue to push the boundary with leveraging of vocabulary services Documentation generation Leverage on vocabulary service to aid documentation generation – i.e. constraints

CSIRO. Joint effort by CSIRO & Bureau of Meteorology (BoM) Problem space: standardising format of water observation data Currently water data providers sending data in various formats Water Data Transfer Standards (WDTS)

CSIRO. WDTF Develop Water Data Transfer Format (WDTF) Standardised format for sending and receiving water related data using XML (e.g. groundwater, river flow) Primarily used by water providers to send their data for ingestion by BoM But also for exchange between other organisations Part of integrated national water information system to help with water crisis

CSIRO. Validating WDTF Potentially lots of agencies (over 200) submitting WDTF Can’t possibly examine each XML file for valid structure and content ! Need mechanism(s) for validating WDTF

CSIRO. Why not use XML Schema? Hang on, can’t we just use XML Schema to enforce validation rules? XML Schema not sufficient enough and can’t capture a lot of the semantics in business rules Reg s3.9g Unclassified 7.56 Approp. identifiers Valid content and contextual use

CSIRO. What are Vocab Services? Set of services for querying and managing vocabularies 1.Interfaces SPARQL Protocol and RDF Query Language (SPARQL) queries HTTP Get/Post, REST Html forms 2.Vocabularies Vocabularies Descriptions about a domain in specification language: concepts, properties, relationships, assertions

CSIRO. Vocabularies What do they look like?

CSIRO. Vocabularies What do they look like? water regulation codes, units of measure, pizza classifications wine vocabularies

CSIRO. Example of specification languages Limit to RDF/XML based RDF-based vocabulary languages: Simple Knowledge Organisation System (SKOS) Simple taxonomic descriptions Broader, narrower, related to relationships Web Ontology Language (OWL) Ability to describe in very specific logic i.e. Class A is disjoint from Class B, C and D and has this custom relationship that has this defined cardinality constraint with Class B but is a subclass of Class X

CSIRO. What are vocabs services good for? 1.Dictionary lookup: What does this term mean? What is beetroot? What is Metres? 2. Discovery and analysis: What is it related to? I know I have beetroot in my fridge, what other related food is in my fridge? What water regulation parameters use the unit Metres? Where does the concept “Gold” occur in Geological surveys in Victoria? 3. Interoperability and shared definitions and semantics: Oh, this concept in my Business model maps to this other concept in your Business model! Oh, your parameter of WaterCourseLevel is measured in metres? Mine is in millimetres – let’s talk 4. Data validation: Do I have milk in my fridge? Is this a valid water parameter? Is my XML data consistent with WDTF? 5. Config. management and Generating code: Fill templates or Spit out some code or artifact based on concepts, properties, or conceptual structure in the vocabulary (my dinner, sitemaps, Java code, Schematron rules)

CSIRO. Vocab Services in WDTS Leveraging Vocabulary Services for… Representing schema control lists currently being maintained in Excel spreadsheet Validation services: Validate potentially lots and lots of XML data in WDTF format > 200 data providers transferring their water data to BoM Need to ensure format is followed

CSIRO. Typical usage of vocabulary service Specific cardinality constraints and vocabulary checking Using HTTP get queries like: Is this a valid vocabulary definition? /get/concept /vocab1.0 /param%3AWaterpH_pH /check/concept /vocab1.0 /param%3AWaterpH_pH

CSIRO. More query examples Is this Urn or http-Uri valid? E.g. urn:ogc:def:crs:EPSG:: /check/property /vocab1.0 /dc:identifier /%27urn:ogc:def:crs:EPSG::28349%27 Does this Water regulation code parameter have the right measurement unit associated with it? /check/relation /vocab1.0 /param:WaterpH_pH /skos:related /dc:identifier/%27[pH]%27

CSIRO. Generating Schematron code from query To check cardinality between one element and another Example: your HydroCollection xml data may have as many nodes but must only have one node structure/wdtf:HydroCollection/wdtf:metadata Or just get all of them /wdtf-structure2/

CSIRO. Problems and areas of difficulty Emerging requirements Open source tools Standards for representing vocabularies Implementation specific details of how vocab stored, managed Exploring what we can and can’t do with vocabulary services Method or approach used Encapsulating vocabulary in the ‘right’ way Various ways to represent something Best practices for querying Versioning

CSIRO. Conclusion WDTS Project and driving problem space for Vocabulary services Vocabulary services: Huuh - what is it good for… absolutely something! At least, for validating content and business rules in WDTF

CSIRO. Future work Continuing to develop solutions to current business rules and best practices for WDTF 1.0, 1.1 Validation of future (complex) business rules Currently unknown – suspect that we will continue to push the boundary with leveraging of vocabulary services Documentation generation Leverage on vocabulary service to aid documentation generation – i.e. populating constraints WaterML 2.0 Worldwide standard for a water data exchange format

CSIRO. Questions?