Data modeling Goal: Agree on data modeling process and ontology.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Health Ingenuity Exchange (HingX) Best Practices for User Groups and Resource Registration.
The Semantic Web – WEEK 4: RDF
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Unified Digital Format Registry (UDFR) Stakeholder Meeting Library of Congress Washington, DC April 13, 14, 2011.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator Session: Managing Ecological Data for Effective Use and Reuse Patrice Seyed.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Robert Sharpe, Tessella PRELIDA Workshop 2013 ENSURE Linked Data Registry.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
Practical RDF Chapter 1. RDF: An Introduction
RDA data and applications Gordon Dunsire Presented to staff of the British Library, Boston Spa, 20 Mar 2014.
Clément Troprès - Damien Coppéré1 Semantic Web Based on: -The semantic web -Ontologies Come of Age.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Logics for Data and Knowledge Representation
The Semantic Web Web Science Systems Development Spring 2015.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
INLS 520 – Erik Mitchell INLS 520 Information Organization.
Metadata Modularization Concepts and Tools Carl Lagoze CS
Presentation : Konstantinos Kanaris.  What is Jena?  Usage of Jena  Main Concepts  Main Components  Storage Models  OWL API  RDF API  Reasoning.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
RDF and XML 인공지능 연구실 한기덕. 2 개요  1. Basic of RDF  2. Example of RDF  3. How XML Namespaces Work  4. The Abbreviated RDF Syntax  5. RDF Resource Collections.
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
U.S. Department of the Interior U.S. Geological Survey A Consideration of Geospatial Feature Formation in Linked Open Vocabularies Workshop on Linked Open.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
RELATORS, ROLES AND DATA… … similarities and differences.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Understanding RDF. 2/30 What is RDF? Resource Description Framework is an XML-based language to describe resources. A common understanding of a resource.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Ontology Resource Discussion
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
1cs The Need “Most of the Web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.” Berners-Lee,
Pete Johnston, Eduserv Foundation 16 April 2007 An Introduction to the DCMI Abstract Model JISC.
Doc.: IEEE /0169r0 Submission Joe Kwak (InterDigital) Slide 1 November 2010 Slide 1 Overview of Resource Description Framework (RFD/XML) Date:
Winter 2011SEG Chapter 11 Chapter 1 (Part 1) Review from previous courses Subject 1: The Software Development Process.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lotzi Bölöni.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
ISO TC37/SC4 N435 Nov 12, 2007 Presented by Miran Choi/ETRI Written by Jae Sung Lee/Chungbuk National Univ.
Extending the Metadata Registry for Semantic Web - Enforcing the MDR for supporting ontology concept - May 28, 2008 ISO/IEC JTC 1/SC 32 WG 2 Meeting Sydney,
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
RDA, linked data, and update on development
Appellations, Authorities, and Access
Session 2: Metadata and Catalogues
Ontology.
LOD reference architecture
Information Networks: State of the Art
RDA in a non-MARC environment
The new RDA: resource description in libraries and beyond
Presentation transcript:

Data modeling Goal: Agree on data modeling process and ontology

Agenda 1.Scope 2.Provenance/ Governance (briefly) 3.Identifiers 4.Guiding Principles, Terms, Concepts 5.Controlled Vocabularies

Scope Current model is based on PRONOM 6 and UDFR Is there a useful distinction between “fact” and “institutional policy”? What should be contained in the registry? FactAssessmentPolicy JPG2000 is an image compression format. JPEG2000 is a well- adopted standard. JPG2000 is acceptable by CDL for reformatting photographs

Scope Are there other aspects of PRONOM 7 we want to include in the registry?

Provenance (briefly) What is the proper granularity for provenance and technical review, per- property or per-aggregate entity (e.g., format, agent, document, etc) Representation within the model is statements about the provenance Statements about the formats, rather than who stated those facts. Provenance about the registry information itself can be managed by Open Provenance Vocabulary whether as reified statements or statements about particular triples or graphs.

Governance (briefly) What level of technical review should/will contributed information be subject, and by whom? What are the criteria for contributor eligibility? Anonymous? Public, but known? Self-nominated, but vetted? Invited? More food for thought (to be extended tomorrow):

Identifiers (1) There are multiple identifiers that are defined in the model: 1.PRONOM ID (PUID) 2.GDFR Identifier 3.UDFR Identifier 4.UDFR SystemID (internal registry ID)

Identifiers (2) UDFR Identifier: A globally unique identifier across registry instances A persistent identifier Can be ported to persistent space at later time Non-opaque identical or mappable to URI local name machine-actionable Should UDFR identifier be opaque or transparent?

Identifiers (3) Node Create a zero-padded numeric sequence for organizational node ids (e.g. “001”) to be used within the identifier. Format Keep version information as it is defined idiosyncractically by the original format creator. Parse it to reveal family and other useful categorizations.

Identifiers (4) UDFRID = (addressable-prefix, “/”, identifier )| (addressable-prefix, “#”, identifier); addressable-prefix = “ | (“ udfr-ezid) ; udfr-ezid = 5 * digit ; identifier = node-id, “/”, entity-code, “/”, local-id, “/”, version-id ; node-id = 3 * digit ; entity-code = “f” | “n” local-id = alpha, {alphanumeric-with-slash} ; version-id digit = [0 – 9] ; alpha = [a-zA-Z] ; alphanumeric = [alpha | digit] alphanumeric-with-slash = [alphanumeric | “/”]. For example:

Goals and guiding principles 1.Support existing functionality and use cases 2.Reuse and map to existing ontologies where it makes sense (“linked data”) 3.Primarily be a descriptive ontology, with the goal of expanding to machine-actionable semantic representations where needed 4.Create natural partitions to modularize 5.Enable for expansion 6.Be consistent 7.Have the application be model-driven (yet domain model-agnostic) as much as possible

Terms ResourceAn object or element expressed in RDF. A resource is identified by a URI. ClassTypically represents a concept. A set of individuals which may possess a set of properties or relationships. InstanceAn individual member of a class. PropertyRepresents a relationship or attribute. Owl divides properties into Object Properties, which relate two resources and Datatype Properties, which relate a resource to a datatype.

Conceptual Entities SimpleBaseEntity – Contains all basic provenance/governance properties such as: administrativeStatus baseNote identifier creationDate, modificationDate veriticationDate, verificationStatus, verifiedBy

Conceptual Entities CoreEntity – Classes where the circumstance of its creation are meaningful: Assessment Document File Format: CharacterEncoding, CompressionTechnique, FileFormat Holding Identifier IntellectualPropertyRightsClaim Product: Hardware and Software Products Has additional properties relating to release information and agents who created them.

Conceptual Entities EnumeratedTypes – Class of Enumerated Type Classes (List of Values) as well as the GDFR Facets. Examples include: ByteOrderType CompressionFamilyTeyp CountryCode DisclosureType DocumentIntentType FormatRoleType LanguageCode MediaType

Conceptual Entities Format – use GDFR definition of Format to include: File Format Character Encoding Compression Technique Most properties are defined at Format level (to be inherited by subclasses) Should we use GDFR definition of Format?

Properties Should the registry support actionable inheritance of properties? For example, should BWF automatically inherit all properties defined for “generic” WAVE? When should inference take place? At UI entry time? Current relationships from GDFR (restricted, extended, …) may be difficult to formalize. Shall we just replace with “isDerivedFrom” property?

Controlled Vocabularies Semantic: RDF, RDFS, OWL Vocabulary/Thesaurus: SKOS Metadata: DC, DCTERMS Agents: FOAF Provenance: OPMV (Open Provenance model Vocabulary) Country Codes/ Language Codes Organization IDs MIME Types ?Governance

Questions/ Concerns ?