Download presentation
Presentation is loading. Please wait.
1
Cornell CS 502 Resource Description Framework Building the Semantic Web CS 502 – 20020226 Carl Lagoze – Cornell University Acknowledgements: Eric Miller Dieter Fensel
2
Cornell CS 502 Motivating the “Semantic Web” M. Doe illustrated the book “Best Stories” Mary Doe animated the cartoon “Best Stories – the movie” Illustration is a type of contribution animation is a type of contribution M. Doe and Mary Doe are pseudonyms for Susan Mann Show me the works to which Susan Mann contributed? Cartoons and Books are types of Works
3
Cornell CS 502 Modeling & Encoding Metadata Components: RDF RDF (Resource Description Format) The instantiation of the Warwick Framework on the Web –Support for and integration of multiple independent metadata vocabularies Provides enabling technology for richly-structured metadata Rich data model supporting notions of distinct entities and properties Primitives permit semantic inferencing Expressible in machine readable manner (e.g., XML)
4
Cornell CS 502 RDF Components Formal data model Syntax for interchange of data Schema Type system (schema model) Syntax for machine-understandable schemas Query and profile protocols Ontologies layered on top
5
Cornell CS 502 RDF Data Model Imposes structural constraints on the expression of application data models – for consistent encoding, exchange and processing of metadata –Provides for structural interoperability Enables resource description communities to define their own semantics
6
Cornell CS 502 RDF Data Model Directed labeled graphs Model elements –Resource –Property –Value –Statement –Containers
7
Cornell CS 502 RDF Model Primitives Resource Property Value Resource Statement
8
Cornell CS 502 Simple Example Resource Author “Eric”
9
Cornell CS 502 RDF Syntax RDF Model defines a formal relationships among resources, properties and values Syntax is required to... –Store instances of the model into files –Communicate files from one application to another XML is one well-supported syntax There are syntax alternatives –Relational databases –Triple Stores
10
Cornell CS 502 RDF Model Example #1 URI:R “CIMI Presentation” Title Creator dc: “Eric Miller”
11
Cornell CS 502 RDF Syntax Example #1 URI:R “CIMI Presentation” Title Creator dc: “Eric Miller” <RDF xmlns = “http://www.w3.org/TR/WD-rdf-syntax#” xmlns:dc = “http://purl.org/dc/elements/1.0/”> CIMI Presentation Eric Miller
12
Cornell CS 502 “Eric Miller” RDF Model Example #2 URI:R URI:ERIC “emiller@ oclc.org” “Eric Miller” “OCLC” bib:Emailbib:Aff bib:Name URI:OCLC “CIMI Presentation” Title Creator oa: dc:
13
Cornell CS 502 <RDF xmlns = “http://www.w3.org/TR/WD-rdf-syntax#” xmlns:dc = “http://purl.org/dc/elements/1.0/” xmlns:bib = “http://www.bib.org/persons#”> CIMI Presentation Eric Miller emiller@oclc.org RDF Syntax Example #2
14
Cornell CS 502 “Eric Miller” RDF Model Example #3 URI:R URI:ERIC “emiller@ oclc.org” “Eric Miller” “OCLC” bib:Emailbib:Aff bib:Name URI:OCLC “CIMI Presentation” Title Creator admin:By admin:On “LOC” “03-09-99” admin:For “...” dc:
15
Cornell CS 502 RDF Containers Permit the aggregation of several values for a property Express multiple aggregation semantics –unordered –sequential or priority order –alternative
16
Cornell CS 502 RDF Containers Permit the aggregation of several values for a property Express multiple aggregation semantics –unordered –sequential or priority order –alternative
17
Cornell CS 502 RDF Containers Bag –unordered grouping Sequence –ordered grouping Alternatives –alternate values need to choose –at least one value –first value is default or preferred value
18
Cornell CS 502 RDF - Bag Unordered group “Carl Lagoze and Stuart Weibel are co-authors” Carl Lagoze Stuart Weibel
19
Cornell CS 502 RDF - Sequence Ordered or priority group “Carl Lagoze is primary author and Stuart Weibel is second author” Carl Lagoze Stuart Weibel
20
Cornell CS 502 RDF - Alt Client chooses one of several values First value is default “The distance is 15 kilometers or 9.3 miles” 15KM 9.3M
21
Cornell CS 502 Formalizing the RDF model – Thinking in triples RDF basic types –rdf:Resource – everything that can be identified (with a URI) –rdf:Property – specialization of a resource expressing a binary relation between two resources –rdf:statement – a triple with properties rdf:subject, rdf:predicate, rdf:object An RDF statement is a triple consisting of a resource (subject), a property and a second resource (object) –(:s :p :o) Expressible also as binary relations –P(S,O) – e.g., Title(R, “War & Peace”)
22
Cornell CS 502 RDF triple model
23
Cornell CS 502 RDF statements and basic types WYA creator Digital Libraries rdf:subject rdf:predicate rdf:object rdf:statement rdf:property “CL says ‘WYA wrote Digital Libraries’”
24
Cornell CS 502 Reification – Statements about statements “CL says ‘WYA wrote Digital Libraries’” WYA creator Digital Libraries rdf:subject rdf:predicate rdf:object rdf:statement rdf:property CL assertedBy
25
Cornell CS 502 From Graphs to Triples alice betty charles doris eve
26
Cornell CS 502 Expressing Collection Primitives in Binary Relations
27
Cornell CS 502 RDF Schemas Declaration of vocabularies –properties defined by a particular community –characteristics of properties and/or constraints on corresponding values Schema Type System - Basic Types –Property, Class, SubClassOf, Domain, Range –Minimal (but extensible) at this time –Expressible in the RDF model and syntax
28
Cornell CS 502 Schema Vocabularies Enables communities to share machine readable tokens and locally define human readable labels. dc:Creator “Nom” rdfs:label “Author” rdfs:label “$100 $a” rdfs:label
29
Cornell CS 502 Relationships among vocabularies dc:Creator ms:director marc:100 bib:Author
30
Cornell CS 502 Relationships among vocabulary elements URI:R “John Smith” ms:director dc:Creatorms:director rdfs: subPropertyOf rdfs:label “Director” dc:Creator
31
Cornell CS 502 RDF Schema: Specializing Properties rdfs:subPropertyOf – allows specialization of relations –E.g., the property “father” is a subPropertyOf the property parent subProperty semantics
32
Cornell CS 502 Sub-Property Semantics
33
Cornell CS 502 Constraints on Properties Force objects to be of a certain type rdfs:domain –Restricts the type of resources that may have a specific property rdfs:range –Restricts the type of resources that may be the value of a specific property
34
Cornell CS 502 Inferences from Constraints
35
Cornell CS 502 Class Hierarchy rdfs:Class –Resources denoting a set of resources; range of rdf:type rdfs:subClassOf –Create class hierarchy rdf:type rdfs:class rdfs:subClassOf rdf:type rdf:class rdf:type rdf:class
36
Cornell CS 502 Sub-Class Inferencing
37
Cornell CS 502 Sub-class Inferencing Example
38
Cornell CS 502 Storing and querying RDF models – Relational DB Issues –Scalability: potentially huge # of triples –Tables: number, sparseness, joins –Queries: how and how expensive –Reification?
39
Cornell CS 502 Storing and querying RDF models – SQUISH SELECT ?sal, ?t, ?x FROM http://ilrt.org/discovery/2000/11/rss-query/jobs- rss.rdf, http://ilrt.org/discovery/2000/11/rss-query/jobs.rss WHERE (job::advertises ?x ?y) (job::salary ?y ?sal) (job::title ?y ?t) AND ?sal > 55000 USING job for http://ilrt.org/discovery/2000/11/rss-query/jobvocab.rdf# http://swordfish.rdfweb.org:8085/rdfquery/index.html
40
Cornell CS 502 Where do you stop? Model provides enabling technology Degree of metadata simplicity/complexity is a matter of: –Resource description communities needs, best-practice and experience –Organization/Institution’s Policy –Economics –Goals and requirements of implementation
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.