Semantic-Web, Triple-Strores, and SPARQL

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
XML: Extensible Markup Language
The Semantic Web – WEEK 4: RDF
RDF Tutorial.
Semantic Web Introduction
Dr. Jim Bowring Computer Science Department College of Charleston CSIS 690 (633) May Evening 2009 Semantic Web Principles and Practice Class 5: 27 May.
Dr. Alexandra I. Cristea RDF.
Knowledge Technologies Manolis Koubarakis 1 An Introduction to RDF Schema.
Resource Description Framework (RDF) developed by World Wide Web Consortium (W3C) by Satya.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
JOSH FLECK Semantic Web. What is Semantic Web? Movement led by W3C that promotes common formats for data on the web Describes things in a way that computer.
Nancy Ide Vassar College USA Resource Definition Framework A Tutorial EUROLAN 2003 July 28 - August 8 Bucharest - Romania.
RDF Semantics by Patrick Hayes W3C Recommendation Presented by Jie Bao RPI Sept 4, 2008 Part 1 of RDF/OWL Semantics Tutorial.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Publishing data on the Web (with.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute.
Logics for Data and Knowledge Representation Resource Description Framework (RDF) Feroz Farazi.
Logics for Data and Knowledge Representation
The Semantic Web Web Science Systems Development Spring 2015.
RDF – Resource Description Framework M. Missikoff – F. Taglino LEKS, IASI-CNR Una piattaforma inferenziale per il Web Semantico: Jena2 Roma, 2006 Web Semantico.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
The Resource Description Framework And its application to thegateway.org For the IIAP Jon Jablonski, Research Assistant The Information.
© 2012 IBM Corporation Best Practices for Publishing RDF Vocabularies Arthur Ryman,
RDF and XML 인공지능 연구실 한기덕. 2 개요  1. Basic of RDF  2. Example of RDF  3. How XML Namespaces Work  4. The Abbreviated RDF Syntax  5. RDF Resource Collections.
Chapter 3 RDF and RDFS Semantics. Introduction RDF has a very simple data model But it is quite liberal in what you can say Semantics can be given using.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
Part II. Property domain and range restriction RDFS allows us to put restriction on the properties by specifying their domain and range, providing the.
1 Artificial Intelligence Applications Institute Centre for Intelligent Systems and their Applications Stuart Aitken Artificial Intelligence Applications.
Of 35 lecture 5: rdf schema. of 35 RDF and RDF Schema basic ideas ece 627, winter ‘132 RDF is about graphs – it creates a graph structure to represent.
RDF Schema (RDFS) RDF user communities need to define the vocabularies (terms) to indicate that they  are describing specific kinds or classes of resources.
Introduction The Resource Description Framework (RDF) is recommended by the World Wide Web Consortium (W3C) to model meta-data about the resources of the.
Practical RDF Chapter 10. Querying RDF: RDF as Data Shelley Powers, O’Reilly SNU IDB Lab. Hyewon Lim.
OIL and DAML+OIL: Ontology Languages for the Semantic Web Sungshin Lim TOWARDS THE SEMANTIC WEB: Ontology-driven Knowledge.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
Ch 7: RDF schema 현근수, 김영욱, 백상윤, 이용현 Team C. Introduction Semantic web modeling In RDF: simply creates graph structure to represent data In RDFS: about.
Important Concepts from the W3C RDF Vocabulary/Schema Sungtae Kim SNU OOPSLA Lab. August 19, 2004.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
RDF and Relational Databases
RDF & SPARQL Introduction Dongfang Xu Ph.D student, School of Information, University of Arizona Sept 10, 2015.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Silterra, April 2004 RDF, RSS and all that THREADING THE RDF MAZE.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall RDF & RDF Schema Machine Understandable Metadata for the.
Semantic Web 06 T 0006 YOSHIYUKI Osawa. Problem of current web  limits of search engines Most web pages are only groups of character strings. Most web.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Linked Data & Semantic Web Technology The Semantic Web Part 7. RDF Semantics Dr. Myungjin Lee.
Semantic Web in Depth RDF Schema Dr Nicholas Gibbins –
1 RDF Storage and Retrieval Systems Jan Pettersen Nytun, UiA.
Service-Oriented Computing: Semantics, Processes, Agents
Charlie Abela Department of Intelligent Computer Systems
Introduction to Persistent Identifiers
An Introduction to RDF Schema
The Semantic Web Part 6. RDF Vocabularies: RDF Schema
Service-Oriented Computing: Semantics, Processes, Agents
Introduction to the Semantic Web (tutorial) 2009 Semantic Technology Conference San Jose, California, USA June 15, 2009 Ivan Herman, W3C
Service-Oriented Computing: Semantics, Processes, Agents
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
Introduction to RDF and RDFS Editor: MR3
RDF 1.1 Concepts and Abstract Syntax
ece 720 intelligent web: ontology and beyond
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Chapter 3 RDF and RDFS Semantics
LOD reference architecture
Semantic Web Basics (cont.)
Presentation transcript:

Semantic-Web, Triple-Strores, and SPARQL BCHB697

Outline Semantic Web Triple-Stores SPARQL URIs & Links, Ontologies/RDF/RDFS Triple-Stores Semi-structured data, again Federation SPARQL SQL for triple-stores / semantic web BCHB697 - Edwards

(HTML) Web-pages Designed for humans to read. Page content Prose, Numbers, Text, Pictures, Layout Anchors/Links w/ Label Search engines exploit the text and links to find useful information PageRank – number of pages that link to you UniProt XRefs – jump to related information Still, how to distinguish between: jaguar & jaguar; stamp & stamp; C4 and C4? BCHB697 - Edwards

Semantic Web Annotate instances with semantics Jaguar (big cat) & Jaguar (car) Stamp (collector's item) & Stamp (one's foot, verb) C4 (gene) & C4 (explosive) Annotate instances with: Semantic information (name, size, value), and Semantic relationships to other instances Who manages the semantics? Anyone with data to describe, but… …reuse others' work to promote interoperability BCHB697 - Edwards

authority / data-provider RESTful web-services RESTful web-services associate URIs with data-model entities: Notice how well this maps to database, table, and row This web-service returns everything about taxonomy id 9606 in XML or JSON format. http://hoyataxa.georgetown.edu/taxa/9606 protocol (http,https) authority / data-provider entity identifier BCHB697 - Edwards

URIs: Universal Resource Identifiers URIs need not actually represent a web-service Pure identifiers, even if no machine or server. Associate semantic properties with URIs Literal values, or other URIs http://hoyataxa.georgetown.edu/taxa/9606 protocol (http,https) authority / data-provider entity identifier BCHB697 - Edwards

RDF: Resource Description Framework XML format for describing instances, and their semantic properties – triples! (subject, predicate, object) Subject: a URI identifying the resource (instance identifier). Predicate: a URI indicates the relationship between Subject and Object (property identifier). Object: a literal value or URI of another resource related to the Subject (property value). BCHB697 - Edwards

RDF: Resource Description Framework Conceptually, this is either: A really tall, thin table, containing the entire database A graph of nodes (subjects, objects) and edges (predicates) Regardless, still need a logical data model (at least in your head) to navigate the information. Predicate Subject Object BCHB697 - Edwards

GlycoConjugate Ontology BCHB697 - Edwards Matthew Campbell

Example Triples (TURTLE) <http://rdf.unicarbkb.org/referencedprotein/P01588> a gco:ReferencedProtein ; gco:glycosylated_at <http://rdf.unicarbkb.org/P01588Region375> ... ; gco:has_protein <http://purl.uniprot.org/uniprot/P01588> ; gco:has_saccharide_set <http://rdf.unicarbkb.org/griffithP01588SaccSet375> ; ... . <http://rdf.unicarbkb.org/P01588Region375> a gco:Glycosylation_site , faldo:region ; gco:has_saccharide_set <http://rdf.unicarbkb.org/griffithP01588SaccSet375> ; faldo:ExactPosition <http://rdf.unicarbkb.org/P01588ExactPositionSer153> . <http://rdf.unicarbkb.org/P01588ExactPositionSer153> a faldo:ExactPosition ; gco:has_amino_acid <http://rdf.unicarbkb.org/amino_acid_ser> ; faldo:position "153^^xsd:int" . <http://rdf.unicarbkb.org/amino_acid_ser> a gco:amino_acid ; gco:amino_acid "Ser" . Matthew Campbell BCHB697 - Edwards

RDF/XML <?xml version="1.0" encoding="utf-8" ?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:oboInOwl="http://www.geneontology.org/formats/oboInOwl#" xmlns:obo="http://purl.obolibrary.org/obo/" > <rdf:Description rdf:about="http://purl.obolibrary.org/obo/PR_000027736"> <rdf:type rdf:resource="http://www.w3.org/2002/07/owl#Class" /> <rdfs:label rdf:datatype="http://www.w3.org/2001/XMLSchema#string">neuraminidase subtype N2 (Influenza A virus)</rdfs:label> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string">Category=organism-gene. Requested by=IEDB. Requested by=ImmPort.</rdfs:comment> <rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/PR_000049742" /> <rdfs:subClassOf rdf:nodeID="b45527437" /> <oboInOwl:hasExactSynonym rdf:datatype="http://www.w3.org/2001/XMLSchema#string">fluA-NA(N2)</oboInOwl:hasExactSynonym> <oboInOwl:hasOBONamespace rdf:datatype="http://www.w3.org/2001/XMLSchema#string">protein</oboInOwl:hasOBONamespace> <oboInOwl:id rdf:datatype="http://www.w3.org/2001/XMLSchema#string">PR:000027736</oboInOwl:id> <obo:IAO_0000115 rdf:datatype="http://www.w3.org/2001/XMLSchema#string">A neuraminidase (Influenza A virus) that is expressed on the surface of Influenza A virus and has similar antigenic properties, i.e., it will be neutralized by a similar set of antibodies. Example: UniProtKB:P06820.</obo:IAO_0000115> </rdf:Description> <rdf:Description rdf:nodeID="b45527437"> <rdf:type rdf:resource="http://www.w3.org/2002/07/owl#Restriction" /> <owl:onProperty rdf:resource="http://purl.obolibrary.org/obo/RO_0002160" /> <owl:someValuesFrom rdf:resource="http://purl.obolibrary.org/obo/NCBITaxon_11320" /> </rdf:RDF> BCHB697 - Edwards

RDFS: RDF Schema Data modeling vocabulary (ontology) for classes and properties rdfs:Class, rdfs:subClassOf, rdf:Property, rdfs:subPropertyOf, rdfs:range, rdfs:domain, rdfs:Resource, rdfs:Literal, rdfs:Datatype, rdfs:type, rdfs:comment, rdfs:label, rdfs:seeAlso, rdfs:isDefinedBy BCHB697 - Edwards

Ontology Formal definition of terms and their conceptual definitions: In particular, classes and their properties OWL – Web Ontology Language RDF document with specific classes and properties for defining ontologies Public, facilitates data-use/reuse Compare with: Logical data-model for a relational database BCHB697 - Edwards

Triple-Stores Database for storing RDF triples, and efficiently querying them Constrain subject, predicate, and/or object Triple-store query as web-service, or dump RDF/XML Semi-structured, similar to document stores From one extreme to the other… Extreme "(de-)normalized form" Ontology / logical data-model is crucial BCHB697 - Edwards

Federation / Federated Queries Triple-stores can easily be concatenated: …even virtually, with triples staying put. However, this only makes sense if Both triple-stores agree on classes, properties Both triple-stores agree on URIs NOTE: True for any data-integration project Done right, federated queries of multiple triple-stores can be executed automatically …across multiple independent triple-stores BCHB697 - Edwards

SPARQL SPARQL Protocol and RDF Query Language SQL-like query language for triples BCHB697 - Edwards

SPARQL URI Namespaces: Result clause: Query pattern: Placeholders: BCHB697 - Edwards

SPARQL URI Namespaces: Result clause: Query pattern: Placeholders: rdfs:type BCHB697 - Edwards

SPARQL BCHB697 - Edwards

SPARQL Not quite as expressive as SQL …but provides a significant subset of its functionality Properties must be present to be matched Absence of property values is difficult to query Primarily use "equality" clauses Consequences for data-modeling strategies Multi-values appear as multi-triples BCHB697 - Edwards

Exercise Explore the SPARQL endpoint at Uniprot: https://sparql.uniprot.org/ Lots of interesting queries here! Come up with a hypothesis about how the triples are used to represent UniProt entries BCHB697 - Edwards