 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.ie Linked Broken Data? Dr Axel.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute 1 From OntoSelect to OntoSelect-SWSE.
Copyright 2007 Digital Enterprise Research Institute. All rights reserved. SEMEDIA PARENTAL ADVISORY: Neither formulas nor inference rules.
Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Uniasdsd Dynamic querying.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. The Future is Now JeromeDL A Digital Library on Social Semantic.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute 0:45:001 LODPeas: Like Peas.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Linked Broken Data? Dr Axel.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Using Datalog for Rule-Based.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Semantic Web Technologies:
Linked-data Architecture Payam Barnaghi Centre for Communication Systems Research University of Surrey FIA Budapest Linked data session Budapest, May 2010.
Chapter 3 Querying RDF stores with SPARQL. TL;DR We will want to query large RDF datasets, e.g. LOD SPARQL is the SQL of RDF SPARQL is a language to query.
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Anatomy of a Semantic Virus.
5/17/20151 FOAF. 5/17/20152 Introduction Metadata is data about data The terms refer to data used to identify, describe, or locate information resources.
The Web of data with meaning... By Michael Griffiths.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. The Digital Enterprise Research Institute Stefan Decker Digital.
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Context Dependent Reasoning.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 The Architecture of a Large-Scale Web Search and Query Engine.
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Live Linked Open Sensor.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
Research Problems in Semantic Web Search Varish Mulwad ____________________________ 1.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 John Breslin (for Stefan Decker) Site Interoperability Projects.
Using Java in Linked Data Applications Fuming Shih Oct 12.
ONTOLOGY ENGINEERING Lab #9 - November 3, Linking Relational Databases to Ontologies 2  Relational databases are still a common means of storing.
Semantic Web Andrejs Lesovskis. Publishing on the Web Making information available without knowing the eventual use; reuse, collaboration; reproduction.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Publishing data on the Web (with.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved Digital Enterprise Research Institute Semantic Search for CMS IKS.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards a Social Notion of Provenance on the Web Andreas Harth,
Where is the Semantic Web?
 Copyright 2007 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute (DERI) Galway
OWL vs. Linked Data: Experiences and Directions Axel Polleres Siemens AG Österreich 1 27/05/2013.
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Using/Extending RIF and.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Can Semantics catch up with.
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
1 Virtualisation and Validation of Smart City Data Dr Sefki Kolozali Institute for Communication Systems Electronic Engineering Department University of.
UMBC an Honors University in Maryland 1 Search Engines for Semantic Web Knowledge Tim Finin University of Maryland, Baltimore County Joint work with Li.
© Copyright 2013 STI INNSBRUCK Linked Open Data Anna Fensel, Ioannis Stavrakantonakis,
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute The Digital Enterprise Research.
JSON-LD. JSON as an XML Alternative JSON is a light-weight alternative to XML for data- interchange JSON = JavaScript Object Notation – It’s really language.
 Copyright 2007 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute The Social Semantic Desktop.
 Copyright 2007 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Scalable Authoritative OWL.
Problems in Semantic Search Krishnamurthy Viswanathan and Varish Mulwad {krishna3, varish1} AT umbc DOT edu 1.
Rapid Prototyping of Semantic Mash-Ups through Semantic Web Pipes Danh Le-Phuoc, Axel Polleres, Manfred Hauswirth, Giovanni Tummarello 1, Christian Morbidoni.
Internet Sources An Introduction to evaluating information on the Internet.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Reasoning and Querying for.
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
Portal Support for Communities of Interest In Ireland NUIGalway Ina O’Murchu Stefan Decker.
SPINNING THE SEMANTIC WEB APPLICATIONS FOR THE MODERN ERA LIBRARIES
Dr. Lowell Vizenor Ontology and Semantic Technology Practice Lead Alion Science and Technology Semantic Technology: A Basic Introduction.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Expert (and Novice) Finding.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 A Sitemap extension to enable efficient interaction with large.
Working Ontologist Ch. 9 Using RDFS-Plus in the wild 구해모 김윤승 박상훈 송용주 임유빈.
Chapter 3 Querying RDF stores with SPARQL
Alexandra Cristea 1.  pronounced "sparkle“  recursive acronym for: ◦ SPARQL Protocol and RDF Query Language  a semantic query language  a query language.
Microsoft Research Faculty Summit Jennifer Golbeck Assistant Professor, College of Information Studies University of Maryland, College Park Social.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Linked Broken Data? Dr Axel.
UMBC an Honors University in Maryland 1 Finding and Ranking Knowledge on the Semantic Web Li Ding, Rong Pan, Tim Finin, Anupam Joshi, Yun Peng and Pranam.
Introduction to the Semantic Web Jeff Heflin Lehigh University.
Aidan Hogan, Antoine Zimmermann, Jürgen Umbrich, Axel Polleres, Stefan Decker Presented by Joseph Park SCALABLE AND DISTRIBUTED METHODS FOR ENTITY MATCHING,
UMBC an Honors University in Maryland 1 Searching for Knowledge and Data on the Semantic Web Tim Finin University of Maryland, Baltimore County
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
Semantic Web in Depth RDFa, GRDDL and POWDER Dr Nicholas Gibbins
What is the Semantic Web? 17 th XBRL International Conference Eindhoven, the Netherlands 5 st May, 2008 Ivan Herman, W3C.
@ How the Semantic Web is Being Used: An Analysis of FOAF Documents Li Ding, Lina Zhou, Tim Finin, Anupam Joshi eBiquity Lab, Department of CSEE University.
OData “The low hanging fruit of the Semantic Web”
SPARQL SPARQL Protocol and RDF Query Language
Now how do I aggregate/process all this RDF data out there?
Linked Open Data and federated search
JSON for Linked Data: a standard for serializing RDF using JSON
Linked Data Ryan McAlister.
Presentation transcript:

 Copyright 2009 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Linked Broken Data? Dr Axel Polleres Digital Enterprise Research Institute, NationaI University of Ireland, Galway Based on joint work with Aidan Hogan, Andreas Harth, Renaud Delbru, Giovanni Tummarello, Stefan Decker

Digital Enterprise Research Institute How can I reason about Web Data in a Semantic Search Engine? Datawarehouse approach, e.g. SWSE  crawling, harvesting, SPARQL interface, RDFS+resricted OWL reasoning Search/Lookup indices for the Semantic Web, e.g. Sindice  Indexing RDF sources on the Web, go there and query yourself 2

Digital Enterprise Research Institute Why is complete reasoning non-optimal anyways? Our use case: Search the Semantic Web!  Hypothetically: The explosive semantics of inconsistencies in DL/FOL reasoning would spoil our results.  What if we throw all into one big KB? one inconsistency… a owl:differentFrom a. :me ex:age “old”^^xs:integer. … would make everything true. 3

Digital Enterprise Research Institute Inconsistencies/wrong inferences on Web Data 4 main reasons  Publishers deliberately publish spoilt data (“SPAM”)  Opinions differ  “URI-sense” ambiguities  Accidently wrong/inconsistent 4 Least common Most common

Digital Enterprise Research Institute Publishers deliberately publish spoilt data (“SPAM”) Examples:  a owl:differentFrom a.  Can occur for “testdata” being published, deliberate SPAM can become an issue, as the SW grows! 5

Digital Enterprise Research Institute Opinions differ Fictitous Example Ontology: Originofthings.example.org: o1:surpremePower owl:disjointWith o1:naturalPhenomenom. o1:originsFrom rdf:type owl:functionalProperty. o1:god rdf:type o1:surpremePower. o1:evolution rdf:type o1:naturalPhenomenom. darwin.example.org: ex:mankind o1:originsFrom o1:evolution. creationism.example.org: ex:mankind o1:originsFrom o1:god FlyingSpaghettimonster.org fsm::theSpaghettiMonster rdf:type surpremePower. ex:mankind o1:originsFrom fsm:theSpaghettiMonster. 6

Digital Enterprise Research Institute “URI-sense” ambiguities foaf:knows i.e., why do I have to use a different URI for myself and my homepage? Many people don’t understand/like this and make mistakes. But is this really a mistake or a design error? 7

Digital Enterprise Research Institute Accidentially inconsistent data :me ex:age "old"^^xs:integer. can e.g. arise from an exporter, that collects age from a form Source1 (faulty): TimBL foaf:homepage TimBL rdf:type foaf:Person. W3.org: W3C foaf:homepage W3C rdf:type foaf:Organisation. Did occur in our Web crawls at some point, people don’t have the right semantics in mind! Suspiciously resembles problems with e.g. flawed HTML … browsers, normal search engines still have to deal with it  So do we! 8

Digital Enterprise Research Institute Accidently wrong (non-inconsistent data) FOAF Ontology: foaf:mbox rdf:type owl:InverseFunctionalProperty Careless FOAF exporters produce something like this for any empty address: ex:alice foaf:mbox “mailto:” ex:bob foaf:mbox “mailto:” … IFP reasoning (Rules: owl1-4) on Web Data equates too many things! Dangerous! 9

Digital Enterprise Research Institute How can I reason about Web Data in a Semantic Search Engine? In these systems we deal with inconsistencies/noise in one or the other way…

Digital Enterprise Research Institute RDFAlerts Checks and analyses common mistakes Short Demo. 11

Digital Enterprise Research Institute Visit: 12 Already several successes in finding/fixing: FOAF, dbpedia, NYtimes, even W3C specs… etc.