Christian Bizer: Fusing the Web of Data (12/08/2008) 3rd Asian Semantic Web Conference (ASWC 2008) DIST Workshop, Bangkok, Thailand 8 December 2008

Presentation transcript:

Fusing the Web of Data
Christian Bizer, Freie Universität Berlin
3rd Asian Semantic Web Conference (ASWC 2008), DIST Workshop
Bangkok, Thailand, 8 December 2008

Overview
1. The Web of Data
   - Linked Data Principles
   - Linked Data Deployment
   - Applications that consume Linked Data
2. Linked Data Fusion
   1. The Linking Process
   2. Inconsistency Resolution
   3. Provenance Tracking and Explanations

The Classic Web
[Diagram: HTML documents A, B, and C connected by hyperlinks and accessed through Web browsers and search engines]
Single global information space
1. URLs as globally unique IDs and retrieval mechanism
2. HTML as shared content format
3. Hyperlinks
Shortcomings
- Content is not well structured
- You cannot ask expressive queries
- You cannot process content within applications

Linked Data
[Diagram: data sources A to E containing things that are connected by typed links]
Use Semantic Web technologies to
1. publish structured data on the Web,
2. set links between data from one data source to data within other data sources.

Linked Data Principles
1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful RDF information.
4. Include RDF statements that link to other URIs so that they can discover related things.
Tim Berners-Lee
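Principles 2 and 3 amount to an ordinary HTTP lookup in which the client asks for RDF instead of HTML. The following is a minimal sketch of such a lookup, not code from the talk: the function names, the `Accept` value, and the expanded DBpedia URI are assumptions made for illustration.

```python
from urllib.request import Request, urlopen

# Assumed media types for illustration: prefer RDF/XML, accept Turtle as a fallback.
RDF_ACCEPT = "application/rdf+xml, text/turtle;q=0.9"


def build_rdf_request(uri: str) -> Request:
    """Build an HTTP GET request that asks for RDF rather than HTML."""
    return Request(uri, headers={"Accept": RDF_ACCEPT})


def dereference(uri: str, timeout: float = 10.0) -> bytes:
    """Look up a URI over HTTP and return the raw response body.

    urlopen follows redirects, so a redirect from the URI of a thing to a
    document that describes it is handled transparently.
    """
    with urlopen(build_rdf_request(uri), timeout=timeout) as response:
        return response.read()


# Only the request is built here; calling dereference() needs network access.
request = build_rdf_request("http://dbpedia.org/resource/Berlin")
print(request.get_method(), request.full_url)
print("Accept:", request.get_header("Accept"))
```

A client would pass the returned bytes to an RDF parser and then repeat the lookup for any URIs it finds in the data (principle 4).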

The RDF Data Model
[Diagram: RDF graph with the following statements]
pd:cygri rdf:type foaf:Person
pd:cygri foaf:name "Richard Cyganiak"
pd:cygri foaf:based_near dbpedia:Berlin
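The RDF data model can be pictured as a set of subject-predicate-object triples. Here is a small sketch in Python, assuming the labels on this slide pair up as three statements about pd:cygri; the `objects` helper is hypothetical and only illustrates how a graph is queried by pattern.

```python
# A triple is (subject, predicate, object); prefixed names stand in for full URIs.
Triple = tuple[str, str, str]

graph: set[Triple] = {
    ("pd:cygri", "rdf:type", "foaf:Person"),
    ("pd:cygri", "foaf:name", '"Richard Cyganiak"'),
    ("pd:cygri", "foaf:based_near", "dbpedia:Berlin"),
}


def objects(graph: set[Triple], subject: str, predicate: str) -> list[str]:
    """Return all objects of triples that match the given subject and predicate."""
    return sorted(o for s, p, o in graph if s == subject and p == predicate)


print(objects(graph, "pd:cygri", "foaf:based_near"))
```

Because the graph is just a set, data retrieved from another source can be merged with a plain set union, which is what makes following links across sources cheap.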

Data objects are identified with HTTP URIs
[Diagram: the RDF graph about pd:cygri, with the prefixed names expanded to full HTTP URIs]
pd:cygri =
dbpedia:Berlin =

Dereferencing URIs over the Web
[Diagram: dereferencing dbpedia:Berlin extends the graph about pd:cygri with a dp:population value and a skos:subject link to dp:Cities_in_Germany]

Dereferencing URIs over the Web
[Diagram: dereferencing dp:Cities_in_Germany extends the graph further; dbpedia:Hamburg and dbpedia:Muenchen are also connected to it via skos:subject]

The Disco – Hyperdata Browser

2. Linked Data Deployment on the Web
[Diagram: data sources A to E containing things that are connected by typed links]
- Is this real?

W3C Linking Open Data Project
- Community effort to
  - publish existing open-license datasets as Linked Data on the Web
  - interlink things between different data sources

LOD Datasets on the Web: May 2007
- Over 500 million RDF triples
- Around 120,000 RDF links between data sources

Example RDF Links
- RDF links from DBpedia to other data sources
- RDF link from a FOAF profile to DBpedia
owl:sameAs. foaf:topic_interest. owl:sameAs.

LOD Datasets on the Web: February 2008

LOD Datasets on the Web: September 2008
- > 2 billion RDF triples
- > 6 million RDF links

The Bio2RDF Project
- Goals
  1. Make bioinformatics data available in RDF format on the Web.
  2. Promote the linked data vision within the bioinformatics community.
  3. Answer questions which were not possible or practical to ask before.
- Participants
  - Université Laval, Canada
  - Queensland University of Technology, Australia

The Bio2RDF Cloud
- 27 data sources
- 260 million records
- 2.7 billion RDF triples

3. Applications
[Diagram: Linked Data browsers, Linked Data mashups, and search engines on top of data sources A to E, which are connected by typed links]
- What can I do with this?

Linked Data Browsers
- Tabulator Browser (MIT, USA)
- Disco Hyperdata Browser (FU Berlin, DE)
- OpenLink RDF Browser (OpenLink, UK)
- Zitgist RDF Browser (Zitgist, USA)
- Humboldt (HP Labs, UK)
- Fenfire (DERI, Ireland)
- Marbles (FU Berlin, DE)

Linked Data Mashups
- Domain-specific applications using Linked Data from the Web

DBtune Slashfacet
- Visualizes music-related Linked Data
- Uses LastFM, MySpace, and BBC data

DBpedia Mobile
- Geospatial entry point into the Web of Data
- Starts with DBpedia, Revyu and Flickr data

DERI Semantic Web Pipes

Web of Data Search Engines
- Falcons (IWS, China)
- Sindice (DERI, Ireland)
- MicroSearch (Yahoo, Spain)
- Watson (Open University, UK)
- SWSE (DERI, Ireland)
- Swoogle (UMBC, USA)

Falcons

Is this good enough? No.

2. Linked Data Fusion
[Diagram: an application builds an integrated view over Data Objects 1 to 6, which come from data sources A, B, and C and are connected by owl:sameAs links]
Users want an integrated view of all data that is available about a real-world entity!

Linked Data Fusion - Requirements
1. Map data into a single schema so that data can be rendered and queried properly.
2. Smush data from all sources about a single real-world entity while keeping track of information provenance.
3. Resolve inconsistencies in the data by applying different data fusion heuristics.
4. Be able to explain the fusion process (Tim Berners-Lee's "Oh, yeah?" button).
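Requirement 2 can be sketched as a union-find pass over owl:sameAs links, followed by regrouping every statement under one canonical URI while remembering its source. This is an illustrative sketch under stated assumptions, not the talk's implementation: the `smush` function, the choice of the lexicographically smallest URI as canonical name, and all sample values are hypothetical.

```python
from collections import defaultdict


def smush(statements, same_as):
    """Merge descriptions of URIs that are connected by owl:sameAs links.

    statements: iterable of (source, subject, predicate, value)
    same_as:    iterable of (uri_a, uri_b) pairs
    Returns {canonical_uri: {predicate: [(value, source), ...]}}.
    """
    parent = {}

    def find(uri):
        parent.setdefault(uri, uri)
        while parent[uri] != uri:
            parent[uri] = parent[parent[uri]]  # path halving
            uri = parent[uri]
        return uri

    for a, b in same_as:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Assumption: the lexicographically smaller URI becomes the canonical name.
            low, high = sorted((root_a, root_b))
            parent[high] = low

    merged = defaultdict(lambda: defaultdict(list))
    for source, subject, predicate, value in statements:
        merged[find(subject)][predicate].append((value, source))
    return merged


# Hypothetical sample data: two sources describe the same city.
statements = [
    ("dbpedia", "dbpedia:Berlin", "rdfs:label", "Berlin"),
    ("geonames", "geonames:2950159", "geo:lat", "52.52"),
    ("dbpedia", "dbpedia:Berlin", "geo:lat", "52.5167"),
]
same_as = [("dbpedia:Berlin", "geonames:2950159")]

merged = smush(statements, same_as)
print(dict(merged["dbpedia:Berlin"]))
```

Keeping the source next to each value is what later allows inconsistency resolution and explanations: the two geo:lat values stay side by side instead of one silently overwriting the other.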

Roles in the Linked Data Scenario
- Data Publisher
  1. Publish the data itself.
  2. Set RDF links to other data items describing the same real-world entity.
  3. Reuse terms from existing vocabularies or set links to related schemata.
  4. Publish metadata about provenance, timeliness, and data license.
- Client Application
  1. Map data into a single schema.
  2. Smush data from different sources about a real-world entity.
  3. Resolve inconsistencies in the data.
  4. Keep track of information provenance and lineage.
  5. Explain the fusion process.

2.1 Setting RDF Links
- Today: Simple pattern- and graph-matching techniques are used to generate links. Usually proprietary code.
- There is a lot of existing work on identity resolution in the database and knowledge representation communities that can be reused:
  - Rule-based approaches
  - Distance-based techniques
  - Probabilistic matching
  - Supervised and unsupervised learning
  - Using a wide range of distance metrics
- see: Elmagarmid et al.: Duplicate Record Detection: A Survey. IEEE TKDE, 2007.

Linking Frameworks
- Goal: (Semi-)automatically generate RDF links based on declarative rules.
- Ongoing work
  - Oktie Hassanzadeh (University of Toronto): ODDLinker
  - Andriy Nikolov et al. (Open University): KnoFuss
  - Julius Volz (Freie Universität Berlin): XXXX
- seeAlso: EquivalenceMining

    CREATE LINKS owl:sameAs
    BETWEEN a FROM dbpedia AND b FROM factbook
    RESTRICT a TO { ?a rdf:type dbpedia-owl:Country }
    METRIC {
      STRING_SIMILARITY(a/rdfs:label, b/rdfs:label),
      NUM_SIMILARITY(a/p:populationEstimate, b/factbook:population_total),
      NUM_SIMILARITY(a/p:areaKm, b/factbook:area_total)
    }
    THRESHOLDS MATCH 0.9 VERIFY 0.7;
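To make the rule on this slide concrete, here is a rough Python sketch of how such a rule might be evaluated for one candidate pair. The slide does not say how the three metric values are combined or how the similarity functions are defined, so the unweighted mean, the `difflib`-based string similarity, the relative-difference numeric similarity, and all sample records below are assumptions.

```python
from difflib import SequenceMatcher

MATCH, VERIFY = 0.9, 0.7  # thresholds taken from the rule on the slide


def string_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1] (assumed metric)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def num_similarity(a: float, b: float) -> float:
    """1 minus the relative difference, clamped to [0, 1] (assumed metric)."""
    largest = max(abs(a), abs(b))
    return 1.0 if largest == 0 else max(0.0, 1.0 - abs(a - b) / largest)


def classify(a: dict, b: dict) -> tuple[str, float]:
    """Score a candidate pair and decide: link, verify manually, or no link."""
    scores = [
        string_similarity(a["label"], b["label"]),
        num_similarity(a["population"], b["population"]),
        num_similarity(a["area_km"], b["area_km"]),
    ]
    score = sum(scores) / len(scores)  # assumption: unweighted mean
    if score >= MATCH:
        return "link", score
    if score >= VERIFY:
        return "verify", score
    return "no link", score


# Hypothetical records, not real DBpedia or Factbook values.
dbpedia_country = {"label": "Germany", "population": 82_000_000, "area_km": 357_000}
factbook_same = {"label": "Germany", "population": 82_400_000, "area_km": 357_021}
factbook_other = {"label": "Chad", "population": 10_000_000, "area_km": 1_284_000}

print(classify(dbpedia_country, factbook_same)[0])
print(classify(dbpedia_country, factbook_other)[0])
```

The two thresholds split candidate pairs into three bands: pairs at or above MATCH would become owl:sameAs links, pairs between VERIFY and MATCH would go to a human for confirmation, and the rest are dropped.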

Schema Level RDF Links
- Today: Simple mappings
  - owl:equivalentClass
  - owl:equivalentProperty
  - rdfs:subClassOf
  - rdfs:subPropertyOf
- UMBEL effort:
- Lots of existing work on schema/ontology matching to build on.
- Missing: Agreed-upon way to publish more expressive mapping rules on the Web.
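A client can apply simple property-level mappings by rewriting predicates before querying. The sketch below assumes, purely for illustration, that the Factbook and DBpedia properties named in the link rule of the previous slide have been declared equivalent; the mapping table, the `map_to_target_schema` helper, and the sample triples are hypothetical.

```python
# Source term -> target term, as a client might collect from owl:equivalentProperty
# statements. These particular pairs are assumed for illustration.
EQUIVALENT_PROPERTY = {
    "factbook:population_total": "p:populationEstimate",
    "factbook:area_total": "p:areaKm",
}


def map_to_target_schema(triples, property_map):
    """Rewrite predicates that have a known mapping; pass all others through."""
    return [(s, property_map.get(p, p), o) for s, p, o in triples]


# Hypothetical source data.
factbook_triples = [
    ("factbook:Germany", "factbook:population_total", "82400000"),
    ("factbook:Germany", "rdfs:label", "Germany"),
]

for triple in map_to_target_schema(factbook_triples, EQUIVALENT_PROPERTY):
    print(triple)
```

A lookup table like this covers only one-to-one renamings, which is the limitation the slide points at: unit conversions or value transformations need more expressive mapping rules.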

2.2 Publish Metadata
- Document Metadata
  - Dublin Core, Semantic Web Publishing Vocabulary
- Licensing Metadata
  - Creative Commons Licensing Framework
  - Open Data Commons Public Domain Dedication & Licence (PDDL)

    # Metadata and Licensing Information
    rdf:type foaf:Document ;
    dc:publisher ;
    dc:date " "^^xsd:date ;
    dc:rights.

    # The Document Content
    rdf:type foaf:Person ;
    foaf:name "Empire, Alec" ;
    dbpedia-owl:associatedBand dbpedia:Atari_Teenage_Riot ;

2.3 Provenance and Lineage Tracking
- Named Graphs data model
  - part of W3C SPARQL Recommendation
  - implemented by an increasing number of RDF stores

    # TriG Representation of three Named Graphs
    :G1 { :Monica ex:name "Monica Murphy".
          :Monica ex:homepage.
          :Monica ex: . }
    :G2 { :Monica rdf:type ex:Person.
          :Monica ex:hasSkill ex:Programming }
    :G3 { :G1 swp:assertedBy _:w1.
          _:w1 swp:authority :Chris.
          _:w1 dc:date " "^^xsd:date.
          :G2 swp:quotedBy _:w2.
          _:w2 swp:authority :Chris.
          _:w2 dc:date " "^^xsd:date. }
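One way to picture the named-graphs example is as a list of quads, where the fourth element names the graph a triple belongs to. The sketch below is a simplified illustration rather than an RDF store: it keeps only the statements of the example that survive complete in the transcript, and the `provenance` helper is hypothetical.

```python
# Quads are (graph, subject, predicate, object).
quads = [
    (":G1", ":Monica", "ex:name", '"Monica Murphy"'),
    (":G2", ":Monica", "rdf:type", "ex:Person"),
    (":G2", ":Monica", "ex:hasSkill", "ex:Programming"),
    (":G3", ":G1", "swp:assertedBy", "_:w1"),
    (":G3", "_:w1", "swp:authority", ":Chris"),
    (":G3", ":G2", "swp:quotedBy", "_:w2"),
    (":G3", "_:w2", "swp:authority", ":Chris"),
]

WARRANT_RELATIONS = ("swp:assertedBy", "swp:quotedBy")


def provenance(quads, subject, predicate, obj):
    """Return (graph, relation, authority) for every graph containing the triple."""
    results = []
    graphs = [g for g, s, p, o in quads if (s, p, o) == (subject, predicate, obj)]
    for graph in graphs:
        for _, s, relation, warrant in quads:
            if s == graph and relation in WARRANT_RELATIONS:
                authorities = [
                    o for _, ws, wp, o in quads
                    if ws == warrant and wp == "swp:authority"
                ]
                results.extend((graph, relation, a) for a in authorities)
    return results


print(provenance(quads, ":Monica", "ex:hasSkill", "ex:Programming"))
```

The lookup distinguishes a statement that :Chris asserts (graph :G1) from one that :Chris merely quotes (graph :G2), which is the kind of distinction a trust policy can act on.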

2.4 Inconsistency Resolution
- There is a lot of overlap between LOD datasets
  - Places: DBpedia, GeoNames, Riese, …
  - People: Freebase, LinkedMDB, DBLP, …
  - Music: DBpedia, MusicBrainz, Jamendo, …
- There are naturally lots of inconsistencies
  - DBpedia: Person born at date X. Freebase: Person born at date Y.
  - DBpedia: Band album X. MusicBrainz: Band album Y.
  - GeoNames: City has geo-coordinates Freebase: City has geo-coordinates

Inconsistency Resolution Strategies
- Pass it on: Pass conflicting values to the user and let them decide.
- Take the information: If a value is missing in dataset 1, use the value from dataset 2.
- Trust your friends: Prefer information from certain sources.
- Cry with the wolves: Choose the most common value.
- Meet in the middle: Take the average of all values.
- Keep up to date: Use the newest value.
SeeAlso: Bleiholder and Naumann: Conflict Handling Strategies in an Integrated Information System. WWW2006.
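Each strategy on this slide reduces to a small function over the conflicting candidates. The following sketch shows one possible reading of each; the function signatures, the candidate layout, and the sample population figures are assumptions made for illustration and are not taken from the cited paper.

```python
from collections import Counter
from statistics import mean

# Hypothetical conflicting values: (value, source, ISO date of the statement).
candidates = [
    (3_400_000, "dbpedia", "2008-06-01"),
    (3_400_000, "geonames", "2007-01-15"),
    (3_430_000, "freebase", "2008-09-30"),
]


def pass_it_on(candidates):
    """Return every conflicting value and leave the decision to the user."""
    return [value for value, _, _ in candidates]


def take_the_information(value_1, value_2):
    """Fall back to dataset 2 only when dataset 1 has no value."""
    return value_1 if value_1 is not None else value_2


def trust_your_friends(candidates, preferred_sources):
    """Return the value from the most preferred source that supplies one."""
    by_source = {source: value for value, source, _ in candidates}
    for source in preferred_sources:
        if source in by_source:
            return by_source[source]
    return None


def cry_with_the_wolves(candidates):
    """Return the most common value."""
    return Counter(value for value, _, _ in candidates).most_common(1)[0][0]


def meet_in_the_middle(candidates):
    """Return the average of all (numeric) values."""
    return mean(value for value, _, _ in candidates)


def keep_up_to_date(candidates):
    """Return the value with the newest date; ISO dates sort as strings."""
    return max(candidates, key=lambda candidate: candidate[2])[0]


print(cry_with_the_wolves(candidates), keep_up_to_date(candidates))
```

Which function fits depends on the property: averaging makes sense for measurements such as population, while for a birth date a source preference or majority vote is the safer choice.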

2.5 Explain Data Provenance and Fusion Steps
- Tim Berners-Lee's "Oh, yeah?" button.
- Existing work:
  - Deborah McGuinness et al.: Inference Web: Portable Explanations for the Web.
  - Chris Bizer: Web Information Quality Assessment Framework (WIQA)

Example WIQA Explanations

Outlook
- Lots of exciting open issues to solve! DIST-related technologies will be among the hot topics of the coming years (see, for instance, WWW2009).
- Important for LOD
  - Progress with publishing schema mappings on the Web
  - Progress with data fusion
  - Linked Data client applications that address all issues mentioned
- Please submit such solutions and client applications to
  - the Semantic Web Challenge 2009
  - the Linked Data on the Web (LDOW2009) workshop at WWW2009
  - the IJSWIS Special Issue on Linked Data

Thanks!
References
- Linking Open Data Project Wiki: LinkingOpenData
- Tutorial on How to Publish Linked Data on the Web