Georgi Kobilarov, Chris Bizer, Sören Auer, Jens Lehmann Freie Universität Berlin, Universität Leipzig.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
Creating Knowledge out of Interlinked Data Wissenserschliessung um Web Page 1 Vom Web der vernetzten Daten zum Web vernetzten.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Creating Linked Data Juan F. Sequeda Semantic Technology Conference June 2011.
DBpedia: A Nucleus for a Web of Open Data
Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Highs and Lows of Library Linked Data Adrian Stevenson UKOLN, University of Bath, UK (until end Dec 2011) Mimas, Libraries and Archives Team, University.
 Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Live Linked Open Sensor.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
The Web of Linked Data Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
1 Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies.
Data Sets, Vocabularies and Tools Pablo N. Mendes Freie Universität Berlin 1st year review Luxembourg, December /02/11.
Lecturer: Ghadah Aldehim
Networking Session: Global Information Structures for Science & Cultural Heritage - The Interoperability Challenge «INTEROPERABILITY FROM THE CULTURAL.
4th project meeting 27-29/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA agINFRA A data infrastructure for agriculture.
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
Semantic Technologies for Cultural Heritage Ongoing Projects at Ontotext Mariana Damova, PhD September, 2011.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
University of Sheffield, NLP Entity Linking Kalina Bontcheva © The University of Sheffield, This work is licensed under the Creative Commons.
Semantic Search: different meanings. Semantic search: different meanings Definition 1: Semantic search as the problem of searching documents beyond the.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
5 Quick ways to improve content value do cool stuff using Calais.
© Copyright 2008 STI INNSBRUCK Media Meets Semantic Web – How the BBC Uses DBpedia and Linked Data to Make Connections.
Semantic Web Applications GoodRelations BBC Artists BBC World Cup 2010 Website Emma Nherera.
EUscreen: Examining An Aggregator ’ s Role in Digital Preservation Samantha Losben Digital Preservation - Final Project December 15, 2010.
Samad Paydar WTLab Research Group Ferdowsi University of Mashhad An Introduction to Linked Data, Its Applications and Challanges.
SUMMON ® 2.0 DISCOVERY REINVENTED. What is Summon 2.0? A new, streamlined, modern interface New and enhanced features providing layers of contextual guidance.
Libraries at the Network Level: APIs, Linked Data, and Cloud Computing Roy Tennant OCLC Research rtennant on Twitter.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
NLPainter “Text Analysis for picture/movie generation” David Leoni Eduardo C á rdenas 12/01/2012.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
New RCLayout. Do product layout 3 improvements All products Local databases New functionalities.
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
A Short Tutorial to Semantic Media Wiki (SMW) [[date:: July 21, 2009 ]] At [[part of:: Web Science Summer Research Week ]] By [[has speaker:: Jie Bao ]]
GeoNames is … Gazetteer aggregator of open geo data I am... Marc Wick GeoNames.
Linked Data: Emblematic applications on Legacy Data in Libraries.
AGROVOC Thesaurus. 1980s: developed as multilingual structured thesaurus for agricultural terminology (“rice”) : parallel effort to express thesaurus.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Controlled Vocabulary Giri Palanisamy Eda C. Melendez-Colom Corinna Gries Duane Costa John Porter.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 A Sitemap extension to enable efficient interaction with large.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Semantic Publishing Benchmark Task Force Fourth TUC Meeting, Amsterdam, 03 April 2014.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
DBpedia - A Crystallization Point
OECD Expert Group on Statistical Data and Metadata Exchange (Geneva, May 2007) Update on technical standards, guidelines and tools Metadata Common.
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
Presenting Semantic Data Through “Instance Hubs” Using Authoritative URI Design Schemes Alexei Bulazel 1 ( ), Dominic Difranzo 1 (
KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association Institut AIFB – Angewandte Informatik.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
LoCloud Conference - Sharing local cultural heritage online with LoCloud services Microservices in LoCloud Walter Koch Gerda Koch
GBIF NODES Committee Meeting Copenhagen, Denmark 4 th October 2009 The GBIF Integrated Publishing Toolkit Alberto GONZÁLEZ-TALAVÁN Programme Officer for.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
The value of Structured Data Gilbane Boston Ole Gulbrandsen CTO Webnodes
INHA UNIVERSITY, KOREA Rainer Simon Austrian Institute of Technology.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
Linked Data Web that can be processed by machines
Data.gov: Web, Data Web, Social Data Web 7/22/2010 #health2stat.
Presented at Archives Records 2016, session 510
Traditional Linked DATA or Connect Your Data Your Way
DBpedia 2014 Liang Zheng 9.22.
5.00 Apply procedures to organize content by using Dreamweaver. (22%)
Linked Data Ryan McAlister.
Presentation transcript:

Georgi Kobilarov, Chris Bizer, Sören Auer, Jens Lehmann Freie Universität Berlin, Universität Leipzig

Querying Wikipedia like a Database

Title Description Languages Web Links Categorization Domain specific Data Images Infoboxes

Infobox Extraction dbpedia:Albert_Einstein p:name „Albert Einstein“ dbpedia:Albert_Einstein p:birth_place dbpedia:Ulm dbpedia:Albert_Einstein p:birth_date „ “

Property Synonyms

Structuring Wikipedia‘s Knowledge Structuring actual data, not modeling the world Bound to Wikipedia Templates, parsers handle template values based on rules (property splitting, merging, transformation)

DBpedia Ontology DBpedia Ontology build from scratch 170 classes, 900 properties

No living things

Class Hierarchy „Select all TV Episodes …“

Template Mapping Class TV Episode (Work) Wikipedia Templates: Television Episode UK Office Episode Simpsons Episode DoctorWhoBox

Template Mapping Infobox Cricketer Infobox Historic Cricketer Infobox Recent Cricketer Infobox Old Cricketer Infobox Cricketer Biography => Class Cricketer (Athlete)

People Actors Athlete Journalist MusicalArtist Politician Scientist Writer

Places Airport City Country Island Mountain River

Organisations Band Company Educational Institution Radio Station Sports Team

Event Convention Military Conflict Music Event Sport Event

Work Book Broadcast Film Software Television

More structured data Categories in SKOS Intra-wiki links Disambiguation Redirects Links to Images (and Flickr) Links to external webpages

Data about 2.6 million “things”

274 million pieces of information (RDF triples)

Multilingual Abstracts – English: 2,613,000 – German: 391,000 – French: 383,000 – Dutch: 284,000 – Polish: 256,000 – Italian: 286,000 – Spanish: 226,000 – Japanese: 199,000 – Portuguese: 246,000 – Swedish: 144,000 – Chinese: 101,000

DBpedia as Linked Data Hub

Semantic Web “My document can point at your document on the Web, but my database can't point at something in your database without writing special purpose code. The Semantic Web aims at fixing that.” Prof. James Hendler

Web of Documents Web Browsers Search Engines AB CD HTML hyper links HTML HTTP

Web of Data B C Thing data link A D E Thing Search Engines Linked Data Mashups Linked Data Browsers HTTP

Linked Data Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information. Include links to other URIs. so that they can discover more things. Wikipedia Article URI: DBpedia Resource URI

HTTP URIs Information Resources HTTP GET -> 200 OK Real-World Resources HTTP GET -> 303 See other -> 200 OK

Life Sciences Publications Online Activities Music Geographic Cross-Domain

4.5 billion triples 180 million data links

Use Cases

1.Data Source for Web-Applications 2.Querying Wikipedia like a database 3.Tag Web content with concepts instead of free-text tags 4.Vocabulary and semantic backbone for enterprise linked data integration

DBpedia as data source Embed structured information from Wikipedia into your web applications Build (mobile) maps applications using DBpedia data about places Display multilingual titles & descriptions in 15 languages

DBpedia Mobile

Sparql Endpoint

Wikipedia Query

Annotating Documents Use DBpedia concepts to annotate documents instead of free-text tags Named Entity Extraction Systems already use DBpedia URIs (OpenCalais, Muddy Boots) Social Bookmarking with DBpedia URIs as tags

„Apple“

Annotating Documents BBC editors tag news articles with DBpedia concepts DBpedia Lookup Service

Linking Enterprise Data Take the Linking Open Data approach to the enterprises

Connect data sets with DBpedia as shared vocabulary Enable meaningful navigation paths across BBC websites Browsing Madonna-related information across BBC News, BBC Music, BBC Programmes, … Make use of the rich background information: relate the release of a music album to a news article about the artist Linking Enterprise Data

The Future of DBpedia

Improve Information Extraction

Croud-source Information Extraction

Crowd Sourced Extraction Where‘s the user benefit?

Data Fusion

Cross-Language Data Fusion 264 Wikipedia Editions in different languages – Italian Wikipedians know more about Italian villages – German Wikipedia contains more person infoboxes Augment the infobox dataset with facts from other Wikipedia editions.

Augment DBpedia with External Data Linking Open Data cloud provides more data than Wikipedia – EuroStat provides additional statistical information about countries. – Musicbrainz contains additional information about other bands. – Geonames provides additional information about locations. Idea – Augment DBpedia with additional data from external sources.

Contribute back to Wikipedia Opportunity – Feed data back to Wikipedia Extend the Wikipedia authoring environment with – Suggestions for infobox values – Cross-language consistency checking for infoboxes Currently going on – New maps in Wikipedia based on Dbpedia Mobil Code (OpenStreetMap)

Contribute back to Wikipedia Initialize Wikipedia Clean-Up Cycles – Data-driven search interfaces expose the weaknesses of Wikipedia template system. – Preferred items not showing up in end-user interfaces may motivate Wikipedia editors to use templates more stringently.

Live Update Current Situation – DBpedia update cycle: 3 month – Wikipedia provides us with access to the live update stream Opportunity – Increase the currency of the DBpedia dataset using this update stream Result – DBpedia in synchronization with Wikipedia.

Open Source

Open Data

What is the Wikipedia for Data?

Wikipedia is the Wikipedia for Data

Summary