DBpedia - A Crystallization Point

Slides:



Advertisements
Similar presentations
Improving Human-Semantic Web Interaction: The Rhizomer Experience Roberto García and Rosa Gil GRIHO - Human Computer Interaction Research Group Universitat.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Creating Linked Data Juan F. Sequeda Semantic Technology Conference June 2011.
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
Linked Data for Libraries, Archives, Museums. Learning objectives Define the concept of linked data State 3 benefits of creating linked data and making.
DBpedia: A Nucleus for a Web of Open Data
Vocabulary Mapping Framework & Libraries Alan Danskin Metadata & Bibliographic Standards Coordinator.
Georgi Kobilarov, Chris Bizer, Sören Auer, Jens Lehmann Freie Universität Berlin, Universität Leipzig.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Actual Trends Semantic Web Lecture WS 2010/2011. What‘s next? W3C view: Look at Semantic Web activity:
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
The Web of Linked Data Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
Behshid Behkamal Ferdowsi University of Mashhad Web Technology Lab.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
JOSH FLECK Semantic Web. What is Semantic Web? Movement led by W3C that promotes common formats for data on the web Describes things in a way that computer.
Leveraging Names with Linked Data Karen Smith-Yoshimura Ralph LeVan 2010 RLG Partnership Annual Meeting Chicago, IL 9 June 2010.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
CSE 428 Semantic Web Topics Introduction Jeff Heflin Lehigh University.
Linked Open Data: a new resource for eResearch Dr Anne Cregan eResearch Analyst, Intersect and ANDS
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
Semantic Web author: Michał Dettlaff. Tim Berners-Lee director of W3C created the World Wide Web in 1990 proposed the idea of Semantic Web Tim Berners-Lee.
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
The Semantic Web Web Science Systems Development Spring 2015.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
The World Wide Web (abbreviated as WWW or W3 and commonly known as the Web) is a system of interlinked hypertext documents accessed via the Internet.
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
© Copyright 2008 STI INNSBRUCK Media Meets Semantic Web – How the BBC Uses DBpedia and Linked Data to Make Connections.
Publishing and Interacting with Linked Data Roberto Garcia, Josep Maria Brunetti, Antonio López-Muzás, Juan Manuel Gimeno, Rosa Gil WIMS’11 Conference,
© Copyright 2013 STI INNSBRUCK Linked Open Data Anna Fensel, Ioannis Stavrakantonakis,
Semantic Web Applications GoodRelations BBC Artists BBC World Cup 2010 Website Emma Nherera.
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
Semantic Web - an introduction By Daniel Wu (danielwujr)
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
Linked Data: Emblematic applications on Legacy Data in Libraries.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
Chapter 3 Querying RDF stores with SPARQL
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data Ying Ding et al. Jin Guang Zheng, Tetherless World Constellation.
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
Chapter 5 The Semantic Web 1. The Semantic Web  Initiated by Tim Berners-Lee, the inventor of the World Wide Web.  A common framework that allows data.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
SysML v2 Model Interoperability & Standard API Requirements Axel Reichwein Consultant, Koneksys December 10, 2015.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
The Registration Agency, DDI and Linked Open Data
Linked Data Web that can be processed by machines
Presented at Archives Records 2016, session 510
Linked (Open) Data Speaker: 呂瑞麟 國立中興大學資訊管理學系教授
Zachary Cleaver Semantic Web.
Cataloging the Internet
DBpedia 2014 Liang Zheng 9.22.
LOD reference architecture
W3C Recommendation 17 December 2013 徐江
Semantic MediaWiki BCHB697.
Linked Data Ryan McAlister.
Presentation transcript:

DBpedia - A Crystallization Point for the Web of Data 2011.10.05 Junghee - Han

Outline The DBpedia Project Understanding Linked Data The DBpedia Knowledge Extraction Framework The DBpedia Knowledge Base Accessing the DBpedia Knowledge Base Applications facilitated by DBpedia DBpedia - A Crystallization Point for the Web of Data

The DBpedia Project DBpedia 위키피디아로부터 구조화된 정보를 추출하고, 이를 웹에서 이용할 수 있도록 만들기 위한 커뮤니티 Dbpedia is a community effort to Extract structured information from Wikipedia Make this information available on the Web under an open license Interlink the DBpedia dataset with other open datasets on the Web DBpedia - A Crystallization Point for the Web of Data

The DBpedia Project DBpedia knowledge base Currently describes more than 2.6 million entities - 198,000 persons - 328,000 places - 101,000 musical works - 34,000 films - 20,000 companies. The knowledge base contains 3.1 million links to external web pages and 4.9 million RDF links into other Web data sources. DBpedia - A Crystallization Point for the Web of Data

Linked Data 참고:

Linked Data Web Browsers Search Engines HTTP HTTP 참고:

Linked Data RDF stands for RDF는 Graph Model을 갖고 있다. Resource : URI를 갖는 모든 것(웹페이지, 이미지, 동영상등) Description : 자원(Resource)들의 속성, 특성, 관계기술 Framework : 위의 것들을 기술하기 위한 모델, 언어, 문법 RDF는 Graph Model을 갖고 있다. 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

Linked Data Graph Model 예시 Triple 형식표현 RDF Syntax SPARQL(Simple Protocol and RDF Query Language) W3C에서 만든 RDF 질의 언어 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

2017-04-26 Linked Data 1. Use URI(Uniform Resource Identifier)s as names for things 2. Use HTTP URIs so that people can look up those names 3. When someone looks up a URI, provide useful RDF Information 4. Include RDF statements that link to other URIs so that they can discover related things Tim Berners-Lee 2007 http://www.w3.org/DesignIssues/LinkedData.html

Linked Data 1. Use URIs as names for things 2017-04-26 http://bibleontology.com/page/Bilhah 1. Use URIs as names for things 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

Linked Data 2. Use HTTP URIs so that people can look up those names 2017-04-26 Linked Data http://bibleontology.com/page/Bilhah 2. Use HTTP URIs so that people can look up those names 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

2017-04-26 Linked Data http://bibleontology.com/page/Bilhah 3. When someone looks up a URI, provide useful RDF Information 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

2017-04-26 Linked Data http:// http://bibleontology.com/page/Bilhah 4. Include RDF statements that link to other URIs so that they can discover related things 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

Linked Data 2017-04-26 HongGilDong [residences] Seoul [sameAs] http://dbpedia.org/ resource/Seoul http://sws.geonames.org/1835848/ http://sws.geonames.org/1835848/nearby.rdf [nearbyFeatures] [researches] [age] SemanticWeb [name] [hasPhotoCollection] http://dbpedia.org/ resource/Semantic_Web http://www4.wiwiss.fu-berlin.de/flickrwrappr/ photos/Semantic_Web Hong, Gil Dong 35 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

URI RDF SPARQL HTTP Linked Data 로 식별하고, Linking 하고, 로 표현하고, 로 질의하고, 2017-04-26 Linked Data URI RDF SPARQL HTTP 로 식별하고, Linking 하고, 로 표현하고, 로 질의하고, 로 유통하고, SQL SPARQL 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data

Linked Data 2017-04-26 민간 정보 해외 정보 국가 공공정보 16 TopQuadrant Korea Inc., 공간정보 여행정보 교통정보 부동산정보 문화재정보 문헌정보 토지정보 환경정보 XXX 정보 상품정보 일자리정보 단절된 국가 공공정보 공간정보 여행정보 교통정보 부동산정보 문화재정보 문헌정보 토지정보 환경정보 XXX 정보 상품정보 일자리정보 연결된 국가 공공정보 포털 및 언론 대학 기타 민간 정보 DBPedia BBC etc 해외 정보 여행정보 공간정보 문헌정보 환경정보 XXX정보 국가 공공정보 참고: [KSWC2010]데이터의 가치를 높이는 Linked Data 16 TopQuadrant Korea Inc.,

Wikipedia Content Domain specific Data Images Infoboxes Title Description Languages Web Links Categorization DBpedia - A Crystallization Point for the Web of Data

The DBpedia Knowledge Extraction Framework(1/2) Currently 19 extractors Labels(title,rdfs:label) Abstracts(first paragraph,rdfs:comment) Interlanguage links. Images. Redirects. Disambiguation(depedia:disambiguates) External links(dbpedia:reference) Page links(dbpedia:wikilink) Homepages(foaf:homepage) Geo-coordinates. Person data. PND. SKOS categories. Page ID. Revision ID. Category label. Article categories. Mappings. Infobox. Until March 2010, the DBpedia project was using a PHP-based extraction framework to extract different kinds of structured information from Wikipedia. This framework has been superseded by the new Scala-based extraction framework and the old PHP framework is not maintained anymore DBpedia - A Crystallization Point for the Web of Data

Two Work-Flows The DBpedia Knowledge Extraction Framework(2/2) Dump-based extraction The Wikimedia Foundation publishes SQL dumps of all Wikipedia editions on a monthly basis The dump-based workflow uses the DatabaseWikipedia page collection as the source of article texts and the N-Triples serializer as the output destination. Live extraction Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) DBpedia - A Crystallization Point for the Web of Data

Infobox Extraction dbpedia:BBC p:network_name „British Broadcasting Corporation (BBC)“ dbpedia:BBC p:country dbpedia:United_Kingdom dbpedia:BBC p:key_people dbpedia:Michael_Lyons dbpedia:Mark_Thompson DBpedia - A Crystallization Point for the Web of Data

The DBpedia Knowledge Base Identifying Entities Resources are assigned a URI according to the pattern http://dbpedia.org/resource/Name (where Name is taken from the URL of the source Wikipedia article, which has the form http://en.wikipedia.org/wiki/Name) Classifying Entities DBpedia entities are classified within four classification schemata in order to fulfill different application requirements. - Wikipedia Categories - YAGO - UMBEL(Upper Mapping and Binding Exchange Layer) - DBpedia Ontology Describing Entities Every DBpedia entity is described by a set of general properties DBpedia - A Crystallization Point for the Web of Data

Linked Data SPARQL Endpoint RDF Dumps Lookup Index Accessing the DBpedia Knowledge Base over the Web Linked Data DBpedia resource identifiers(ex: http://dbpedia.org/resource/Berlin) SPARQL Endpoint http://dbpedia.org/sparql RDF Dumps http://wiki.dbpedia.org/Downloads32 Lookup Index http://lookup.dbpedia.org/api/search.asmx DBpedia - A Crystallization Point for the Web of Data

Interlinked Web Content Currently contains 4.9 million outgoing RDF links DBpedia - A Crystallization Point for the Web of Data

Applications facilitated by Dbpedia(1/3) Browsing and Exploration DBpedia Mobile DBpedia - A Crystallization Point for the Web of Data

Applications facilitated by Dbpedia(2/3) Querying and Search DBpedia Query Builder . http://querybuilder.dbpedia.org DBpedia - A Crystallization Point for the Web of Data

Applications facilitated by Dbpedia(3/3) Querying and Search Relationship Finder . DBpedia - A Crystallization Point for the Web of Data

Conclusions and Future Work The resulting DBpedia knowledge base covers a wide range of different domains and connects entities across these domains. Future Work Cross-language infobox knowledge fusion - Derive an astonishingly detailed multi-domain knowledge base Wikipedia article augmentation - Develop a MediaWiki extension that augments Wikipedia articles with additional information as well as media items (pictures, audio) from these sources Wikipedia consistency checking - Improve the overall quality of Wikipedia DBpedia - A Crystallization Point for the Web of Data