Linked Open Data Current State and Future Trends Martin Nečaský Faculty of Mathematics and Physics Charles University.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Czech approach to Regulatory Impact Assessment Prof. Michal Mejstřík Chairman of Regulatory Impact Assessment Board (RIAB) of the Czech Government Legislative.
Library Automation Overview of Results January 24 th 2006 Jomo Kenyatta Memorial Library.
Semantic Web Introduction
Rerport on the use geographical information in the Statistical Office Slovak Republic October, 2001 Zuzana Podmanická, Ivan Masaryk.
Public Procurement Authority of Montenegro 8th Regional Public Procurement Forum -Electronic Procurement - a big step towards transparency et efficiency.
Data Intensive Techniques to Boost the Real-time Performance of Global Agricultural Data Infrastructures SEMAGROW U SING A POWDER T RIPLE S TORE FOR BOOSTING.
Reducing the Reporting Burden in the Regulatory Environment XBRL Reports from SME’s to the Mercantile Registers Iñaki Vázquez June 25th 2009.
Data Sets, Vocabularies and Tools Pablo N. Mendes Freie Universität Berlin 1st year review Luxembourg, December /02/11.
The SADI plug-in to the IO Informatics’ Knowledge Explorer...a quick explanation of how we “boot-strap” semantics...
CZSO Business Register in the Czech Statistical Office Prepared by: Jan Matejcek CZSO, Prague, Czech Republic
Michalis Vafopoulos NTUA, GFOSS & The transformers GREEN CITY HACKATHON.
Review on development of SDI as a basis of E-government in Croatia Ivan Landek, assistant director State Geodetic Administration of RoC International Workshop.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
The Semantic Web Web Science Systems Development Spring 2015.
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
Regional Innovation in Central Europe: Brno – Center of Education and Innovation (– ) Ivo Šanc: Support for research and development in the Czech.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
National Statistical Committee of the Kyrgyz Republic Science, technology and innovation statistics in the Kyrgyz Republic Training workshop for ECO countries.
Serving society Stimulating innovation Supporting legislation Workshop on the INSPIRE registry and registers Martin Tuchyňa, Tomáš.
NREL is a national laboratory of the U.S. Department of Energy, Office of Energy Efficiency and Renewable Energy, operated by the Alliance for Sustainable.
Reform and Modernization of Russian Statistics. New Challenges in Data Collection and Compilation International Seminar on Modernizing Official Statistics:
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
ITGS Databases.
UNIVERSITY OF ZAGREB FACULTY OF GEODESY V. Cetl, M. Lapaine NSDI in Croatia (NUTS level 1)
INFORMATION SYSTEM FOR SUPPORT OF REGIONAL DEVELOPMENT (INFOREG) IN THE SLOVAK REPUBLIC INFOSTAT, Bratislava, Slovakia Prepared by Lenka Priehradnikova,
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Linked Open Library Bielefeld Conference, Dr. Silke Schomburg.
October 09-11, 2008euroCRIS Membership Meeting, Moscow 1 HunCRIS an Example on Project Information Systems Adam Tichy-Racs National Technical Information.
How to start with the implementation of IPPC Directive Czech Republic Czech Environmetal Inspectorate.
Toward a framework for statistical data integration Ba-Lam Do, Peb Ruswono Aryan, Tuan-Dat Trinh, Peter Wetz, Elmar Kiesling, A Min Tjoa Linked Data Lab,
Web Services Martin Nečaský, Ph.D. Faculty of Mathematics and Physics Charles University in Prague, Czech Republic Summer 2014.
Linked Open Data Martin Nečaský Faculty of Mathematics and Physics, Charles University in Prague.
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
Tiziana // Alessandra Lenzi - MG Breaking down the walls Project Museo Galileo and the Linked Open Data A joint project between.
DBpedia - A Crystallization Point
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
CZECH STATISTICAL OFFICE Na padesátém 81, CZ Praha 10, Czech Republic Business Register in the Czech Statistical Office =DISSEMINATION.
Renovation of Eurostat dissemination chain
EU Cohesion Policy and its implementation in the Czech Republic Zuzana Kasáková Department of W European Studies Charles University in Prague.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
Intellectual Property Teaching in the Czech Republic Ladislav Jakl Professor, Metropolitan University Prague IP Teaching Roundtable, Bucharest November.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
The Registration Agency, DDI and Linked Open Data
Digital Media Technology
Principles and definitions of conducting Agriculture Census in Armenia
BIBFLOW Project Update
Data.gov: Web, Data Web, Social Data Web 7/22/2010 #health2stat.
Wikidata How to build SPARQL queries Repo Fringe 2017
Middleware independent Information Service
CUAHSI HIS Sharing hydrologic data
Eurostat activities update
The system S.INTE.S.I.S.-Establishments
DIRECT – DIsaster REsilient Communities and Towns
LOD reference architecture
Public Administration in the Czech Republic
Laur Mägi Department of Information Systems and Document Management
Hungarian Association of NGOs for Development and Humanitarian Aid
Item 2.2 of the Agenda Remote access to confidential data for researchers: possible actions under the 7th Framework Programme Pascal JACQUES Unit B 5 15.
Ðì SA Effective Monitoring and Evaluation of Progress on the SDGs Monitoring SDGs : the perspective of Armstat Learning Conference: Implementing.
NTTS 2019 Conference / Brussels / Belgium
Input for ad hoc on software update on 7th Dec. from Japan
Input for ad hoc on software update on 7th Dec. from Japan
Linked Data Ryan McAlister.
PUBLIC PROCUREMENTS IN THE REPUBLIC OF SERBIA
GISCO Working Party Mirosław Migacz Chief GIS Specialist
Presentation transcript:

Linked Open Data Current State and Future Trends Martin Nečaský Faculty of Mathematics and Physics Charles University

Agenda What is Open Data Linked Open Data principles usage examples research challenges Open Data activities of OpenData.cz Our contribution to Czech legislation

Open Data Definition Open data is data that can be freely used, re-used and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike.

5 levels of Open Data Zdroj : http://5stardata.info

Public Sector Open Data? National Statistics http://www.czso.cz http://www.potravinynapranyri.cz/ Food Inspections Environment Inspections http://www.cizp.cz ★★★ Geopolitical Regions http://www.cuzk.cz ★★★ Business Registers http://www.mfcr.cz Trade Inspections http://www.coi.cz http://data.nku.cz Public Sector Inspections ★★★ Code of Law http://portal.gov.cz ★★★

? What is Linked Data? ★★★ ★★★★★ Check Actions Inspected entities ID SUBJECT START 2012/33 Peněžní prostředky určené … 2012/11 2012/34 Účetní závěrka a finanční ... ? Inspected entities Linked Open Data is a set of (technological) principles of publishing data on the Web. ENTITY ID DISTRICT ACTION Ministry of Defence 60162694 Prague 2012/33 Social Security Administration 6963 2012/34

1st Linked Data Principle Use URIs as names for things. Check Actions http://data.nku.cz/action/2012/33 ID SUBJECT START 2012/33 Peněžní prostředky určené … 2012/11 2012/34 Účetní závěrka a finanční ... http://data.nku.cz/action/2012/34 Inspected entities http://data.nku.cz/entity/60162694 ENTITY ID DISTRICT ACTION Ministry of Defence 60162694 Prague 2012/33 Social Security Administration 6963 2012/34 http://data.nku.cz/entity/6963 http://data.nku.cz/district/prague

2nd Linked Data Principle Use HTTP URIs so that people can look up those names. WWW HTTP GET "http://data.nku.cz/action/2012/33 http://data.nku.cz/action/2012/33

3rd Linked Data Principle When someone looks up a URI, provide useful information, using the W3C standards (RDF, SPARQL). Check Actions ID SUBJECT START 2012/33 Peněžní prostředky určené … 2012/11 http://data.nku.cz/action/2012/33 "Peněžní prostředky určené …" nku:start "2012/33" nku:id "2012/11" nku:subject RDF expression (Turtle) <http://data.nku.cz/action/2012/33> nku:id "2012/33" . <http://data.nku.cz/action/2012/33> nku:ubject "Peněžní prostředky určené …" . <http://data.nku.cz/action/2012/33> nku:start "2012/11" . subject predicate object

3rd Linked Data Principle When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL). HTTP GET SPARQL query SPARQL API (SPARQL endpoint) NKÚ RDF store HTTP SERVER HTTP GET "http://data.nku.cz/...

SPARQL crash course Similar to SQL. Query expressed as a graph pattern. SELECT <result specification> WHERE <graph pattern>

SPARQL crash course Graph pattern consists of simple triple patterns. SELECT ?x WHERE { ?x nku:start "2012/11" . } ?x nku:start "2012/11"

SPARQL crash course Graph pattern consists of simple triple patterns. SELECT ?x ?y WHERE { ?x nku:start "2012/11" ; nku:subject ?y . } "2012/11" nku:start ?x nku:subject ?y

SPARQL crash course Graph pattern consists of simple triple patterns. SELECT ?x ?y ?z WHERE { ?x nku:start "2012/11" ; ?z ?y . } "2012/11" nku:start ?x ?z ?y

SPARQL crash course Query may return a graph as well CONSTRUCT { ?x ?z ?y . } WHERE { ?x nku:start "2012/11" ; ?z ?y . } "2012/11" nku:start ?x ?z ?y

4th Linked Data Principle Include links to other URIs so that others can discover more things. "2012/33" "2012/11" <http://data.nku.cz/action/2012/33> id "2012/33" ; subject "Peněžní prostředky určené …" ; start "2012/11" ; entity <http://data.nku.cz/entity/60162694> . id start http://data.nku.cz/action/2012/33 subject entity "Peněžní prostředky určené …" <http://data.nku.cz/entity/60162694> title “Ministry of Defense" ; district <http://data.nku.cz/district/prague> . http://data.nku.cz/entity/60162694 “Ministry of Defense" district <http://data.nku.cz/district/prague> title "Prague". http://data.nku.cz/district/prague "Prague"

4th Linked Data Principle Include links to other URIs so that others can discover more things (including URIs of other publishers). http://data.nku.cz/action/2012/33 http://data.mfcr.cz/ares/entity/60162694 entity same as http://data.nku.cz/entity/60162694 district district http://data.cuzk.cz/ruian/district/3100 http://data.nku.cz/district/prague

4th Linked Data Principle Include links to other URIs so that others can discover more things (including URIs of other publishers). Trade Inspection Gov Off Science and Research IS Business Entities Soc Sec Statistics Geopolitical Regions Public Sector Inspection Nat Stats Demography

LOD usage examples http://linked.opendata.cz/sparql http://ruian.linked.opendata.cz/sparql http://data.cssz.cz/sparql

Searching Datasets Where can I get some data about entities inspected by Supreme Audit Office (SAO)? SAO linked.opendata.cz ? owl:sameAs owl:sameAs Entity Organizace … owl:sameAs SPARQL : https://drive.google.com/open?id=0BwP-TfUUfcFTR0VYd3ZJaTJub3c (try on http://linked.opendata.cz/sparql endpoint)

Searching Datasets Public Agreements Registry - Agreements 61961 Registr Agreements Registry - Orders 27726 Database of Science, Research and Innovations 14286 Offices of Public Authorities 763 Public Sector Inspections 6254 Agendas of Public Institutions 12112 Identification Numbers of Business Entities 60520 Trade Register 167376 Monitor of Public Budgets 104522 Registr Agreements Registry - Payments 5516 Trade Inspections 2576 Integrated Registry of Environmental Pollution 6658 Public Authorities 60007 Business Register 94881

Science and Research DB Combining datasets Which public research institutions were inspected by SAO and what is their public research budget? Science and Research DB SAO linked.opendata.cz Project Entity owl:sameAs Entity Participant Budget Premise owl:sameAs ResearchOrg CheckAction SPARQL : https://drive.google.com/open?id=0BwP-TfUUfcFTS2VjV1puakdIeG8 (try on http://linked.opendata.cz/sparql endpoint) RESULT IN CSV : https://drive.google.com/open?id=0BwP-TfUUfcFTN2NzVlk4Zk1ncGM

Combining datasets Sanctions for unfair trade practices in Czech regions and numbers of pensioners. Social Security Trade Inspections # pensioners Inspection RAMON EU owl:sameAs Region NUTS Sanction owl:sameAs Geopolitical regions owl:sameAs Region SPARQL : https://drive.google.com/open?id=0BwP-TfUUfcFTQzBGZzdwYzFuTUE (try on http://linked.opendata.cz/sparql endpoint, note : this federated query also asks http://ruian.linked.opendata.cz/sparql and http://data.cssz.cz/sparql) RESULT IN CSV : https://drive.google.com/open?id=0BwP-TfUUfcFTaEVxNF84NlUwTTg

Building Applications http://lekovaencyklopedie.cz Each oval is a data source which exists (MeSH, NDF-RT, NCI, DrugBank) as LOD or we have converted it to LOD. Links represent types of RDF links between datasets. LOD made us much faster in the development. RDF data updated periodically thanks to http://etl.linkedpipes.com

Linked Open Data (LOD) Cloud

Knowledge Graphs as LOD DBPedia Wikipedia as LOD http://dbpedia.org/sparql 402,086,316 triples about 17,315,785 entities Wikidata Emerging project of Wikimedia Foundation Structured data source for Wikipedia https://query.wikidata.org 1,373,105,652 triples about 24,437,040 entities

Two research challenges for near future “A data journalist writes an article about unfair trade practices on elderly people in Czech Republic. He needs to find datasets with an evidence for his article (unfair trade inspections, elderly people numbers, regions in Czech Republic). He also needs to preview the discovered datasets, create map visualizations and embed them to his article.” Challenge 1: Dataset discovery Challenge 2: Dataset visualization

Dataset discovery Input : User’s intent How the intent should be expressed? How we can assist the user when expressing the intent? How the expression of the intent should be translated to a formal query language? Output : Combinations of datasets which fulfill the intent How datasets should be indexed? How the indexes should be kept up-to-date? How the user’s intent should be evaluated against the index?

Back to Open Data OpenData.cz – a group of academicians supporting and boosting (Linked) Open Data in Czech public sector We have assisted several public institutions with opening their data http://data.ctu.cz http://data.nku.cz http://data.cssz.cz http://data.gov.cz Cooperation with ČSÚ, ČOI, MF ČR, MV ČR

Back to Open Data Under Ministry of Interior of Czech Republic, we have helped with making Open Data as one of the major eGovernment topics position of National Coordinator for Open Data National Open Data Catalogue (http://data.gov.cz) Standards for open data publication and cataloging (http://opendata.gov.cz) Open Data in Czech legislation Educating public institutions Plan for National Linked Open Data Infrastructure

Our Journey to Czech Open Data Legislation October 2014 : Open Data must be part of Czech legislation Public bodies did not want to or could not open their data without legislation. October 2016 : The Czech president signed our amendment of Public Sector Information Act (106/1999) introducing Open Data Only data published according to given conditions can be called Open Data. Ministry of Interior must provide National Open Data Catalogue Czech Government instructs ministries and national authorities to mandatorily publish given datasets as Open Data since 1.1.2017. http://www.lupa.cz/aktuality/jizdni-rady-ares-ci-volna-mista-vlada-naridila-uradum-ktera-data- maji-otevrit/ Defending our position Ministry of Interior (Oct 2014 – Aug 2015) Office of the Government (Sep 2015 – Mar 2016) Parliament (Apr 2016 – Aug 2016) Office of the Government (Oct 2016 – Dec 2016)

How you can help (Linked) Open Data? Develop applications which use open data. bachelor or diploma theses, student software projects If you need some data, ask for them. You can ask OpenData.cz and we will try to help

Thank you Martin Nečaský necasky@ksi.mff.cuni.cz