Published by Deirdre Daniel. Modified over 6 years ago.
1
Linked Open Data: Current State and Future Trends
Martin Nečaský
Faculty of Mathematics and Physics, Charles University
2
Agenda
What is Open Data
Linked Open Data: principles, usage examples, research challenges
Open Data activities of OpenData.cz
Our contribution to Czech legislation
3
Open Data Definition
Open data is data that can be freely used, re-used and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike.
4
5 levels of Open Data
Source:
5
Public Sector Open Data?
National Statistics Food Inspections Environment Inspections ★★★ Geopolitical Regions ★★★ Business Registers Trade Inspections Public Sector Inspections ★★★ Code of Law ★★★
6
What is Linked Data?
Linked Open Data is a set of (technological) principles of publishing data on the Web.
★★★ ★★★★★
Check Actions
ID SUBJECT START
2012/33 Peněžní prostředky určené … 2012/11
2012/34 Účetní závěrka a finanční ...
Inspected entities
ENTITY ID DISTRICT ACTION
Ministry of Defence Prague 2012/33
Social Security Administration 6963 2012/34
7
1st Linked Data Principle
Use URIs as names for things.
Check Actions
ID SUBJECT START
2012/33 Peněžní prostředky určené … 2012/11
2012/34 Účetní závěrka a finanční ...
Inspected entities
ENTITY ID DISTRICT ACTION
Ministry of Defence Prague 2012/33
Social Security Administration 6963 2012/34
8
2nd Linked Data Principle
Use HTTP URIs so that people can look up those names. WWW HTTP GET "
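Looking up a name can be sketched with the Python standard library. This is only an illustration: the URI below is a hypothetical placeholder (the real one is truncated in the slide text), and asking for Turtle through the `Accept` header is one common way to request machine-readable data, not something the slide prescribes.

```python
from urllib.request import Request

# Hypothetical URI; the real one is truncated in the slide text.
uri = "http://example.org/check-action/2012-33"

# Ask for an RDF serialization (Turtle) instead of an HTML page.
request = Request(uri, headers={"Accept": "text/turtle"})

# Sending is left commented out because the URI above is only a placeholder.
# from urllib.request import urlopen
# with urlopen(request) as response:
#     print(response.read().decode("utf-8"))

print(request.get_method(), request.full_url, request.get_header("Accept"))
```

The same URI can serve a browser and a program; only the requested representation differs.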
9
3rd Linked Data Principle
When someone looks up a URI, provide useful information, using the W3C standards (RDF, SPARQL).
Check Actions
ID SUBJECT START
2012/33 Peněžní prostředky určené … 2012/11
Graph labels: nku:id "2012/33", nku:subject "Peněžní prostředky určené …", nku:start "2012/11"
RDF expression (Turtle), each statement read as subject predicate object:
< nku:id "2012/33" .
< nku:subject "Peněžní prostředky určené …" .
< nku:start "2012/11" .
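The subject, predicate, object structure can be sketched as plain Python tuples. The subject URI is a hypothetical placeholder because the real one is truncated in the slide text, and the `nku:` prefix is kept as an opaque name since its namespace is not given (real Turtle would also need an `@prefix` declaration for it).

```python
# Hypothetical URI; the real subject URI is truncated in the slide text.
ACTION = "http://example.org/check-action/2012-33"

# Each statement is one (subject, predicate, object) triple.
triples = [
    (ACTION, "nku:id", "2012/33"),
    (ACTION, "nku:subject", "Peněžní prostředky určené …"),
    (ACTION, "nku:start", "2012/11"),
]


def to_turtle(triples):
    """Write triples with literal objects as simple Turtle statements."""
    return "\n".join(f'<{s}> {p} "{o}" .' for s, p, o in triples)


turtle_text = to_turtle(triples)
print(turtle_text)
```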
10
3rd Linked Data Principle
When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL). HTTP GET SPARQL query SPARQL API (SPARQL endpoint) NKÚ RDF store HTTP SERVER HTTP GET "
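A request to a SPARQL endpoint might be built as follows. The endpoint URL is a hypothetical placeholder, since the slide text does not give the real one. The SPARQL Protocol carries the query text in the `query` parameter of an HTTP GET; a real query would also need a `PREFIX` declaration for `nku:`, which the slides do not show.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Hypothetical endpoint URL; the slide text does not give the real one.
ENDPOINT = "http://example.org/sparql"

query = 'SELECT ?x WHERE { ?x nku:start "2012/11" . }'

# The query text travels URL-encoded in the "query" parameter.
url = ENDPOINT + "?" + urlencode({"query": query})
print(url)

# Decoding the URL again recovers the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(decoded)
```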
11
SPARQL crash course
Similar to SQL.
Query expressed as a graph pattern.
SELECT <result specification> WHERE <graph pattern>
12
SPARQL crash course
Graph pattern consists of simple triple patterns.
SELECT ?x WHERE { ?x nku:start "2012/11" . }
Pattern as a graph: ?x nku:start "2012/11"
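How a single triple pattern is matched can be sketched in plain Python. This is not how a real SPARQL engine is implemented; it only illustrates the idea. The URIs are hypothetical placeholders, and the start value of action 2012/34 is made up because it is missing from the slide text.

```python
# Hypothetical URIs; the real ones are truncated in the slide text.
A33 = "http://example.org/check-action/2012-33"
A34 = "http://example.org/check-action/2012-34"

graph = {
    (A33, "nku:id", "2012/33"),
    (A33, "nku:start", "2012/11"),
    (A34, "nku:id", "2012/34"),
    (A34, "nku:start", "2012/12"),  # made-up value for illustration
}


def match(pattern, triple):
    """Return variable bindings if the triple fits the pattern, else None."""
    bindings = {}
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable binds to the value, consistently within one triple.
            if bindings.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return bindings


# ?x nku:start "2012/11"
pattern = ("?x", "nku:start", "2012/11")
results = [b for t in graph if (b := match(pattern, t)) is not None]
print(results)
```

Terms starting with `?` are variables; every other term must equal the corresponding part of the triple.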
13
SPARQL crash course
Graph pattern consists of simple triple patterns.
SELECT ?x ?y WHERE { ?x nku:start "2012/11" ; nku:subject ?y . }
Pattern as a graph: ?x nku:start "2012/11" ; ?x nku:subject ?y
14
SPARQL crash course
Graph pattern consists of simple triple patterns.
SELECT ?x ?y ?z WHERE { ?x nku:start "2012/11" ; ?z ?y . }
Pattern as a graph: ?x nku:start "2012/11" ; ?x ?z ?y
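Several triple patterns sharing a variable behave like a join. The sketch below evaluates such a pattern list in plain Python under the same caveats as any toy example: the URIs are hypothetical placeholders and this is not a real SPARQL engine. It also shows that a variable may stand in the predicate position.

```python
# Hypothetical URIs; the real ones are truncated in the slide text.
A33 = "http://example.org/check-action/2012-33"
A34 = "http://example.org/check-action/2012-34"

graph = [
    (A33, "nku:id", "2012/33"),
    (A33, "nku:subject", "Peněžní prostředky určené …"),
    (A33, "nku:start", "2012/11"),
    (A34, "nku:id", "2012/34"),
]


def match(pattern, triple, bindings):
    """Extend bindings so that the pattern equals the triple, or return None."""
    result = dict(bindings)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            if result.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return result


def evaluate(patterns, graph):
    """Evaluate triple patterns one by one, joining on shared variables."""
    solutions = [{}]
    for pattern in patterns:
        solutions = [
            extended
            for solution in solutions
            for triple in graph
            if (extended := match(pattern, triple, solution)) is not None
        ]
    return solutions


# WHERE { ?x nku:start "2012/11" ; ?z ?y . }
patterns = [("?x", "nku:start", "2012/11"), ("?x", "?z", "?y")]
rows = evaluate(patterns, graph)
for row in rows:
    print(row["?x"], row["?z"], row["?y"])
```

Replacing the second pattern with `("?x", "nku:subject", "?y")` gives the two-variable query from the previous slide.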
15
SPARQL crash course
Query may return a graph as well
CONSTRUCT { ?x ?z ?y . } WHERE { ?x nku:start "2012/11" ; ?z ?y . }
Pattern as a graph: ?x nku:start "2012/11" ; ?x ?z ?y
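The effect of this particular CONSTRUCT query can be sketched in plain Python: the WHERE part selects every `?x` that started in "2012/11", and the template copies all triples of those subjects into a new graph. The URIs are hypothetical placeholders, and the code handles only this one query shape.

```python
# Hypothetical URIs; the real ones are truncated in the slide text.
A33 = "http://example.org/check-action/2012-33"
A34 = "http://example.org/check-action/2012-34"

graph = {
    (A33, "nku:id", "2012/33"),
    (A33, "nku:subject", "Peněžní prostředky určené …"),
    (A33, "nku:start", "2012/11"),
    (A34, "nku:id", "2012/34"),
}

# WHERE { ?x nku:start "2012/11" ; ?z ?y . }
started = {s for s, p, o in graph if p == "nku:start" and o == "2012/11"}

# CONSTRUCT { ?x ?z ?y . }: one output triple per solution of the pattern.
constructed = {(s, p, o) for s, p, o in graph if s in started}

for triple in sorted(constructed):
    print(triple)
```

The result is itself a set of triples, so it can be stored or queried again like any other RDF graph.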
16
4th Linked Data Principle
Include links to other URIs so that others can discover more things.
< id "2012/33" ;
  subject "Peněžní prostředky určené …" ;
  start "2012/11" ;
  entity < .
< title "Ministry of Defense" ;
  district < .
< title "Prague" .
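Discovering more things by following links can be sketched as a walk over triples. All three URIs below are hypothetical placeholders for the ones truncated in the slide text; the predicate names are taken from the Turtle above.

```python
# Hypothetical URIs; the real ones are truncated in the slide text.
ACTION = "http://example.org/check-action/2012-33"
ENTITY = "http://example.org/entity/ministry-of-defence"
DISTRICT = "http://example.org/district/prague"

graph = [
    (ACTION, "id", "2012/33"),
    (ACTION, "start", "2012/11"),
    (ACTION, "entity", ENTITY),
    (ENTITY, "title", "Ministry of Defense"),
    (ENTITY, "district", DISTRICT),
    (DISTRICT, "title", "Prague"),
]


def follow(node, path, graph):
    """Follow a sequence of predicates from node; return the values reached."""
    frontier = {node}
    for predicate in path:
        frontier = {o for s, p, o in graph if s in frontier and p == predicate}
    return frontier


# From the check action to its inspected entity, its district, and its name.
district_names = follow(ACTION, ["entity", "district", "title"], graph)
print(district_names)
```

Because the objects are URIs rather than plain strings, each hop could equally be an HTTP lookup against another publisher's server.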
17
4th Linked Data Principle
Include links to other URIs so that others can discover more things (including URIs of other publishers). entity same as district district
18
4th Linked Data Principle
Include links to other URIs so that others can discover more things (including URIs of other publishers). Trade Inspection Gov Off Science and Research IS Business Entities Soc Sec Statistics Geopolitical Regions Public Sector Inspection Nat Stats Demography
19
LOD usage examples
20
Searching Datasets
Where can I get some data about entities inspected by the Supreme Audit Office (SAO)?
Diagram labels: SAO, linked.opendata.cz, Entity, Organizace (Organization) …, owl:sameAs
SPARQL: (try on endpoint)
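The idea behind this search can be sketched as a scan for owl:sameAs links. The dataset names are taken from the list on the next slide, but every URI and link below is made up for illustration; the real query runs as SPARQL on the endpoint.

```python
# Hypothetical URI of an entity inspected by SAO.
SAO_ENTITY = "http://example.org/sao/entity/1"

# Made-up link triples, grouped by the dataset that publishes them.
datasets = {
    "Business Register": [
        ("http://example.org/br/org/1", "owl:sameAs", SAO_ENTITY),
    ],
    "Trade Inspections": [
        ("http://example.org/ti/subject/9", "owl:sameAs",
         "http://example.org/sao/entity/2"),
    ],
    "Public Authorities": [
        (SAO_ENTITY, "owl:sameAs", "http://example.org/pa/office/7"),
    ],
}


def datasets_about(entity, datasets):
    """Names of datasets with an owl:sameAs link to or from the entity."""
    return sorted(
        name
        for name, triples in datasets.items()
        if any(p == "owl:sameAs" and entity in (s, o) for s, p, o in triples)
    )


found = datasets_about(SAO_ENTITY, datasets)
print(found)
```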
21
Searching Datasets
Public Agreements Registry - Agreements: 61961
Public Agreements Registry - Orders: 27726
Database of Science, Research and Innovations: 14286
Offices of Public Authorities: 763
Public Sector Inspections: 6254
Agendas of Public Institutions: 12112
Identification Numbers of Business Entities: 60520
Trade Register: 167376
Monitor of Public Budgets: 104522
Public Agreements Registry - Payments: 5516
Trade Inspections: 2576
Integrated Registry of Environmental Pollution: 6658
Public Authorities: 60007
Business Register: 94881
22
Combining datasets
Which public research institutions were inspected by SAO and what is their public research budget?
Diagram labels: Science and Research DB, SAO, linked.opendata.cz, Project, Entity, Participant, Budget, Premise, ResearchOrg, CheckAction, owl:sameAs
SPARQL: (try on endpoint)
RESULT IN CSV:
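Combining the two datasets amounts to a join through owl:sameAs links. The sketch below shows that join in plain Python; all identifiers and budget figures are made up for illustration and do not come from the real datasets.

```python
# All identifiers and figures below are made up for illustration.

# SAO dataset: inspected entity -> check action.
sao_inspected = {"sao:entity/1": "2012/33", "sao:entity/2": "2012/34"}

# Science and Research DB: research organization -> public research budget.
research_budget = {"rd:org/A": 1_500_000, "rd:org/B": 800_000}

# owl:sameAs links between the two datasets.
same_as = [("sao:entity/1", "rd:org/A")]

# Keep only entities present in both datasets, linked by owl:sameAs.
combined = []
for sao, rd in same_as:
    if sao in sao_inspected and rd in research_budget:
        combined.append((sao, sao_inspected[sao], research_budget[rd]))

for sao, action, budget in combined:
    print(sao, action, budget)
```

Without the links, the two datasets would share no common key to join on.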
23
Combining datasets
Sanctions for unfair trade practices in Czech regions and numbers of pensioners.
Diagram labels: Social Security, Trade Inspections, # pensioners, Inspection, RAMON EU, Region, NUTS, Sanction, Geopolitical regions, owl:sameAs
SPARQL: (try on endpoint, note: this federated query also asks and
RESULT IN CSV:
24
Building Applications
Each oval is a data source which either exists as LOD (MeSH, NDF-RT, NCI, DrugBank) or which we have converted to LOD. Links represent types of RDF links between datasets. LOD made our development much faster. RDF data updated periodically thanks to
25
Linked Open Data (LOD) Cloud
26
Knowledge Graphs as LOD
DBpedia
Wikipedia as LOD
402,086,316 triples about 17,315,785 entities
Wikidata
Emerging project of the Wikimedia Foundation
Structured data source for Wikipedia
1,373,105,652 triples about 24,437,040 entities
27
Two research challenges for near future
“A data journalist writes an article about unfair trade practices targeting elderly people in the Czech Republic. He needs to find datasets with evidence for his article (unfair trade inspections, numbers of elderly people, regions of the Czech Republic). He also needs to preview the discovered datasets, create map visualizations and embed them in his article.”
Challenge 1: Dataset discovery
Challenge 2: Dataset visualization
28
Dataset discovery
Input: User’s intent
How should the intent be expressed?
How can we assist the user in expressing the intent?
How should the expression of the intent be translated to a formal query language?
Output: Combinations of datasets which fulfill the intent
How should datasets be indexed?
How should the indexes be kept up-to-date?
How should the user’s intent be evaluated against the index?
29
Back to Open Data
OpenData.cz: a group of academics supporting and boosting (Linked) Open Data in the Czech public sector
We have assisted several public institutions with opening their data
Cooperation with ČSÚ, ČOI, MF ČR, MV ČR
30
Back to Open Data
Under the Ministry of the Interior of the Czech Republic, we have helped with:
making Open Data one of the major eGovernment topics
the position of National Coordinator for Open Data
the National Open Data Catalogue (
standards for open data publication and cataloging (
Open Data in Czech legislation
educating public institutions
the plan for a National Linked Open Data Infrastructure
31
Our Journey to Czech Open Data Legislation
October 2014: Open Data must be part of Czech legislation
Public bodies did not want to or could not open their data without legislation.
October 2016: The Czech president signed our amendment of the Public Sector Information Act (106/1999) introducing Open Data
Only data published according to given conditions can be called Open Data.
The Ministry of the Interior must provide the National Open Data Catalogue.
The Czech Government instructs ministries and national authorities to mandatorily publish given datasets as Open Data since maji-otevrit/
Defending our position
Ministry of Interior (Oct 2014 – Aug 2015)
Office of the Government (Sep 2015 – Mar 2016)
Parliament (Apr 2016 – Aug 2016)
Office of the Government (Oct 2016 – Dec 2016)
32
How can you help (Linked) Open Data?
Develop applications which use open data: bachelor or diploma theses, student software projects.
If you need some data, ask for it. You can ask OpenData.cz and we will try to help.
33
Thank you Martin Nečaský