1
Michalis Vafopoulos NTUA & Data in a nutshell Open data workshop, Lebanon 23/5/2014
Welcome to the data era
5
6
The era of Open budgets, spending, registries, contracting… 7
The Transparency program in GreeceThe Transparency program in Greece ( ) o A revolution in open government o ex-ante reporting of every state decision o paradigm shift for 40K public servants 8
The Transparency program in Greece o manifests the value of procrastination principle (again) o strong rival to the Clientelistic state o The new version under beta testing (delivery: in 10 days!) 9
publicspending.net 2011: I believed that the Transparency program is the open data “gold” (& persuaded 7 more people) 10
publicspending.net 2012: …with some dust and rocks in a deep goldmine 11
2013: time to chisel some jewelry 2014: open data everywhere 12
Indexing, searching, global comparisons
Where the money goes in Greece
Data: Open, big, linked Open: access …everyone to use and republish as she wishes Big: scale high volume, velocity and variety Linked: use Publish once, use as many times
Is it working? Current Employee Names, Salaries, and Position Titles Current Employee Names, Salaries, and Position Titles The Open Database Of The Corporate World The Open Database Of The Corporate World Crime map NHS efficiency savings: the role of prescribing analytics NHS efficiency savings: the role of prescribing analytics where public money goes worldwide
Examples Can you find the famous persons born in Beirut before 1900? In Paris, Athens, … ?
Examples
Examples
Why Open Data o more & better information o objective and processable information for economic/political “dialogue” to promote competition to decrease cost to judge the efficiency of policy mixtures to enable participation 24
(initial) scope OGD to provide an objective & intermediate layer of information that will enable citizens, journalists, business people and politicians to re- discover their own “stories” from data. 25
LOD in Greece: why it is important quality of information during economic crisis transparency & efficiency in funding development 26
Issues o how can we initiate the virtuous cycle of creation? demonstrate LOD’s added value o how to get the most out of data? local & global interconnections 27
In few words, Apps, Apps, Apps….. 28
Public Spending in Greece & worldwide publicspending.net the first LOD App in Greece daily updates open spending linked data, endpoint & visualizations 29
Insights in Global Public Spending
Open but Effective? o Who really gets the public money? o For what? From whom? o Can we compare them? o Is public spending effective? o 31
Useful economic open data 1.The full cycle of public money 2.Uniform Company names 3.Compatible Payment categories 32
1. The full cycle of public money 33 Prices
Follow Public Money all the Way Vocabulary (fpm) o A compact and minimal way to model the flows of public money o From budget to spending including business information and prices 34
Useful economic open data 1.The full cycle of public money 2.Uniform Company names 3.Compatible Payment categories 35
2. Not uniform Company names 36 The problem: different names for the same company “Oracle” in the Australian public spending
Reconciling Company names: the CORFU technique Rodríguez, Jose María Álvarez, Ordoñez de Pablos, Patricia, Vafopoulos, Michalis N. and Labra, José Emilio
3. Compatible Payment categories The problem: Spending decisions are using different (or not any!) classification schemes (e.g. CPV, UNSSC, NAICS ) 38
Compatible Payment categories Transforming classification schemes or literal descriptions to CPV, expanding: The MOLDEAS projectMOLDEAS project Methods On Linked Data for E-procurement Applying Semantics 39
40
Reconciling Company names: the Forbes Global 2000 companies
Compatible Payment categories
Going global: AUSTRALIA 43
the 5 stars of open linked data ★ make your stuff available on the Web (whatever format) ★★ make it available as structured data (e.g. excel instead of image scan of a table) ★★★ non-proprietary format (e.g. csv instead of excel) ★★★★ use URLs to identify things, so that people can point at your stuff ★★★★★ link your data to other people’s data to provide context
Linked data = internet + http + RDF
Linked Data Principles 1.Use URIs as names for things 2.Use URIs so that people can look up (dereference) those names. 3.When someone looks up a URI, provide useful information. 4.Include links to other URIs so that they can discover more things.
Web as a database Linked Data makes the web exploitable as ONE GIANT HUGE GLOBAL DATABASE! Is there any query language like sql? SPARQL…
What are we planning? LOD for the main economic activities (insurance, banking) Law for open data by default in ALL public organisations Open data education (open generation)
References o Vafopoulos, Michalis N., Rodríguez, Jose María Álvarez, Meimaris, Marios, Xidias, Ioannis, Klonaras, Michailis and Vafeiadis, Giorgos, Insights in Global Public Spending (May 12, 2013). Available at SSRN: or Vafopoulos, Michalis N., Rodríguez, Jose María Álvarez, Meimaris, Marios, Xidias, Ioannis, Klonaras, Michailis and Vafeiadis, Giorgos, Insights in Global Public Spending (May 12, 2013). Available at SSRN: or o Vafopoulos, Michalis N., The Web Economy: Goods, Users, Models, and Policies (July 26, 2012). Michalis Vafopoulos (2012) "The Web Economy: Goods, Users, Models, and Policies", Foundations and Trends® in Web Science: Vol. 3: No 1-2, pp Available at SSRN: Vafopoulos, Michalis N., The Web Economy: Goods, Users, Models, and Policies (July 26, 2012). Michalis Vafopoulos (2012) "The Web Economy: Goods, Users, Models, and Policies", Foundations and Trends® in Web Science: Vol. 3: No 1-2, pp Available at SSRN: o ALVAREZ, J. and LABRA, J Towards a pan-european e-procurement platform to aggregate, publish and search public procurement notices powered by Linked Open Data: the MOLDEAS approach. International Journal of Software Engineering and Knowledge Engineering. 22, 3 (2012), 365–383. ALVAREZ, J. and LABRA, J Towards a pan-european e-procurement platform to aggregate, publish and search public procurement notices powered by Linked Open Data: the MOLDEAS approach. International Journal of Software Engineering and Knowledge Engineering. 22, 3 (2012), 365–383 49
More info