To Boldly Go PC-Axis Reference Group, Copenhagen, 2014 Central Statistics Office, Cork, Ireland Kevin Healy, (00353 21) 453

Presentation transcript:

To Boldly Go
PC-Axis Reference Group, Copenhagen, 2014
Central Statistics Office, Cork, Ireland
Kevin Healy, ( ) 453
Eoin MacCuirc, ( ) 453

Linked Open Data

The Tower of Babel “If as one people speaking the same language they have begun to do this, then nothing they plan to do will be impossible for them. Come, let us go down and confuse their language so they will not understand each other.”

Tim Berners-Lee – Inventor of the Web
“In an extreme view, the world can be seen as only connections, nothing else. We think of a dictionary as the repository of meaning, but it defines words only in terms of other words. I liked the idea that a piece of information is really defined only by what it's related to, and how it's related. There really is little else to meaning. The structure is everything. There are billions of neurons in our brains, but what are neurons? Just cells. The brain has no knowledge until connections are made between neurons. All that we know, all that we are, comes from the way our neurons are connected.”

How open is the data? - Linked Open Data star scheme
Tim Berners-Lee suggested a 5-star deployment scheme for Linked Open Data, and Ed Summers provided a nice rendering of it. The example data used throughout is 'the temperature forecast for Galway, Ireland for the next 3 days':
★ make your stuff available on the Web (whatever format) under an open license
★★ make it available as structured data (e.g., Excel instead of image scan of a table)
★★★ use non-proprietary formats (e.g., CSV instead of Excel)
★★★★ use URIs to identify things, so that people can point at your stuff
★★★★★ link your data to other data to provide context
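As a rough illustration of the step from three stars to four and five stars, the sketch below turns a small CSV forecast into N-Triples: each row gets its own URI, and each row links to an external URI for Galway. The temperature values, the example.org URI space and the property names are hypothetical placeholders, not taken from the presentation; the DBpedia URI is used only as an example of an external link target.

```python
import csv
import io

# Hypothetical 3-star data: a CSV forecast for Galway (placeholder values).
CSV_TEXT = """day,max_temp_c
2014-10-01,15
2014-10-02,14
2014-10-03,16
"""

BASE = "http://example.org/forecast/galway/"   # hypothetical URI space for our data
VOCAB = "http://example.org/def/"              # hypothetical vocabulary
PLACE = "http://dbpedia.org/resource/Galway"   # external URI we link to (5th star)
XSD_INT = "http://www.w3.org/2001/XMLSchema#integer"


def csv_to_ntriples(text):
    """Return one N-Triples line per fact found in the CSV text."""
    lines = []
    for row in csv.DictReader(io.StringIO(text)):
        # 4 stars: every forecast day gets a URI people can point at.
        subject = f"<{BASE}{row['day']}>"
        temp = row["max_temp_c"]
        # 5 stars: link the forecast to data published by someone else.
        lines.append(f"{subject} <{VOCAB}place> <{PLACE}> .")
        lines.append(f'{subject} <{VOCAB}maxTempC> "{temp}"^^<{XSD_INT}> .')
    return lines


if __name__ == "__main__":
    for line in csv_to_ntriples(CSV_TEXT):
        print(line)
```

The point of the sketch is only that the data itself does not change between star levels; what changes is that rows become addressable and connected.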

Linked Open Data cloud
[Diagram; legend: Media, Government, Geo, Publications, User-generated, Life sciences, Cross-domain]

Linked Open Data - The Semantic Web

Copenhagen – 99,100,000 hits: looking for a needle in a haystack

URI – Uniform Resource Identifier
Give the thing a name and an address.
[Picture: the desired relationships between a resource and its representing documents]
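The picture is not preserved here, but one common way to relate a resource to its documents is the 303-redirect pattern described in the W3C note "Cool URIs for the Semantic Web": the thing has one URI, each document about it has its own URI, and a request for the thing is redirected to the document that best matches the client's Accept header. The sketch below models only that decision; all URIs are hypothetical and q-values in the Accept header are ignored for brevity.

```python
# Hypothetical URIs: one for the thing itself, one per document describing it.
RESOURCE = "http://example.org/id/copenhagen"
DOCUMENTS = {
    "text/html": "http://example.org/doc/copenhagen.html",
    "application/rdf+xml": "http://example.org/data/copenhagen.rdf",
}


def negotiate(resource_uri, accept_header):
    """Return (status, location) for a request to a resource URI."""
    if resource_uri != RESOURCE:
        return 404, None
    # Walk the Accept header in the order given; q-values are not evaluated.
    for item in accept_header.split(","):
        media_type = item.split(";")[0].strip()
        if media_type in DOCUMENTS:
            return 303, DOCUMENTS[media_type]
    # Nothing matched: fall back to the human-readable page.
    return 303, DOCUMENTS["text/html"]


if __name__ == "__main__":
    print(negotiate(RESOURCE, "application/rdf+xml"))
    print(negotiate(RESOURCE, "text/html,application/xhtml+xml;q=0.9"))
```

A browser asking for HTML and a data client asking for RDF both start from the same name for the thing and end up at different documents about it.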

Tim’s cool URIs
Cool URIs don't change.
What makes a cool URI? A cool URI is one which does not change.
What sorts of URI change? URIs don't change: people change them.
It is the duty of a Webmaster to allocate URIs which you will be able to stand by in 2 years, in 20 years, in 200 years. This needs thought, and organization, and commitment.

The Web of Things – The Internet of Things
The Internet of Things is coming, but it needs a semantic backbone to flourish. With some 25 billion devices expected to be connected to the Internet by 2015 and 50 billion by 2020, providing interoperability among the things on the IoT “is one of the most fundamental requirements to support object addressing, tracking, and discovery as well as information representation, storage, and exchange.”
So write the authors of Semantics for the Internet of Things: Early Progress and Back to the Future: Payam Barnaghi and Wei Wang, Centre for Communication Systems Research, University of Surrey, Guildford, UK, and Cory Henson, Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing.
“The suite of technologies developed in the Semantic Web … such as ontologies, semantic annotation, Linked Data and semantic Web services … can be used as principal solutions for the purpose of realizing the IoT,” they state. “Defining an ontology and using semantic descriptions for data will make it interoperable for users and stakeholders that share and use the same ontology.”

Where is the CSO with all this?
In partnership with DERI/NUIG/INSIGHT
One of the first NSIs in the world to upload census data as linked open data
– data.cso.ie – Census 2011
One of the organisations involved in the EU OpenCube pilot projects
Launched apps4gaps competition

data.cso.ie

Census – Linked Open Data
12 million RDF triples from Census
Geographical entities (counties, cities, etc.)
Codelists

CSO/NUIG collaboration summary position
Most technical work done by students/interns at NUIG
CSO supplied data, use cases, and expertise
Lots of manual work and ad-hoc solutions
Results not fully “owned” by CSO
Skills needed to maintain/extend are mostly in NUIG
(OpenCube kick-off meeting, November 2013)

OpenCube Project Pilots
OpenCube kick-off meeting, 18-19 November 2013

OpenCube Pilots

Pilot: DCLG
Focus: Publish
Tool/platform: Swirrl’s PublishMyData
Data sets: open datasets regarding finance, planning performance, land use, housing and homelessness
Type of users: Public servants (members of the DCLG statistical data management team) as well as statisticians/researchers
Number of users: 3-4 members of the data management team and 5 test users (statisticians, research analysts)
Evaluation cycle: 2 evaluation cycles: M9-M12 and M18-M21

Pilot: Flemish Gov
Focus: Publish/Reuse
Tool/platform: FluidOps’ IWB
Data sets: 1100 open datasets VRIND
Type of users: A varied audience ranging from public servants to data scientists
Number of users:
Evaluation cycle: evaluation cycles: M9-M12 and M18-M21

Pilot: Central Statistics Office
Focus: Publish/Reuse
Tool/platform: OpenCube toolkit
Data sets: 2011 Census dataset & StatBank dataset
Type of users: Public servants
Number of users: 25 employees
Evaluation cycle: 2 evaluation cycles: M9-M12 and M18-M21

OpenCube business case for the CSO
Publishing statistics from StatBank as linked data
Publishing statistics from StatBank as SDMX-ML
Facilitate the creation of general reports aimed at the general public
Assist with answering queries from the public
Help third parties to tell stories with CSO data

CSO goals (independent from OpenCube)
Own the data.cso.ie process and technology
– Enable in-house maintenance, changes, etc.
Publish StatBank* data as Linked Open Data
– Ongoing publication process
– Adhering to release schedule is critical
– Publish data that are regularly updated (monthly, quarterly, annual) as linked open data (Census 2011 is static data)
Deploy tools that enable analytics and exploitation of linked data
– Both internally and externally
*StatBank is the CSO published time series database (PC Axis)
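To make the "publish StatBank data as Linked Open Data" goal more concrete, here is a minimal sketch of the kind of conversion involved: reading a tiny PC-Axis style table and emitting one group of triples per cell. It is not the CSO or OpenCube tooling. The parser handles only a small subset of the PC-Axis keywords (STUB, HEADING, VALUES, DATA) and assumes no semicolons inside quoted values; the table contents, the example.org URIs and the property names are hypothetical placeholders.

```python
import itertools
import re

# Hypothetical PC-Axis fragment; the figures are placeholders, not CSO data.
PX_TEXT = '''
STUB="County";
HEADING="Year";
VALUES("County")="Cork","Galway";
VALUES("Year")="2006","2011";
DATA=
1 2
3 4;
'''

BASE = "http://example.org/statbank/demo/"  # hypothetical URI space


def parse_px(text):
    """Split 'KEYWORD=value;' statements into a dict (tiny PC-Axis subset)."""
    meta = {}
    for statement in text.split(";"):
        statement = statement.strip()
        if statement:
            key, _, value = statement.partition("=")
            meta[key.strip()] = value.strip()
    return meta


def quoted(value):
    """Return the list of double-quoted strings in a PC-Axis value."""
    return re.findall(r'"([^"]*)"', value)


def observations(meta):
    """Yield (dimension dict, number) per cell; the last dimension varies fastest."""
    dims = quoted(meta["STUB"]) + quoted(meta["HEADING"])
    codes = [quoted(meta[f'VALUES("{d}")']) for d in dims]
    cells = meta["DATA"].split()
    for combo, cell in zip(itertools.product(*codes), cells):
        yield dict(zip(dims, combo)), float(cell)


def to_ntriples(meta):
    """Emit N-Triples-style lines: one subject per observation."""
    lines = []
    for i, (dims, value) in enumerate(observations(meta)):
        subject = f"<{BASE}obs/{i}>"
        for name, code in dims.items():
            lines.append(f'{subject} <{BASE}dimension/{name.lower()}> "{code}" .')
        lines.append(f'{subject} <{BASE}measure/value> "{value}" .')
    return lines


if __name__ == "__main__":
    print("\n".join(to_ntriples(parse_px(PX_TEXT))))
```

Because regularly updated tables are re-released on a schedule, a conversion like this would need to run as part of each release rather than as a one-off export, which is the contrast with the static Census 2011 data drawn above.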

The Role of the CSO in the Future of Linked Data in Ireland
As the technology trends that drive adoption of Linked Data continue, and the importance of Open Data increases, the CSO is well-positioned to play a leading role as a “hub” in the Irish data Web. Some key steps include:
1. Proactively encourage the adoption of standard classifications and metadata for Open Data published by different public bodies within Ireland. The CSO is already documenting classifications on its StatCentral (Portal) website, and has more experience in disseminating data on the Web than perhaps any other organisation in the public sector. Ideally, the classifications themselves would be published as Linked Data.
2. Going beyond pure classifications, encourage the use of standard identifiers (URIs) for geographical areas.
3. Support Linked Data as a new dissemination format for the CSO StatBank. Key economic and demographic statistics are necessary in all sorts of data analysis tasks, and ideally they should be published as Linked Data directly by the source (CSO).

Application Programming Interface (API)

StatBank API

StatBank API – by theme

StatBank API – Download

Key Indicators, quick tables and multi-quick tables

Key Economic Indicators

Quicktables

Multi-quicktables

Public Sector Statistics Network (PSSN)

PSSN – Organisations hosted

OGP as a driver

data.gov.ie – Irish OGP portal

Context and Impact Indicators

CSO - Context and Impact
Three values per row, one per reporting year; the year labels are missing from the transcript.

Printed output
No. of releases and publications:

Online output – CSO website
Visits: 2,387,000 | 2,303,441 | 2,718,287
Page views: 10,070,000 | 13,997,031 | 17,034,035
Downloaded files: 1,539,000 | 1,733,833 | 1,856,176
StatBank table accesses: 400,400 | 1,042,750 | 1,282,674

Online output – StatCentral site
Visits: 131,400 | 158,117 | 179,527
Page views: 300,200 | 418,564 | 451,788

Publication of statistics on social media
Followers (at year-end): 3,030 | 5,644 | 8,548

Burden Reduction
Annual reduction in statistical burden on business: -28% | -4.7% | n/a

Questions?