Conference on New Technologies for official Statistics

Slides:



Advertisements
Similar presentations
Natale Renato Fazio, Stefano Menghinello, Carmela Pascucci and Carla Sciullo Foreign trade and multinational enterprises statistics Division ISTAT ITALY.
Advertisements

The Business Register Research, Design and Evaluation Division Statistical Institute of Jamaica.
Eurostat FDI by ultimate host and ultimate investing country European Commission – Eurostat Directorate G: Global business statistics.
Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.
Changes in the Structure of the Business Population Business Demography Jillian Delaney 29 September 2011.
Statistics on enterprise groups – the EGR potential European Commission – Eurostat Directorate G: Global business statistics.
Trade and business statistics: use of administrative data Lunch Seminar Enrico Giovannini Italian National Statistical Institute (ISTAT) New York, February,
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Census Census of Population, Housing,Buildings,Establishments and Agriculture Huda Ebrahim Al Shrooqi Central Informatics Organization.
Electronic reporting in Poland 27th Voorburg Group Meeting Warsaw, Poland October 1st to October 5th, 2012 Central Statistical Office of Poland.
Country Experiences in Identifying Enterprise Groups and Incorporating them into Statistical Business Registers Workshop on Business Registers and their.
CZSO Business Register in the Czech Statistical Office Prepared by: Jan Matejcek CZSO, Prague, Czech Republic
Eurostat European profiling: a crucial tool in the current European developments on statistical units D. FRANCOZ Eurostat J OINT UNECE/OECD/E UROSTAT MEETING.
Eurostat Q2014 – Session 35 Quality assurance for Business Statistics in Europe through the ESS.VIP.ESBRs project D. Francoz Eurostat.
The impact of globalisation on defining the NACE code Nordisk Statistikermøde i København august 2010 Tema 1. Udvikling af statistikken: Globalisering.
MEE TSMEETS Modernisation of European Enterprise and Trade Statistics - MEETS and EGR Q2010 Conference in Quality Helsinki, May 2010 Eduardo Barredo Capelot.
Item III.2 Frame population EGR frame methodology Barry Coenen, Statistics Netherlands MEETS Conference June 2014.
Statistics Portugal Methodology and Information Systems Department Information Infrastructure Unit Isabel Farinha and Jorge Magalhães « 21th Meeting of.
Geneva – 18/19 June 2007 EuroGroups Register A pilot study for Eurostat to create a register of multinational enterprise groups John Perry – UK ONS Barry.
Data sources of the EuroGroups Register Presentation by Eurostat
ESTAT/EFTA/UNECE high level seminar on streamlining statistical production, 7-8 July 2011 The European Statistical System’s Vision Claudia Junker, Tomasz.
Compilation of External Trade Statistics by Enterprise Characteristics: Progress Report Karo Nuortila European Commission/DG Eurostat/ Unit G3 International.
14-Sept-11 The EGR version 2: an improved way of sharing information on multinational enterprise groups.
Eurostat, Unit G-1 1 EuroGroups Register project UNECE/Eurostat/OECD June 2007 Road Map for the Future.
4° ESSnet workshop on the EuroGroups Register Development of an enhanced EGR Vision EGR version 2.0.
ESTP Course on the EGR November Validation of preliminary EGR data and files to EGR on GEG.
ESTP Course on the EGR November a. EGR Data sources.
Data sources and data on enterprise groups Sarmite Prole Head of Business Register Section Business Statics Department Central Statistical Bureau of Latvia.
M O N T E N E G R O Negotiating Team for Accession of Montenegro to the European Union Working Group for Chapter 18 – Statistics Bilateral screening: Chapter.
Statistical Business Register Enterprise Groups in Latvia Sarmite Prole Head of Business Register Section Business Statics Department Central Statistical.
EuroGroups Register Almira Hecimovic, Register Unit 1 Register Unit, Study visit Georgia
A register on Multinational Enterprise groups
European profiling and the EGR
Discussion: Timely estimates of economic indicators – Session C3 –
Business Demography Indicators for the euro area
PROFILING FOR THE EGR : THE FRENCH VIEW
How to record samples in order to make respondent burden feasible
The EuroGroups Register: a tool to share information on globalisation
8 EGR preliminary frame and validation tasks
Modernisation of European Enterprise and Trade Statistics - MEETS OECD-Working Party on International Trade in Goods and Trade in Services Statistics.
1 What is EGR? ESTP course on EGR 6-7 September 2016.
ESSnet Projects Pascal JACQUES Unit/B5 Methodology and research
11 Access to EGR applications
2. An overview of SDMX (What is SDMX? Part I)
11. Rules of consolidation
The impact of globalisation on defining the NACE code
6. EGR Identification Service
New data sources for the EuroGroups Register
Wiesbadengroup 2018, Neuchâtel (CH)
Session 7 – Eurostat 2017 SBR User Survey
The OECD Analytical Database on Individual Multinationals and Affiliates (ADIMA) SESSION 5 Diana Doyle, Fabienne Fortanier, Graham Pilgrim OECD Statistics.
Wiesbaden, 24 October, 2007 Svetlana Shutova Statistics Estonia
Multinational enterprise groups in the EU Dissemination from the EGR
A review of the 2011 census round in the EU, including the successful implementation of a detailed European legal base First meeting of the Technical Coordination.
10 EGR developments ESTP Training on EGR 6-7 September 2016.
1st Implementation Report of the Water Framework Directive
EuroGroups register First results of measures on advancement
7. Consolidation of enterprise groups
The EuroGroups Register: a tool to share information on globalisation
Data integration methods
1. Mission of EGR and legal framework
Conference on New Techniques and Technologies for official Statistics
The EuroGroups Register Agne Bikauskaite, August Götzfried
3 EGR Identification Service
European profiling and the EGR
14. Preliminary EGR and validation
7 EGR consolidation process
7 EGR initial and preliminary frames and validation tasks
ESTP course on EuroGroups Register
Presentation transcript:

Conference on New Technologies for official Statistics Open data sources for retrieving information on multinational enterprise groups Conference on New Technologies for official Statistics Brussels, 12-14 March 2019

Content What is EuroGroups Register (EGR) Short overview of DBpedia Feasibility study objectives Results for proof of concept Coverage Completeness Accuracy Timelines Conclusions

What is EGR? The EuroGroups Register (EGR) is a statistical business register of multinational enterprise groups in the EU Member States and in the EFTA countries coverage: multinational groups present in Europe, their constituent enterprises and legal units the EGR process is in operation since 2009 For statistical use only Restricted use in national statistical offices and national central banks of EU and EFTA countries

Information needed for statistical representation Legal units Unique identifiers Relationships: ownership shares / voting rights LEU A controls LEU B with x% voting rights Enterprises Economic characteristics (turnover, employment) Links to legal units Groups Group characteristics (turnover, employment) Global decision centre

Statistical representation As a complete structure of legal units and their controlling relationships and the economic enterprises Enterprise Group Enterprise 1 Enterprise 4 Enterprise 2 Enterprise 3 Enterprise 5 Head LEU A LEU E LEU D LEU C LEU B LEU F LEU G LEU I LEU H LEU J LEU K

EGR 2.0 process overview CDP EGR NSI Identification of legal units Identification service Commercial data provider – CDP (LEU,REL) Processing NSI and commercial data NSI data (LEU, REL, ENT) Consult and update preliminary frame and GEG data Initial and preliminary frames Final frame 6

Problem statement The European part of the legal units, enterprises and enterprise groups are well-covered by EGR, but there is missing data for units outside of the EU and EFTA as well as for attributes on the group level. Web crawling and different open data projects are seen as further opportunities to increase the quality of the EGR, its completeness and accuracy.

DBpedia « global and unified access to knowledge » Started in 2008 as community effort for semi-automatic knowledge extraction from Wikipedia  One of the most successful open knowledge graphs (OKG) working on https://databus.dbpedia.org  Shared effort on KG Governance, Integration, Collaboration, Curation ... Pushes societal value and data economy Maven with Git-for-data and persistent identifiers

DBpedia Extraction Framework Open source software which extracts structured semantic  data (RDF) from Wikipedia (infoboxes) in order to make it publicly available as OKG Execute sophisticated queries against Wikipedia data  Link different datasets to Wiki/DBpedia resources Example RDF Data for Siemens AG

Wikipedia Knowledge Extraction project that extracts structured data from Wikipedia (infoboxes) in order to make it publicly available  Execute sophisticated queries against Wikipedia data  Link different datasets to Wikipedia data

Feasibility study objectives The project goal was to create an interface that handles a list of groups names and returns a list of results with information on aggregate numbers for those groups. The contractor, Leipzig University, was provided with a population of 73 group names in order to design an interface that fetches search results from DBpedia.

Proof of Concept Results This Proof of Concept focused on validating the following indicators: Coverage – number of successful matched enterprise group names Completeness – number of received values for the different attributes Accuracy – quality of the returned values when compared to annual report data Timelines – availability of data for certain reference period based on EGR cycle

Coverage 2016 The searches carried out during the testing phase proved that 70 of 73 groups could be found in DBpedia. The group names used were taken from a data set received from Dun and Bradstreet covering a selection of 3000 groups addressing groups size and geographical location diversity.

Completeness 2016

Accuracy 2016: Employees

Accuracy 2016: Turnover

Accuracy 2016: Assets

Timelines: Coverage 2014 - 2017 The interface includes a historical mode that allows to retrieve data on enterprise groups even if Wikipedia data has already been updated with new data. Due to the delay with which the EGR provides data on enterprise groups this feature is essential

Conclusions The results from the feasibility study did not managed to achieve complete automation. Further steps in a prototype phase will test the possibility of making cross reference links between EGR and DBpedia in the context of automation. The highest percentage of data coverage achieved was for persons employed attribute - still below 50% (42.5%), for turnover it is 37.0% and for assets 16.4%. The retrieved data on the three parameters showed high accuracy when compared to the figures published by the groups on their websites.

Thank you!

DBpedia Information and contact https://wiki.dbpedia.org/ hellmann@informatik.uni-leipzig.de