27 June 2007 QMSS CONFERENCE PRAGUE 1 European statistical microdata bases: What form of access for social science researchers? Michel GLAUDE Director “Social and information society statistics” EUROSTAT
27 June 2007 QMSS CONFERENCE PRAGUE 2 Eurostat Free dissemination policy Since 1 October 2004, all European non confidential statistical data has been available free of charge on the Eurostat website… …But Social science researchers need to have access to microdata
27 June 2007 QMSS CONFERENCE PRAGUE 3 Eurostat data collection policy: To develop transmission of harmonised microdata to Eurostat To better answer users’ needs To produce and disseminate more analytical studies To alleviate and simplify the burden for NSIs To increase the central access for researchers to these “Gold mines”
27 June 2007 QMSS CONFERENCE PRAGUE 4 Microdata bases for access At present: The Labour Force Survey (LFS) The Community Statistics on Income and Living Conditions (EU-SILC) and the former ECHP The Community Innovation Statistics (CIS) The Continuing Vocational Training Survey (CVTS) In a near future: The Structure of Earnings Survey (SES) The Adult Education survey (AES)
27 June 2007 QMSS CONFERENCE PRAGUE 5 The Labour Force Survey (LFS) Main quarterly European data source for unemployment, employment, working conditions, education, occupation… Yearly ad-hoc modules All European Economic Area countries (except Malta) plus Bulgaria and Romania in June 2007 (2006 data) individuals par Quarter for the whole EU More than 100 variables plus 11 for each ad-hoc module In June Y release of data for the reference years from 1983 to Y-1 for 8000 € and 1000 € for each following year In March/April Y release of the ad-hoc module of year Y-2 Regulations 2257/2003, 430/2005, 246/2003
27 June 2007 QMSS CONFERENCE PRAGUE 6 The Community Statistics on Income and Living Conditions (EU-SILC)… Main annual European data source for income, poverty, social exclusion and living conditions (housing, health, education, child care…) started in 2004 Yearly ad-hoc modules: Transmission of poverty (2005), Housing conditions (2006), Social participation (2007)… All 25 MS plus Norway and Iceland for SILC individuals and households for the SILC variables at household level and 85 at individual level Cross sectional data for year Y-2 (income reference year Y-3) released in March Y for 500 € and 250 € for each following year Longitudinal data (4 years trajectories) released in September Y Regulations 1177/2003, 1980/2003, 1983/2003…
27 June 2007 QMSS CONFERENCE PRAGUE 7 …and the former European Community Household Panel (ECHP ) Annual panel survey covering income, working life, housing situation, social relations, health and biographical information Community Survey covering 14 Member States from 1994 to adults (16+) and households interviewed every year € for the whole anonymised microdata set (8 waves) Gentlemen’s agreement basis
27 June 2007 QMSS CONFERENCE PRAGUE 8 The Community Innovation Statistics (CIS) Main data source for measuring innovation in Europe covering basic information of the enterprise, product and process innovation, innovation activity and expenditure, effects of innovation… 27 MS plus Iceland, Norway and Turkey for CIS4 (launched in 2005, reference period 2004 with observation period 2002 to 2004) enterprises (10 employees +) from industry and services 120 variables collected Specific microdata anonymisation method In Autumn 2007 release of CIS4 anonymised microdata for 20 countries In mid 2006 release of CIS3 (reference period 2000 or 2001) anonymised microdata for 15 EEA countries for 200 € Access through the SAFE Center in Eurostat to CIS3 Confidential microdata for 25 € per day Regulation 1450/2004
27 June 2007 QMSS CONFERENCE PRAGUE 9 The Continuing Vocational Training Survey (CVTS) Main data source for describing the strategies of enterprises with respect to training of their staff (Participation rates, volumes, costs…) 27 MS plus Norway for CVTS3 (launched in 2006, reference period 2005) enterprises (10 employees +) from industry and services 160 variables collected In spring 2008 release of CVTS3 anonymised microdata Regulations 1552/ /2006
27 June 2007 QMSS CONFERENCE PRAGUE 10 The Structure of Earnings Survey (SES) Main data source for detailed information on the level of remuneration, individual characteristics of employees and of their employer (every 4 years) 27 MS plus Norway and Iceland for SES employees from enterprises (10 employees +) belonging to NACE C to K (C to O from 2006 onwards) SES 2002: CD-Rom dissemination in September 2007 SES 2002: Safe Center access in June 2007 (including SES1995) SES 2006: Safe Center access and CD-Rom dissemination expected January 2009 Regulations 1552/ /2006
27 June 2007 QMSS CONFERENCE PRAGUE 11 The Adult Education survey (AES) Main data source for the participation of adult to formal, non formal education and training as well as to informal training. 27 MS plus Norway, Turkey and Switzerland for the first data collection ( ) individuals plus countries extensions: Poland , Italy Maximum 170 variables collected. Non core modules are not taken by all Member States Timeliness of access to researchers not yet specified. Microdata will arrive at Eurostat over the period Gentlemen’s agreement basis
27 June 2007 QMSS CONFERENCE PRAGUE 12 Procedure for access Provision of Regulation 831/2002 Anonymised microdata sets on CD-Rom SAFE Center access at Eurostat premises Practical arrangements: The “fast track” procedure See Eurostat Website Costs have been reduced
27 June 2007 QMSS CONFERENCE PRAGUE 13 Future Developments: New data bases to be collected by Eurostat The Information and Communication Technology Survey (ICT household) The European Health Interview Survey (EHIS)
27 June 2007 QMSS CONFERENCE PRAGUE 14 Future Developments: The European System of Social statistical Survey Modules A flexible tool to quickly answer to new or existing policy demands Aiming at rationalising the household surveys data collection system A mixed structure composed of : - annual regular modules (Core variables, ICT, Health indicators) - irregular modules (victimisation, consumer protection, fertility, youth…) individuals for the EU-27 according to country sizes Results within 2 years after the demand is expressed Microdata at European level First implementation in 2010?
27 June 2007 QMSS CONFERENCE PRAGUE 15 Future Developments: Methodological improvements A common set of “Core variables” The new European Socio-economic Classification (ESeC)
27 June 2007 QMSS CONFERENCE PRAGUE 16 Future Developments: Improving access for researchers Revision of the “Statistical law” Regulation 322/97 Creation of “Public use Files” Harmonising methods for anonymisation Developing “Remote access” Extending SAFE centers in NSIs? Giving better access for Data Archive centers?
27 June 2007 QMSS CONFERENCE PRAGUE 17 How to develop collaboration… Between European social science research and Eurostat and NSIs ?