Integrated Data Sources and methodological Issues. A Case Study from the German AFiD Project Hans - Peter Hafner NTTS 2009, Brussels February 20, 2009.

Slides:



Advertisements
Similar presentations
High Resolution studies
Advertisements

1 The Changing Fortunes of the EUs Energy Market Antony Froggatt.
Copyright © 2002 Pearson Education, Inc. Slide 1.
Chapter 9 Growth.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
1 The contribution of foreign affiliates to productivity growth: evidence from OECD countries Chiara Criscuolo Economic Analysis and Statistics Division.
1 « June, 6 and 7, 2007 Paris « Satellite Account for Education for Portugal: Implementation process and links with the National Accounts and Questionnaire.
© Statistisches Bundesamt, Federal Statistical Office of Germany, National Accounts Statistisches Bundesamt German experiences in estimating households.
MEASURING LABOUR FORCE PARTICIPATION OF WOMEN
1 Revision of SEEA 2003 State of Play – Environmental Goods and Services Sector Presentation for the London Group meeting 29 September – 3 October 2008,
Treatment of social insurance schemes in the 2008 SNA Regional Seminar on Developing a Programme for the Implementation of the 2008 SNA and Supporting.
Scope and method of pilot survey in China Yang kuan kuan Deputy director-general of office on Leading group of the Second National Economic Census under.
TRANSITION TO AN INTEGRATED SYSTEM OF BUSINESS SURVEYS THE BRAZILIAN CASE Magdalena Cronemberger Góes IBGE/Brazil Workshop on Economic Census, Beijing,
The Business Register Research, Design and Evaluation Division Statistical Institute of Jamaica.
ECONOMIC CENSUS IN MALAYSIA POST-CENSUS ACTIVITIES Presented by Norazizah Ibrahim Wong Services Statistics Division Department of Statistics, Malaysia.
Extrapolation and benchmarking Vu quang Viet UNSD consultant.
The Challenge of Integrating New Surveys into an Existing Business Survey Infrastructure Éric Pelletier Statistics Canada ICES-III Montréal, Québec, Canada.
Improved Questionnaire Design Yields Better Data: Experiences from the UKs Annual Survey of Hours and Earnings Jacqui Jones, Pete Brodie, Sarah Williams.
Innovation data collection: Methodological procedures & basic forms Regional Workshop on Science, Technology and Innovation (STI) Indicators.
Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
State of the States: STATE FISCAL OUTLOOK
1 The 2007 National Immigrant Survey (Spain) National Statistics Institute (INE) Antonio ARGÜESO director for social and demographic statistics.
Jörg Drechsler (Institute for Employment Research, Germany) NTTS 2009 Brussels, 20. February 2009 Disclosure Control in Business Data Experiences with.
Catherine Renne Insee Measuring sampling error in business surveys The case of the French monthly industry survey.
1Regional policy responses to demographic challenges, Bruxelles, January 2007 EUROSTAT regional population projections Giampaolo LANZIERI Eurostat.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Outline of talk The ONS surveys Why should we weight?
Multistage Sampling Module 3 Session 9.
Collecting data for informed decision-making
Micro Focus Research 1 As far as youre aware, how does your organization plan to drive business growth over the next three years? (Respondents' first choices)
1 WATER AUTHORITY Dr. Or Goldfarb CENTRAL BUREAU of STATISTICS Zaur Ibragimov Water Accounts in Israel Vienna January 2009.
1 Understanding Multiyear Estimates from the American Community Survey.
Child Care Subsidy Data and Measurement Challenges 1 Study of the Effects of Enhanced Subsidy Eligibility Policies In Illinois Data Collection and Measurement.
1 Wyomings Labor Market: A Brief Overview Doug Leonard, Principal Economist Wyoming Department of Workforce Services, Research & Planning
The Nature of the Bias When Studying Only Linkable Person Records: Evidence from the American Community Survey Adela Luque (U.S. Census Bureau) Brittany.
Residents’ register service under the Ministry of the Interior
GENERATIONS AND GENDER SURVEY IN RUSSIA: Parents and Children, Men and Women in Family and Society 2 nd wave IWG, 13 May, 2008 Oxana Sinyavskaya, IISP.
1 1 Plan for use of GIS in front of Census of Agriculture and Forestry 2010 Mr. Geir Inge Gundersen Senior Adviser Statistics Norway
ABC Technology Project
Real Estate Market Analysis
Industrial Statistics in Bangladesh: Current Practice Md. Shamsul Alam,Director,BBS Md. Azadul Islam,Deputy Director,BBS.
Evaluation of an intervention to increase online filing of individuals’ tax returns Peter Lumb September 2009.
Chapter 5 Test Review Sections 5-1 through 5-4.
1 First EMRAS II Technical Meeting IAEA Headquarters, Vienna, 19–23 January 2009.
Addition 1’s to 20.
25 seconds left…...
COMPENSATION AND BENEFITS STUDY March 21, CONTENTS.
Annual Industry Accounts Overview George Smith & Nicole Mayerhauser Current Industry Analysis Division Bureau of Economic Analysis Industry Accounts Users’
The french population at 1st january 2004 Aguascalientes 4 – 6 july 2005.
Week 1.
We will resume in: 25 Minutes.
Supply Chain Performance Measurement
1 ADJUSTMENTS OF DEDUCTION. 2 BASICS 3 Periods of adjustment and method Periods of adjustment –Five years (periods of reference in Malta) for capital.
Use of administrative data in statistics - challenges and opportunities ICES III End Panel Discussion Montreal, June 2007 Heli Jeskanen-Sundström Statistics.
© Federal Statistical Office, Research Data Centre, Maurice Brandt Folie 1 Analytical validity and confidentiality protection of anonymised longitudinal.
Copyright 2010, The World Bank Group. All Rights Reserved. Sources of Agricultural Data Section A 1.
Georgia: business register data and gender-disaggregated indicators Tengiz Tsekvava Technical Meeting on Measuring Entrepreneurship from Gender Perspective.
Some aspects concerning analytical validity and disclosure risk of CART generated synthetic data Hans-Peter Hafner and Rainer Lenz Research Data Centre.
IAB homepage: Institut für Arbeitsmarkt- und Berufsforschung/Institute for Employment Research A New Approach for Disclosure Control in the.
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
Access to environmental microdata in Germany IAOS Conference, Chile, 2010 Markus Zwick Federal Statistical Office Germany.
1Your reference The Menu of Indicators and the Core Set from the South African Point of View Moses Mnyaka 13/08/2009.
Why should official statistics care about data integration? First experiences in linking microdata in Germany Christopher Gürke RDC of the Federal Statistical.
Quality Aspects and Approaches in Business Statistics
Workshop on Reporting Environmental Goods and Services
Presentation transcript:

Integrated Data Sources and methodological Issues. A Case Study from the German AFiD Project Hans - Peter Hafner NTTS 2009, Brussels February 20, 2009

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 2 Overview The AFiD Project Data Products Methodological Issues Case Study: Longitudinal Weighting Factors for the AFiD – Panel Services

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 3 AFiD AFiD = Amtliche Firmendaten für Deutschland (Official Business Data for Germany) Project of the Research Data Centre of the statistical offices of the Länder of Germany (since 2007) Goal: Generation of integrated data sources Method: Matching of business and environmental surveys Legal basis: §13a BStatG (German Federal Statistics Law)  Added in 2005

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 4 AFiD Data access for universities and similar scientific institutions  Safe scientific work stations in the statistical offices   Controlled Remote Data

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 5 AFiD – Products AFiD - Panel Industrial Plants All essential information about local units with focus of economic activity in the manufacturing sector  sector of economic activity, number of employees, turnover, investment, working hours and earnings. Combination of full surveys with a cut-off of 20 employees. Additional until 2002: Small Firm Survey with data on local units having less than 20 employees. From 1995 to Actual data will be added.

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 6 AFiD – Products AFiD - Panel Industrial Enterprises Information about enterprises with focus of economic activity in the manufacturing sector  sector of economic activity, number of employees, turnover, investment, working hours and earnings, cost structure. Combination of full surveys with a cut-off of 20 employees and a sample survey (cost structure survey, contains around 45% of the enterprises). Complements the Plants panel. All fields of activity of the enterprise are included, not only those in the manufacturing sector. No back- calculation of values for the local units possible.

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 7 AFiD – Products AFiD - Panel Agriculture Farm level data of the agricultural census 1999 and the agricultural structure services 2003 and 2007  Agricultural production, land use and livestock, legal form, property and tenure situation and manpower Germany 1999: About half a million farms cultivate 17,1 millions hectares of cropland with 1,4 million full- or part-time farmers 2007: Number of farms had reduced to about Scientific interest: Analyzing the determinants of this development Political interest due to high subsidy payments

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 8 AFiD – Products AFiD - Panel Services Recently many enterprise foundations in the services sector  Sector become more and more important for the national economics Panel consists of the structure survey for the services sector for 2003 to 2005 (sample of around 15% of the enterprises) Information about the number of employees, the earnings, the turnover, the taxes, the subsidies of the enterprise Longitudinal dataset permits for example the analysis of growth processes

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 9 AFiD – Products AFiD – Module Earnings Structure of Earnings Survey (SES) for the years 2001 and 2006 SES is a linked employer employee dataset Employees: Socio-demographic attributes (sex, year of birth), working hours and earnings, specification which collective or enterprise agreement is applied Local unit or enterprise to which it belongs: information about the economic sector, the influence of the public sector, number of employees Analysis of gender-specific earnings differences and differences between local units tied to labor contracts and units that remunerate by individual agreement Can be merged with the AFiD - Panel Industrial Plants

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 10 AFiD – Products AFiD – Module Environment Module combines surveys about water supply and waste water disposal, investment for environment protection and good for environment protection Can be merged with the AFiD - Panel Industrial Plants  simultaneous analysis of economic and environmental relevant factors

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 11 Methodological Issues Inconsistencies concerning the Identifiers Identifiers: Numbers of the plant respective the enterprise (business register) Problems:  Merger of enterprises: New enterprise receives one of the numbers of the old companies.  Splitting up of an enterprise: One part retains the old number.  Change of form of organization or headquarter moves to another federal state: Enterprise gets new number.  No or no proper matching possible

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 12 Methodological Issues Implausibilities when matching Data from different Surveys Several attributes are contained in different surveys, for example number of employees and turnover, but the definitions or the date of report can differ  Check of definitions and reporting dates may help to clear implausibilities

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 13 Methodological Issues Diverse Sampling Frames and Sampling Units AFiD combines complete surveys with a cut-off and sample surveys Compare results of analyses including all units with results obtained by using the units of the sample (check whether there is a selectivity problem)

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 14 Methodological Issues Missing Values Unit does not answer a question: Rare in official statistics since for the most surveys there is an obligation to give information Unit does not participate in the survey: Values can be imputed. Until now no imputations are planned by the research data centre.

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 15 Methodological Issues Panel Attrition and longitudinal Weighting Factors See the following detailed example.

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 16 Case Study: Construction of longitudinal Weighting Factors for the AFiD – Panel Services Motivation Surveys that are conducted in regular intervals allow for analyses of every cross – section and – as far as some units participate in more than one wave – the longitudinal section. Units for which we have valid data for a wave and which belong to the population of interest at the reference date of the wave, represent the population and they build the basis for the cross – sectional weighting. In the case of longitudinal sections it is not always clear, what the population is. This depends on the specific scientific question.

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 17 Case Study: Construction of longitudinal Weighting Factors for the AFiD – Panel Services Kinds of Attrition False attrition: The unit exists no longer or it belongs no longer to the population of the survey, for example because of a change of the focus of the economic activity. True attrition: The unit does not answer (refusal or moved to a new address which is unknown). The true attriters have to be compensated by increasing the weighting factors of the units with valid data.  Additional factor ni / (ni - ai,e) ni number of sample units in strata i ai,e number of true attriters in strata i

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 18 Case Study: Construction of longitudinal Weighting Factors for the AFiD – Panel Services Methodology of the Structure Survey for the Services Sector Twofold stratified sample survey: 68 strata of economic activity and 12 strata of turnover classes. Sample unchanging for some years, at most 15% of all enterprises. To compensate for attrition: Additional new entrant sample every year (layered by 22 groups of economic activity and 8 classes of turnover).

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 19 Case Study: Construction of longitudinal Weighting Factors for the AFiD – Panel Services Panel attrition from 2003 to

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 20 Case Study: Construction of longitudinal Weighting Factors for the AFiD – Panel Services Problems:  Number of stratum not included in original dataset (additional reference file had to be ordered by all statistical offices of the federal states -> delay of the project)  Strata of the original sample and the new entrant sample had to be harmonized

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 21 Case Study: Construction of longitudinal Weighting Factors for the AFiD – Panel Services Plan for the construction of the weighting factors : Basis weight:  Number N of all enterprises that have existed in at least one of the years 2003 to 2005 for every stratum  Basis weight = N divided by the number of enterprises of the stratum contained in the sample for at least one year Additional factor:  Calculate separate factors for 2003 to 2005 to compensate the true attriters  Compute the average of the factors

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 22 Case Study: Construction of longitudinal Weighting Factors for the AFiD – Panel Services Outlook Test and modification of the concept Enhancement of the panel for the year 2006

© Statistische Ämter der Länder, ForschungsdatenzentrumFolie 23 Contact Statistical Offices of the Länder Research Data Centre Wiesbaden Hans-Peter Hafner Phone Thank you for your attention!