Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

Presentation transcript:

Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada June 18-21, 2007

2 Outline
- Overview of the UES
- Characteristics of the target population
- Current use of tax data
  - At sampling
  - At imputation
  - At estimation
- Issues and Challenges
- Towards a better use of tax data
- Conclusion

3 Overview of the UES
- Unified Enterprise Survey (UES) started in 1997
- Objectives
  - Integrate all annual business surveys into one unified survey framework
  - To produce quality financial and commodity estimates
    - National and sub-national levels
    - Industrial levels

4 Overview of the UES
- Target population
  - All Canadian businesses within the covered industries
  - The UES is an establishment-based survey
- Coverage over time
  - 1997: Seven industries
  - 1998: Sixteen more (including Wholesale)
  - 1999: Four more (including Retail)
  - 2000: Four more (including Manufacturing)
  - ...
  - 2007: Now covers over 60 major industries

5 Characteristics of the Target Population
- Divided into two main types of businesses: unincorporated (T1) and incorporated (T2)
- General Index of Financial Information (GIFI) data are available electronically for the entire T2 population
- T1 data are only available electronically for about half the T1s (e-filers)

6 Characteristics of the Target Population
- An enterprise is
  - Complex: Multi-provincial and/or Multi-industry and/or Multi-legal
  - Simple: The opposite
- An enterprise is also
  - Single: Only one establishment
  - Multi: More than one establishment
- Simple-Single enterprises represent about 95% of the population, although only about 40% of the economy

7 Current Use of Tax Data
- Why would someone use tax data?
  - Improve efficiency of the sample design
  - Reduce the response burden
  - Reduce the collection cost
  - Improve quality of the estimates

8 Current Use of Tax Data
- At sampling
  - Some key variables taken from different tax files are put on the sampling frame
    - Total Revenue, Total Expenses from GIFI
    - Total Sales from Goods & Services Tax (GST)
    - Salaries & Wages, # Employees from Payroll Deductions (PD7)
  - Used to define a size measure (Total Revenue) for each establishment on the frame
  - Used to stratify the population by size and to define the Take-None (T-N) portion
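As a rough illustration of the last two points, the sketch below assigns each frame unit either to the Take-None portion or to a size stratum based on its Total Revenue. The function name, the exclusion threshold of 30,000 and the single size boundary of 1,000,000 are hypothetical values chosen for the example; the slides do not give the actual thresholds.

```python
from bisect import bisect_right

def assign_stratum(total_revenue, exclusion_threshold, size_bounds):
    """Return 'take-none' or a size-stratum index (0 = smallest size class).

    exclusion_threshold and size_bounds are illustrative assumptions,
    not the actual UES parameters.
    """
    if total_revenue < exclusion_threshold:
        return "take-none"
    # Count how many size boundaries the unit's revenue has reached.
    return bisect_right(size_bounds, total_revenue)

# Hypothetical frame: establishment id -> Total Revenue (size measure).
frame = {"A": 20_000, "B": 250_000, "C": 4_000_000}
strata = {unit: assign_stratum(rev, 30_000, [1_000_000])
          for unit, rev in frame.items()}
print(strata)
```

Here unit A falls below the exclusion threshold, while B and C land in the smaller and larger size strata respectively.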

9 Current Use of Tax Data
- At imputation
  - Used to replace survey data (financial variables) for a predetermined sub-sample of selected Simple-Single units
  - Also used to replace survey data for some non-respondents
  - Used as auxiliary data during imputation

10 Current Use of Tax Data
- At estimation
  - GIFI data are used to produce estimates for all T2 units falling in the T-N portion
  - T1 e-filer data are used to produce estimates for all T1 units falling in the T-N portion

11 UES Survey Design at a Glance
[Diagram: the T2 and T1 populations, each split by the EXCLUSION THRESHOLD into a main sample and a Take-None portion]
- T2
  - Main sample to be surveyed
  - T2 Take-None: Census of GIFI
- T1
  - Main sample to be surveyed
  - T1 Take-None: Sample of e-filers
- Main sample
  - Not eligible for tax: full questionnaire
  - Tax replaced: Characteristic questionnaire (services surveys) or full questionnaire (other surveys)
- For variables available from tax: Total estimate = Survey estimate (T1, T2) + T2 Take-None + T1 Take-None e-filer estimate
- For variables not available from tax (Characteristics): Total estimate = Survey estimate (T1, T2)
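The two total-estimate formulas on this slide can be written as a small function. This is only a sketch: the function and argument names are made up for the example, and the figures are invented.

```python
def total_estimate(survey_estimate, t2_take_none, t1_take_none_efiler,
                   available_from_tax=True):
    """Combine the components of a UES total as described on the slide.

    For variables not available from tax (characteristics), only the
    survey estimate contributes, so the Take-None portion is not covered.
    """
    if available_from_tax:
        return survey_estimate + t2_take_none + t1_take_none_efiler
    return survey_estimate

# Invented figures, for illustration only.
financial_total = total_estimate(900.0, 60.0, 40.0)
characteristic_total = total_estimate(900.0, 60.0, 40.0,
                                      available_from_tax=False)
print(financial_total, characteristic_total)
```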

12 Issues and Challenges
- At sampling
  - Sometimes we get inconsistent tax data
    - Ex.: GIFI Total Revenue = $2M, GST Total Sales = $25M
  - What do we do?
    - We use a conservative approach, i.e. we take the maximum
    - We manually verify and adjust the extreme cases (we'll make use of survey data if available)
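A minimal sketch of the conservative rule, using the slide's own example figures. The function name and the `review_ratio` cut-off used to flag "extreme" cases for manual verification are assumptions made for illustration; the slides do not say how extreme cases are identified.

```python
def size_measure(gifi_revenue, gst_sales, review_ratio=5.0):
    """Return (size measure, needs_manual_review).

    Takes the maximum of the available positive values. review_ratio is a
    hypothetical cut-off: units whose two sources disagree by more than
    this factor are flagged for manual verification.
    """
    values = [v for v in (gifi_revenue, gst_sales) if v is not None and v > 0]
    if not values:
        return None, False
    needs_review = len(values) == 2 and max(values) / min(values) > review_ratio
    return max(values), needs_review

# Slide example: GIFI Total Revenue = $2M, GST Total Sales = $25M.
size, flagged = size_measure(2_000_000, 25_000_000)
print(size, flagged)
```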

13 Issues and Challenges
- At sampling (cont'd)
  - Sometimes all we get is # Employees or Salaries & Wages (Revenues missing or $0)
  - What do we do?
    - We model Total Revenue using what's available
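The slides do not say which model is used. As one possible illustration, the sketch below assumes a simple ratio model: the revenue-to-salaries ratio is estimated from units that report both, then applied to units with missing or zero revenue. All names and figures are hypothetical.

```python
def fit_ratio(units):
    """Ratio of summed revenue to summed salaries among units reporting both."""
    complete = [(u["salaries"], u["revenue"])
                for u in units if u["revenue"] and u["salaries"]]
    return sum(r for _, r in complete) / sum(s for s, _ in complete)

def model_revenue(unit, ratio):
    """Keep reported revenue; otherwise predict it from Salaries & Wages."""
    if unit["revenue"]:
        return unit["revenue"]
    return ratio * unit["salaries"]

# Invented frame records; the last unit has no usable revenue.
frame_units = [
    {"salaries": 100_000, "revenue": 400_000},
    {"salaries": 300_000, "revenue": 1_200_000},
    {"salaries": 50_000, "revenue": None},
]
ratio = fit_ratio(frame_units)
modelled = [model_revenue(u, ratio) for u in frame_units]
print(ratio, modelled)
```

In practice such a ratio would more plausibly be fitted within industry or size groups rather than over the whole frame.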

14 Issues and Challenges
- At imputation
  - Sometimes we can't find the link to tax data (ex.: not-for-profit organizations)
  - Sometimes we link to 2 or more tax files
  - We currently use direct tax replacement (i.e. Y_survey = X_tax). Should we instead use a modelling approach (i.e. Y_survey = f(X_tax))?
  - Studies have shown that in some cases it might be more appropriate to use f(X)
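To make the contrast concrete, the sketch below compares direct replacement with one possible choice of f: a straight line fitted by ordinary least squares on units where both the survey and tax values are observed. The linear form and the data are assumptions for illustration; the slides do not specify f.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

# Invented pairs from units with both a tax value and a survey response.
tax = [100.0, 200.0, 300.0]
survey = [120.0, 220.0, 320.0]
intercept, slope = fit_line(tax, survey)

x_tax = 250.0                          # tax value of a unit to be imputed
direct = x_tax                         # direct replacement: Y_survey = X_tax
modelled = intercept + slope * x_tax   # modelled: Y_survey = f(X_tax)
print(direct, modelled)
```

With these made-up figures the survey values sit systematically above the tax values, so direct replacement would understate the unit's value while the fitted line corrects for the offset.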

15

16 Issues and Challenges
- At estimation
  - Currently, we use the one-phase Horvitz-Thompson estimator
  - It's a very simple and fairly efficient estimator
  - Unfortunately, it could be severely biased if the model y = x doesn't hold
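For reference, the Horvitz-Thompson estimator of a total weights each sampled value by the inverse of its inclusion probability. The sketch below uses invented values and inclusion probabilities to show how the estimate shifts when tax values x stand in for survey values y and the two differ.

```python
def horvitz_thompson_total(values, inclusion_probs):
    """Horvitz-Thompson estimate of a total: sum of y_i / pi_i over the sample."""
    return sum(y / p for y, p in zip(values, inclusion_probs))

# Invented sample: survey values y, tax values x, inclusion probabilities.
probs = [0.5, 0.5, 1.0]
y_survey = [10.0, 20.0, 30.0]
x_tax = [9.0, 18.0, 27.0]   # tax values systematically below survey values

estimate_from_survey = horvitz_thompson_total(y_survey, probs)
estimate_from_tax = horvitz_thompson_total(x_tax, probs)
print(estimate_from_survey, estimate_from_tax)
```

If every sampled unit were tax replaced, the estimate would inherit the full gap between x and y, which is the bias the slide warns about when y = x does not hold.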

17 Issues and Challenges
- At estimation (cont'd)
  - Estimates for variables not available from the tax file (characteristics/commodity) do not cover the T-N portion
  - For some characteristics the T-N portion can account for a lot more than 10%

18 Issues and Challenges
- Data quality
  - Response rates (What is a respondent?)
    - Respond to tax but not to the characteristic questionnaire
    - Reported tax data vs imputed tax data
    - Planned tax replacement vs tax replacement for non-response
  - Variance & CV
    - A lot of imputation occurs in the current strategy (incl. tax replacement)
    - Shouldn't we include the variance due to imputation?

19 Towards a Better Use of Tax Data
- Understand the particularities of the different tax data sources (ex.: GST vs T2 is currently under investigation)
- Explore different administrative files to help with particular sub-populations (ex.: not-for-profit organizations)

20 Towards a Better Use of Tax Data
- Keep investigating why Y_survey ≠ X_tax even when they should conceptually be equal
- Explore the idea of using Y_survey = f(X_tax)
- Fine-tune our definition of who is eligible for tax replacement and who is not
- Currently studying the possibility of using a more robust estimator to protect against the potential bias
- Developing a strategy to cover the entire population for all variables of interest

21 Towards a Better Use of Tax Data
- Start taking into account the variability introduced by imputation when computing variances and CVs
- A framework is under development to define response rates when both tax data and survey data are used for the same units
- Explore the possibility of making use of all the GIFI data, not only for the T-N and the sample

22 Towards a Better Use of Tax Data
[Diagram: the survey design from slide 11, with additional labels "Eligible" and "Ineligible"]
- T2
  - Main sample to be surveyed
  - T2 Take-None: Census of GIFI
- T1
  - T1 Take-None: Sample of e-filers
- Main sample
  - Not eligible for tax: full questionnaire
  - Tax replaced: Characteristic questionnaire (services surveys) or full questionnaire (other surveys)
- For variables available from tax: Total estimate = Survey estimate (T1, T2) + T2 Take-None + T1 Take-None e-filer estimate
- For variables not available from tax (Characteristics): Total estimate = Survey estimate (T1, T2)

23 Conclusion
- Since the introduction of the UES, the use of tax data has increased consistently
- It has significantly reduced response burden and the cost of the survey
- Unfortunately, sometimes at the expense of reduced data interpretability
- Fortunately, it was recently decided that we would take a few steps back to evaluate how we currently do things, and to determine how we could improve our strategy

For more information, please contact Claude Turmelle (613)